Skip to main navigation Skip to search Skip to main content

A Study on Automatic Extraction of New Terms

Xing Zhang, Alex Chengyu Fang

    Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

    Abstract

    This research explores to automatically predict new terms based on linguistic features and statistical behaviors of noun phrases during a special period. It integrates both syntactic function value and TF-IDF value into an automatic term extraction system to weight new term candidates. Research questions include: what are the linguistic and statistic properties of new terms during a special period? Will linguistic features contribute to prediction of new terms? And will statistic features like, TFIDF Value contribute to prediction of new terms? Correspondingly, a series of experiments are conducted on medical corpus to examine a group of new terms' distribution properties and syntactic features across two years in comparison. The results show there does exist significant difference between two groups of values. Regardless of this limitation, this research is meaningful as it attempts to realize automation of selection process of new medical terms, which will greatly avoid subjective decisions and reduce experts' workloads.
    Original languageEnglish
    Title of host publicationProceedings - 2011 4th International Symposium on Knowledge Acquisition and Modeling, KAM 2011
    Pages599-602
    DOIs
    Publication statusPublished - Oct 2011
    Event2011 4th International Symposium on Knowledge Acquisition and Modeling (KAM 2011) - Sanya, China
    Duration: 8 Oct 20119 Oct 2011

    Conference

    Conference2011 4th International Symposium on Knowledge Acquisition and Modeling (KAM 2011)
    PlaceChina
    CitySanya
    Period8/10/119/10/11

    Research Keywords

    • New term
    • old term
    • SF-Value
    • term extraction
    • TFIDF

    Fingerprint

    Dive into the research topics of 'A Study on Automatic Extraction of New Terms'. Together they form a unique fingerprint.

    Cite this