Term recognition using conditional random fields

Xing Zhang, Yan Song, Alex Chengyu Fang

    Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

    13 Citations (Scopus)

    Abstract

    A machine learning framework, Conditional Random fields (CRF), is constructed in this study, which exploits syntactic information to recognize biomedical terms. Features used in this CRF framework focus on syntactic information in different levels, including parent nodes, syntactic functions, syntactic paths and term ratios. A series of experiments have been done to study the effects of training sizes, general term recognition and novel term recognition. The experiment results show that features as syntactic paths and term ratios can achieve good precision of term recognition, including both general terms and novel terms. However, the recall of novel term recognition is still unsatisfactory, which calls for more effective features to be used. All in all, as this research studies in depth the uses of some unique syntactic features, it is innovative in respect of constructing machine learning based term recognition system. ©2010 IEEE.
    Original languageEnglish
    Title of host publicationProceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2010
    DOIs
    Publication statusPublished - 2010
    Event6th International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2010 - Beijing, China
    Duration: 21 Aug 201023 Aug 2010

    Conference

    Conference6th International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2010
    Country/TerritoryChina
    CityBeijing
    Period21/08/1023/08/10

    Research Keywords

    • Conditional random fields
    • General term
    • Novel term
    • Syntactic function
    • Term recognition
    • Tracking

    Fingerprint

    Dive into the research topics of 'Term recognition using conditional random fields'. Together they form a unique fingerprint.

    Cite this