Unsupervised Segmentation Helps Supervised Learning of Character Tagging for Word Segmentation and Named Entity Recognition

Hai Zhao, Chunyu Kit

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

104 Citations (Scopus)

Abstract

This paper describes a novel character tagging approach to Chinese word segmentation and named entity recognition (NER) for our participation in Bakeoff-4.1 It integrates unsupervised segmentation and conditional random fields (CRFs) learning successfully, using similar character tags and feature templates for both word segmentation and NER. It ranks at the top in all closed tests of word segmentation and gives promising results for all closed and open NER tasks in the Bakeoff. Tag set selection and unsupervised segmentation play a critical role in this success. © IJCNLP 2008. All rights reserved.
Original languageEnglish
Title of host publicationIJCNLP 2008 - Sixth SIGHAN Workshop on Chinese Language Processing - Proceedings of the Workshop
PublisherAssociation for Computational Linguistics
Pages106-111
Publication statusPublished - 11 Jan 2008
Event6th SIGHAN Workshop on Chinese Language Processing (SIGHAN 2008) - Hyderabad, India
Duration: 11 Jan 200812 Jan 2008
https://aclanthology.org/volumes/I08-4/

Publication series

NameSIGHAN - SIGHAN Workshop on Chinese Language Processing, co-located with International Joint Conference on Natural Language Processing, IJCNLP

Conference

Conference6th SIGHAN Workshop on Chinese Language Processing (SIGHAN 2008)
Abbreviated titleSIGHAN-6
Country/TerritoryIndia
CityHyderabad
Period11/01/0812/01/08
Internet address

Fingerprint

Dive into the research topics of 'Unsupervised Segmentation Helps Supervised Learning of Character Tagging for Word Segmentation and Named Entity Recognition'. Together they form a unique fingerprint.

Cite this