Abstract
In this paper. we present a novel methodology to enhance Chinese test chunking with the aid of transductive Hidden Markov Models (transductive HMMs, henceforth). We consider chunking as a special tagging problem and attempt to utilize, via a number of transformation functions, as much relevant contextual information as possible for model training. These functions enable the models to make use of contextual information to a greater extent and keep us away from costly changes of the original training and tagging process. Each of them results in an individual model with certain pros and cons. Through a number of esperiments, we succeed in integrating the best two models into a significantly better one. We carry out the chunking experiments on the HIT Chinese Treebank corpus. Experimental results show that it is an effective approach, achieving an F score of 82.38%.
| Original language | English |
|---|---|
| Title of host publication | NLP-KE 2003 - 2003 International Conference on Natural Language Processing and Knowledge Engineering, Proceedings |
| Publisher | IEEE |
| Pages | 257-262 |
| ISBN (Print) | 0780379020, 9780780379022 |
| DOIs | |
| Publication status | Published - 2003 |
| Event | International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2003 - Beijing, China Duration: 26 Oct 2003 → 29 Oct 2003 |
Conference
| Conference | International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2003 |
|---|---|
| Place | China |
| City | Beijing |
| Period | 26/10/03 → 29/10/03 |
Research Keywords
- Text chunking
- Transductive HMM
Fingerprint
Dive into the research topics of 'Transductive HMM based Chinese text chunking'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver