TY - JOUR
T1 - Enhanced language modelling with phonologically constrained morphological analysis
AU - Fang, A. C.
AU - Huckvale, M.
PY - 2000
Y1 - 2000
N2 - Phonologically constrained morphological analysis (PCMA) is the decomposition of words into their component morphemes conditioned by both orthography and pronunciation. This article describes PCMA and its application in large-vocabulary continuous speech recognition to enhance recognition performance in some tasks. Our experiments, based on the British National Corpus and the LOB Corpus for training data and WSJCAM0 for test data, show clearly that PCMA leads to smaller lexicon size, smaller language models, superior word lattices and a decrease in word error rates. PCMA seems to show most benefit in open vocabulary tasks, where the productivity of a morph unit lexicon makes a substantial reduction in out-of-vocabulary rates.
AB - Phonologically constrained morphological analysis (PCMA) is the decomposition of words into their component morphemes conditioned by both orthography and pronunciation. This article describes PCMA and its application in large-vocabulary continuous speech recognition to enhance recognition performance in some tasks. Our experiments, based on the British National Corpus and the LOB Corpus for training data and WSJCAM0 for test data, show clearly that PCMA leads to smaller lexicon size, smaller language models, superior word lattices and a decrease in word error rates. PCMA seems to show most benefit in open vocabulary tasks, where the productivity of a morph unit lexicon makes a substantial reduction in out-of-vocabulary rates.
UR - http://www.scopus.com/inward/record.url?scp=0033676890&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-0033676890&origin=recordpage
U2 - 10.1109/ICASSP.2000.862081
DO - 10.1109/ICASSP.2000.862081
M3 - RGC 22 - Publication in policy or professional journal
SN - 0736-7791
VL - 3
SP - 1711
EP - 1714
JO - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
JF - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
T2 - 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing
Y2 - 5 June 2000 through 9 June 2000
ER -