TY - JOUR
T1 - A comparison of lexeme and speech syllables in Dutch
AU - Schiller, Niels O.
AU - Meyer, Antje S.
AU - Baayen, R. Harald
AU - Levelt, Willem J. M.
N1 - Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].
PY - 1996
Y1 - 1996
N2 - The CELEX lexical database includes a list of Dutch syllables and their frequencies, based on syllabification of isolated word forms. In connected speech, however, sentence-level phonological rules can modify the syllables and their token frequencies. In order to estimate the changes syllables may undergo in connected speech, an empirical investigation was carried out. A large Dutch text corpus (TROUW) was transcribed, processed by word level rules, and syllabified. The resulting lexeme syllables were evaluated by comparing them to the CELEX lexical database for Dutch. Then additional phonological sentence-level rules were applied to the TROUW corpus, and the frequencies of the resulting connected speech syllables were compared with those of the lexeme syllables from TROUW. The overall correlation between lexeme and speech syllables was very high. However, speech syllables generally had more complex CV structures than lexeme syllables. Implications of the results for research involving syllables are discussed. With respect to the notion of a mental syllabary (a store for precompiled articulatory programs for syllables, see Levelt & Wheeldon, 1994) this study revealed an interesting statistical result. The calculation of the cumulative syllable frequencies showed that 85% of the syllable tokens in Dutch can be covered by the 500 most frequent syllable types, which makes the idea of a syllabary very attractive. © Swets & Zeitlinger.
AB - The CELEX lexical database includes a list of Dutch syllables and their frequencies, based on syllabification of isolated word forms. In connected speech, however, sentence-level phonological rules can modify the syllables and their token frequencies. In order to estimate the changes syllables may undergo in connected speech, an empirical investigation was carried out. A large Dutch text corpus (TROUW) was transcribed, processed by word level rules, and syllabified. The resulting lexeme syllables were evaluated by comparing them to the CELEX lexical database for Dutch. Then additional phonological sentence-level rules were applied to the TROUW corpus, and the frequencies of the resulting connected speech syllables were compared with those of the lexeme syllables from TROUW. The overall correlation between lexeme and speech syllables was very high. However, speech syllables generally had more complex CV structures than lexeme syllables. Implications of the results for research involving syllables are discussed. With respect to the notion of a mental syllabary (a store for precompiled articulatory programs for syllables, see Levelt & Wheeldon, 1994) this study revealed an interesting statistical result. The calculation of the cumulative syllable frequencies showed that 85% of the syllable tokens in Dutch can be covered by the 500 most frequent syllable types, which makes the idea of a syllabary very attractive. © Swets & Zeitlinger.
UR - http://www.scopus.com/inward/record.url?scp=0002523902&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-0002523902&origin=recordpage
U2 - 10.1080/09296179608590060
DO - 10.1080/09296179608590060
M3 - RGC 21 - Publication in refereed journal
SN - 0929-6174
VL - 3
SP - 8
EP - 28
JO - Journal of Quantitative Linguistics
JF - Journal of Quantitative Linguistics
IS - 1
ER -