Applying functional partition in the investigation of lexical tonal-pattern categories in an under-resourced Chinese dialect

Junru Wu*, Yiya Chen, Vincent J. van Heuven, Niels O. Schiller

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary Works; RGC 32 - Refereed conference paper (with host publication); peer-review

Abstract

The present study applied functional partition to investigate disyllabic lexical tonal-pattern categories in an under-resourced Chinese dialect, Jinan Mandarin. A two-stage partitioning procedure was introduced to process, in a semi-automatic way, a multi-speaker corpus that contains irregular lexical variants. In the first stage, a program suggests lexical tonal variants for the recordings of each word, and the phonetician decides among them; the suggestions are based on the result of a functional k-means partitioning algorithm and on tonal information from an available pronunciation dictionary of a related Chinese dialect, Standard Chinese. The second stage iterates a functional version of k-means partitioning with Silhouette-based criteria to abstract an optimal number of tonal patterns from the whole corpus; it also allows the phoneticians to adjust the results of the automatic procedure in a controlled way and to redo the partitioning for a subset of clusters. The procedure yielded eleven disyllabic tonal patterns for Jinan Mandarin, representing the tonal system used by contemporary Jinan Mandarin speakers from a wide range of age groups. In this respect the procedure differs from previous linguistic descriptions, which were based on the pronunciations of more elderly speakers. The method incorporates phoneticians’ linguistic knowledge and preliminary linguistic resources into the partitioning procedure. It can improve efficiency and objectivity in the investigation of lexical tonal-pattern categories when building pronunciation dictionaries for under-resourced languages. © 2018, Springer Nature Singapore Pte Ltd.
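To make the second stage more concrete, the following is a minimal sketch of k-means partitioning with a Silhouette-based choice of the number of clusters. It is not the authors' implementation. Several things here are assumptions made for illustration: each pitch contour is reduced to a short vector of equally spaced sample points (a simple stand-in for a functional representation), distances are plain Euclidean, centres are initialised by a farthest-first rule, and the criterion is the mean silhouette. The function names and the synthetic contours are hypothetical.

```python
import math

def dist(a, b):
    """Euclidean distance between two equally sampled contours (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def init_centers(contours, k):
    """Farthest-first initialisation (an assumption, chosen to be deterministic)."""
    centers = [contours[0]]
    while len(centers) < k:
        centers.append(max(contours, key=lambda c: min(dist(c, m) for m in centers)))
    return centers

def kmeans(contours, k, iters=100):
    """Plain k-means on sampled contours; returns one cluster label per contour."""
    centers = init_centers(contours, k)
    labels = None
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda j: dist(c, centers[j])) for c in contours]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [c for c, l in zip(contours, labels) if l == j]
            if members:  # keep the old centre if a cluster is empty
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def mean_silhouette(contours, labels):
    """Mean silhouette width over all contours (singleton clusters score 0)."""
    clusters = {}
    for c, l in zip(contours, labels):
        clusters.setdefault(l, []).append(c)
    if len(clusters) < 2:
        return -1.0
    scores = []
    for c, l in zip(contours, labels):
        own = clusters[l]
        if len(own) == 1:
            scores.append(0.0)
            continue
        # a: mean distance to the other members of the same cluster
        a = sum(dist(c, o) for o in own) / (len(own) - 1)
        # b: mean distance to the nearest other cluster
        b = min(sum(dist(c, o) for o in other) / len(other)
                for m, other in clusters.items() if m != l)
        scores.append(0.0 if max(a, b) == 0 else (b - a) / max(a, b))
    return sum(scores) / len(scores)

def choose_k(contours, k_range):
    """Run k-means for each candidate k and keep the k with the best mean silhouette."""
    scores = {k: mean_silhouette(contours, kmeans(contours, k)) for k in k_range}
    return max(scores, key=scores.get), scores

# Synthetic contours (hypothetical data): rising, falling and level shapes,
# each repeated with a small vertical shift to mimic speaker variation.
shapes = [[0, 1, 2, 3, 4], [4, 3, 2, 1, 0], [2, 2, 2, 2, 2]]
contours = [[v + 0.1 * i for v in shape] for shape in shapes for i in range(4)]

best_k, scores = choose_k(contours, range(2, 6))
labels = kmeans(contours, best_k)
print(best_k, labels)
```

In a semi-automatic workflow of the kind the abstract describes, the selected partition would be a proposal for the phonetician to inspect, and `choose_k` could be rerun on the contours of only those clusters that need to be repartitioned.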
Original language: English
Title of host publication: Man-Machine Speech Communication - 14th National Conference, NCMMSC 2017, Revised Selected Papers
Publisher: Springer Verlag
Pages: 24-35
Volume: 807
ISBN (Print): 9789811081101
Publication status: Published - 2018
Externally published: Yes
Event: 14th National Conference on Man-Machine Speech Communication, NCMMSC 2017 - Lianyungang, China
Duration: 11 Oct 2017 to 13 Oct 2017

Publication series

Name: Communications in Computer and Information Science
Volume: 807
ISSN (Print): 1865-0929

Conference

Conference: 14th National Conference on Man-Machine Speech Communication, NCMMSC 2017
Place: China
City: Lianyungang
Period: 11/10/17 to 13/10/17

Bibliographical note

Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].

Research Keywords

  • K-means partition
  • Pattern recognition
  • Phonetics
  • Pronunciation dictionary
  • Tone
