Genetic algorithm for optimizing the nonlinear time alignment of automatic speech recognition systems

Research output: Journal Publications and Reviews (RGC: 21, 22, 62)21_Publication in refereed journalpeer-review

14 Scopus Citations
View graph of relations

Author(s)

Detail(s)

Original languageEnglish
Pages (from-to)559-566
Journal / PublicationIEEE Transactions on Industrial Electronics
Volume43
Issue number5
Publication statusPublished - 1996
Externally publishedYes

Abstract

Dynamic time warping (DTW) is a nonlinear timealignment technique for automatic speech recognition (ASK) systems. It had been widely used in many commercial and industrial products, ranging from electronic dailies/dictionaries to wireless voice digit dialers. DTW has the advantages of fast training and searching times, which makes it more popular than other available ASR techniques. However, there exist some limitations to DTW, such as the stringent rule on slope weighting, the nontrivial compulation of the A"-best paths, and the significant increase in computational time when the endpoint constraint is relaxed or the variations of the length of pattern increased. In this paper, a stochastic method called the genetic algorithm (GA), which is used to solve the nonlinear time alignment problem, is presented. Experimental results show that the GA has a better performance than the DTW. In addition, two derivatives of GA: the hybrid GA and the parallel GA are also presented. © 1996 IEEE.

Research Area(s)

  • Automatic speech recognition, Dynamic time warping, Genetic algorithm, Hybrid genetic algorithm, Parallel genetic algorithm, Stochastic method