Abstract
In this paper, we attempt to develop an efficient and accurate algorithm for joint time delay and pitch estimation of a speech signal received at a microphone array. The time delay measurement allows a speaker to be located, while the detected pitch frequency is useful for analyzing the acoustic properties of the sound. A subspace method based on state-space realization is first introduced for joint time delay and frequency estimation of a synthetic signal consisting of several frequency components. The frequency estimates are obtained directly from the eigenvalues of the state transition matrix, whilst the time delay is estimated from the observation matrix and the estimated frequencies using a least-squares approach. The method is then extended to track both the time delay and the pitch frequency of a speech signal, which is modeled as a sum of sinusoids that are harmonically related to the fundamental frequency (pitch) and spectrally shaped by the vocal tract transfer function. Extensive simulation tests were carried out to validate the effectiveness and accuracy of the proposed algorithm.
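
The synthetic-signal stage described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not give the formulation, so the sketch makes several assumptions. The signal is modeled as a sum of complex exponentials (not real, harmonically related sinusoids), only two channels are used, the state transition matrix is obtained from the shift invariance of a Hankel-matrix signal subspace, and the delay is fitted by least squares to the inter-channel phase differences rather than derived from the observation matrix. The function name `estimate_freqs_and_delay` and its parameters are hypothetical.

```python
import numpy as np


def estimate_freqs_and_delay(x1, x2, num_components, num_rows=None):
    """Sketch: subspace frequency estimates plus a least-squares delay fit.

    Assumes x1[n] = sum_k c_k * exp(1j * w_k * n) and x2[n] = x1[n - delay],
    with |w_k * delay| < pi so that the phase differences do not wrap.
    Returns (frequencies in rad/sample, delay in samples).
    """
    x1 = np.asarray(x1, dtype=complex)
    x2 = np.asarray(x2, dtype=complex)
    n = len(x1)
    rows = num_rows or n // 2

    # Hankel data matrix of the reference channel: hankel[i, j] = x1[i + j].
    hankel = np.array([x1[i:i + n - rows + 1] for i in range(rows)])

    # The leading left singular vectors span the signal subspace.
    u, _, _ = np.linalg.svd(hankel, full_matrices=False)
    u_s = u[:, :num_components]

    # Shift invariance: u_s[:-1] @ a_mat ~= u_s[1:], solved in the
    # least-squares sense; a_mat plays the role of the state transition matrix.
    a_mat = np.linalg.lstsq(u_s[:-1], u_s[1:], rcond=None)[0]

    # Its eigenvalues are exp(1j * w_k), so their angles are the frequencies.
    freqs = np.sort(np.angle(np.linalg.eigvals(a_mat)))

    # Complex amplitudes per channel via a Vandermonde least-squares fit.
    vander = np.exp(1j * np.outer(np.arange(n), freqs))
    c1 = np.linalg.lstsq(vander, x1, rcond=None)[0]
    c2 = np.linalg.lstsq(vander, x2, rcond=None)[0]

    # Phase difference per component is -w_k * delay; fit delay by least squares.
    phase = np.angle(c2 / c1)
    delay = -np.dot(freqs, phase) / np.dot(freqs, freqs)
    return freqs, delay
```

With frequencies in radians per sample, the returned delay is in samples. Dividing by the sampling rate converts it to seconds, and the phase-wrapping assumption limits the largest delay the fit can recover.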
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2003 International Symposium on Circuits and Systems |
| Publisher | IEEE |
| Volume | V |
| ISBN (Print) | 0780377613 |
| Publication status | Published - May 2003 |
| Event | Proceedings of the 2003 IEEE International Symposium on Circuits and Systems, Imperial Queen's Park Hotel, Bangkok, Thailand. Duration: 25 May 2003 → 28 May 2003 |
Conference
| Conference | Proceedings of the 2003 IEEE International Symposium on Circuits and Systems |
|---|---|
| Abbreviated title | ISCAS 2003 |
| Place | Thailand |
| City | Bangkok |
| Period | 25/05/03 → 28/05/03 |

Title: Joint time delay and pitch estimation for speaker localization