Skip to main navigation Skip to search Skip to main content

Pitch-based gender identification with two-stage classification

Yakun Hu, Dapeng Wu, Antonio Nucci

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

In this paper, we address the speech-based gender identification problem. Mel-Frequency Cepstral Coefficients (MFCC) of voice samples are typically used as the features for gender identification. However, MFCC-based classification incurs high complexity. This paper proposes a novel pitch-based gender identification system with a two-stage classifier to ensure accurate identification and low complexity. The first stage of the classifier identifies and labels all the speakers whose pitch clearly indicates the gender of the speaker; the complexity of this stage is very low since only threshold-based decision rule on a scalar (i.e., pitch) is used. The ambiguous voice samples from all the other speakers (which cannot be classified with high accuracy by the first stage, and can be regarded as suspicious speakers or difficult cases) are forwarded to the second-stage for finer examination; the second-stage of our classifier uses Gaussian Mixture Model to accurately isolate voice samples based on gender. Experiment results show that our system is speech language/content independent, microphone independent, and robust against noisy recording conditions. Our system is extremely accurate with probability of correct classification of 98.65%, and very efficient with about 5 s required for feature extraction and classification. © 2011 John Wiley & Sons, Ltd.
Original languageEnglish
Pages (from-to)211-225
JournalSecurity and Communication Networks
Volume5
Issue number2
DOIs
Publication statusPublished - Feb 2012
Externally publishedYes

Bibliographical note

Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].

Research Keywords

  • Energy separation
  • Gaussian mixture model
  • Gender identification
  • Pitch
  • Suspicious speaker detection

Fingerprint

Dive into the research topics of 'Pitch-based gender identification with two-stage classification'. Together they form a unique fingerprint.

Cite this