MLP-based isolated phoneme classification using likelihood features extracted from reconstructed phase space

Yasser Shekofteh, Farshad Almasganj, Ayoub Daliri

Research output: Contribution to journalArticlepeer-review

12 Scopus citations

Abstract

Nonlinear properties of a complex signal can be represented in reconstructed phase space (RPS). Previously, researchers have developed RPS-based feature extraction approaches to capture nonlinear properties. Typically, these approaches are more computationally demanding - higher run-time - and less accurate than traditional techniques such as Mel-frequency cepstral coefficients (MFCCs) that fail to capture nonlinear properties of signals. To overcome these issues, we propose a new RPS-based feature extraction approach that is based on a previously reported approach. The proposed approach calculates the similarities between the embedded speech signals and a set of predefined speech attractor models in the RPS, and uses the similarities as a set of proper input features for a final phonetic classifier. A set of Gaussian mixture models (GMMs) is trained to represent the variety of all phoneme attractors in the RPS. Using the developed GMMs, for each embedded out-sample speech signal, a feature vector is calculated that consists of the Log-likelihoods. Then, an MLP-based classifier is used to estimate posterior probabilities for the phoneme classes. To test the performance of the proposed approach, we apply the approach to a Persian speech corpus (i.e., FARSDAT). Results show 1.89% absolute classification accuracy improvement in comparison to performance of a baseline system that exploits MFCC features. Combining different classifiers that use the proposed RPS-based features and MFCC features, the classifier gain the highest accuracy of 68.85% phoneme classification rate, with absolute accuracy improvements of 4.78% against a baseline system.

Original languageEnglish (US)
Article number2332
Pages (from-to)1-9
Number of pages9
JournalEngineering Applications of Artificial Intelligence
Volume44
DOIs
StatePublished - Sep 1 2015
Externally publishedYes

Keywords

  • Gaussian mixture models
  • Isolated phoneme classification
  • Nonlinear speech processing
  • Phoneme attractor
  • Reconstructed phase space

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Artificial Intelligence
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'MLP-based isolated phoneme classification using likelihood features extracted from reconstructed phase space'. Together they form a unique fingerprint.

Cite this