Effect of feature warping and decorrelation on Mel Filterbank Slope for speaker recognition

Date Issued
26-10-2012
Author(s)
Madikeri, Srikanth
Murthy, Hema A
Indian Institute of Technology, Madras
DOI
10.1109/SPCOM.2012.6290222
Abstract
The Mel Filterbank Slope (MFS) feature has been shown to consistently outperform conventional Mel Frequency Cepstral Coefficients (MFCC) for speaker recognition. This work addresses two issues with the feature: its robustness to intersession variability and its large dimensionality. Short-term feature warping is used to improve the robustness of MFS, giving an absolute improvement of 1% in equal error rate (EER) on the NIST 2003 SRE benchmark dataset. Dimensionality reduction of the raw MFS features is performed using the Discrete Cosine Transform (DCT), which reduces the dimension efficiently with no deterioration in performance. Feature warping combined with DCT gives an absolute improvement of 2% in EER. An overall performance improvement of 3.3% is shown when the feature is fused with temporal information from MFCC. © 2012 IEEE.
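
The two processing steps named in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: it assumes the common rank-based form of short-term feature warping (each value is replaced by the standard normal quantile of its rank within a sliding window) and an unnormalised DCT-II. The function names `warp_stream` and `dct2`, the 301-frame default window, and the truncation of the window at utterance edges are all hypothetical choices made for this sketch. The abstract does not say how MFS itself is computed or in which order the two steps are applied, so neither is shown.

```python
import math
from statistics import NormalDist


def warp_stream(values, window=301):
    """Warp one feature dimension (a list of per-frame values) towards N(0, 1).

    Assumption: rank-based warping over a sliding window centred on each frame.
    The window length is illustrative, not a value taken from the paper.
    """
    std_normal = NormalDist()
    n = len(values)
    half = window // 2
    warped = []
    for t in range(n):
        # The window is truncated at utterance edges (an assumption of this sketch).
        lo, hi = max(0, t - half), min(n, t + half + 1)
        win = values[lo:hi]
        # Ascending rank of the current frame within its window (1 = smallest).
        rank = 1 + sum(1 for v in win if v < values[t])
        # Convert the rank to a quantile, then to a standard normal value.
        warped.append(std_normal.inv_cdf((rank - 0.5) / len(win)))
    return warped


def dct2(frame, n_keep):
    """Unnormalised DCT-II of one frame, keeping the first n_keep coefficients."""
    n = len(frame)
    return [
        sum(x * math.cos(math.pi * k * (2 * i + 1) / (2 * n))
            for i, x in enumerate(frame))
        for k in range(n_keep)
    ]
```

In use, `warp_stream` would be called once per feature dimension across the frames of an utterance, and `dct2` once per frame across its feature dimensions, with `n_keep` smaller than the frame length to reduce dimensionality.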
Subjects
  • channel compensation
  • feature warping
  • speaker recognition
