A GAUSSIAN MIXTURE MODEL-BASED SPEAKER RECOGNITION SYSTEM

Kumari Piu Gorai; Thomas Abraham

doi:10.22159/ajpcr.2017.v10s1.19596

Authors

Kumari Piu Gorai School of Computing Science and Engineering, VIT University, Chennai Campus, Tamil Nadu, India
Thomas Abraham School of Computing Science and Engineering, VIT University, Chennai Campus, Tamil Nadu, India

DOI:

https://doi.org/10.22159/ajpcr.2017.v10s1.19596

Keywords:

Speaker recognition, Mel-frequency cepstral coefficients, Gaussian mixture model, Support vector machine, Robust speaker recognition system

Abstract

A human being has lot of unique features and one of them is voice. Speaker recognition is the use of a system to distinguish and identify a person from his/
her vocal sound. A speaker recognition system (SRS) can be used as one of the authentication technique, in addition to the conventional authentication methods. This paper represents the overview of voice signal characteristics and speaker recognition techniques. It also discusses the advantages and problem of current SRS. The only biometric system that allows users to authenticate remotely is voice-based SRS, we are in the need of a robust SRS.

Downloads

Download data is not yet available.

References

Campbell JP. Speaker recognition: A tutorial. Proceedings of the IEEE. Vol. 85. No. 9. September; 1997.

Doddington GR. Speaker recognitionâ€”Identifying people by their voices. Proc IEEE 1985;73;1651-64.

Kenny P, Boulianne G, Ouellet P, Dumouchel P. Speaker and session variability in GMM-based speaker verification. IEEE Trans Audio Speech Lang Process 2007;15(4):1448-60.

Kinnunen T, Li H. An overview of text-independent speaker recognition: From features to supervectors. Speech Commun 2010;52:12-40.

Togneri R, Pullella D. An overview of speaker identification: Accuracy and robustness issues. IEEE Circuits and Systems Magazine Second Quarter. 2011. p. 23-60.

Liu G, Lei Y, Hansen JH. Robust feature front-end for speaker identification. In: Proceeding ICASSP. Kyoto, Japan: March; 2012. p. 4233-6.

Campbell W, Campbell J, Reynolds D, Singer E, Carrasquillo PT. Support vector machines for speaker and language recognition. Comput Speech Lang 2006;20(2-3):210-29.

May T, van de Par S, Kohlrausch A. Noise-robust speaker recognition combining missing data techniques and universal background modeling. IEEE Trans Audio Speech Lang Process 2012;20(1):108-21.

Solomonoff A, Campbell WM, Boardman I. Advances in channel compensation for SVM speaker recognition. In: Proceeding ICASSP. 2005. p. 629-32.

Reynolds D, Rose R. Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Process 1995;3(1):72-83.

Alam MJ, Kenny P, Shaughnessy DO. Robust feature extraction based on an asymmetric level-dependent auditory filterbank and a subband spectrum enhancement technique. Digit Signal Process 2014;29:147-57.

A GAUSSIAN MIXTURE MODEL-BASED SPEAKER RECOGNITION SYSTEM

Authors

DOI:

Keywords:

Abstract

Downloads

References

Published

How to Cite

Issue

Section