A GAUSSIAN MIXTURE MODEL-BASED SPEAKER RECOGNITION SYSTEM
DOI:
https://doi.org/10.22159/ajpcr.2017.v10s1.19596Keywords:
Speaker recognition, Mel-frequency cepstral coefficients, Gaussian mixture model, Support vector machine, Robust speaker recognition systemAbstract
A human being has lot of unique features and one of them is voice. Speaker recognition is the use of a system to distinguish and identify a person from his/
her vocal sound. A speaker recognition system (SRS) can be used as one of the authentication technique, in addition to the conventional authentication methods. This paper represents the overview of voice signal characteristics and speaker recognition techniques. It also discusses the advantages and problem of current SRS. The only biometric system that allows users to authenticate remotely is voice-based SRS, we are in the need of a robust SRS.
Downloads
References
Campbell JP. Speaker recognition: A tutorial. Proceedings of the IEEE. Vol. 85. No. 9. September; 1997.
Doddington GR. Speaker recognition—Identifying people by their voices. Proc IEEE 1985;73;1651-64.
Kenny P, Boulianne G, Ouellet P, Dumouchel P. Speaker and session variability in GMM-based speaker verification. IEEE Trans Audio Speech Lang Process 2007;15(4):1448-60.
Kinnunen T, Li H. An overview of text-independent speaker recognition: From features to supervectors. Speech Commun 2010;52:12-40.
Togneri R, Pullella D. An overview of speaker identification: Accuracy and robustness issues. IEEE Circuits and Systems Magazine Second Quarter. 2011. p. 23-60.
Liu G, Lei Y, Hansen JH. Robust feature front-end for speaker identification. In: Proceeding ICASSP. Kyoto, Japan: March; 2012. p. 4233-6.
Campbell W, Campbell J, Reynolds D, Singer E, Carrasquillo PT. Support vector machines for speaker and language recognition. Comput Speech Lang 2006;20(2-3):210-29.
May T, van de Par S, Kohlrausch A. Noise-robust speaker recognition combining missing data techniques and universal background modeling. IEEE Trans Audio Speech Lang Process 2012;20(1):108-21.
Solomonoff A, Campbell WM, Boardman I. Advances in channel compensation for SVM speaker recognition. In: Proceeding ICASSP. 2005. p. 629-32.
Reynolds D, Rose R. Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Process 1995;3(1):72-83.
Alam MJ, Kenny P, Shaughnessy DO. Robust feature extraction based on an asymmetric level-dependent auditory filterbank and a subband spectrum enhancement technique. Digit Signal Process 2014;29:147-57.
Published
How to Cite
Issue
Section
The publication is licensed under CC By and is open access. Copyright is with author and allowed to retain publishing rights without restrictions.