Text-independent Speaker Identification Using Mel-frequency Energy Coefficients And Convolutional Neural Networks

2020

Articles Scientifiques Et Publications

ASJP

Université M'hamed Bougara - Boumerdes

A

Abdiche, Déhia

H

Harrar, Khaled

Résumé: Automatic Speaker Identification (ASI) is a biometric technique, which had achieved reliability in real applications, with standard feature extraction methods such as Linear Predictive Cepstral Coefficients (LPCC), Perceptual Linear Prediction (PLP), and modeling methods such as Gaussian mixture model (GMM), etc. However, the success of these manual approaches was quickly hampered by the emergence of big data, and the inability of scientists to manipulate large amounts of data, which led researchers to move towards automatic methods such as deep neural networks. In this work, a Convolutional Neural Network (CNN) is suggested for speaker identification in text-independent mode. Mel-Frequency Energy Coefficients (MFEC) method was used for extracting the characteristics of audio signals and the obtained coefficients were injected into the convolutional neural network model for classification (identification). In addition, a comparison was made between the proposed method and the existing traditional methods. Experimental results show that the proposed structure resulted in a speaker identification rate of 97.89%, which is much higher than the rates obtained in the old state of the art methods.

Mots-clès:

automatic speaker identification (asi)

mel

frequency energy coefficients (mfec)

convolutional neural network (cnn)

Publié dans la revue:

Nos services universitaires et académiques

Thèses-Algérie vous propose ses divers services d’édition: mise en page, révision, correction, traduction, analyse du plagiat, ainsi que la réalisation des supports graphiques et de présentation (Slideshows).

Obtenez dès à présent et en toute facilité votre devis gratuit et une estimation de la durée de réalisation et bénéficiez d'une qualité de travail irréprochable et d'un temps de livraison imbattable!

Text-independent Speaker Identification Using Mel-frequency Energy Coefficients And Convolutional Neural Networks

Nos services universitaires et académiques

Aucun fichier associé

Si le fichier est volumineux, l'affichage peut échouer. Vous pouvez obtenir le fichier directement en cliquant sur le bouton "Télécharger".

Documents et articles similaires: