Welcome to the IKCEST

Electronics Letters | Vol.55, Issue.14 | | Pages 813-816

Electronics Letters

Study on pairwise LDA for x-vector-based speaker recognition

A. KanagasundaramS. SridharanS. GanapathyC. Fookes  
Abstract

In typical x-vector-based speaker recognition systems, standard linear discriminant analysis (LDA) is used to transform the x-vector space with the aim of maximising the between-speaker discriminant information while minimising the within-speaker variability. For LDA, it is customary to use all the available speakers in the speaker recognition development dataset. In this study, the authors investigate if it would be more beneficial to estimate the between-speaker discriminant information and the within-speaker variability using the most confusing samples and the most distant samples (from the target speaker mean), respectively, in the LDA-based channel compensation. The between-speaker variance is estimated using a pairwise approach where the most confusing non-target speaker samples are found based on the Euclidean distance between the speaker mean and adjacent speaker's samples. The within-speaker variance is estimated using the mean of each speaker and the furthermost samples in the speaker sessions. Experimental results demonstrate the proposed LDA approach for an x-vector-based speaker recognition system achieves over 17% relative improvement on equal error rate over standard LDA-based x-vector speaker recognition systems on the NIST2010 corext-corext condition.

Original Text (This is the original text for your reference.)

Study on pairwise LDA for x-vector-based speaker recognition

In typical x-vector-based speaker recognition systems, standard linear discriminant analysis (LDA) is used to transform the x-vector space with the aim of maximising the between-speaker discriminant information while minimising the within-speaker variability. For LDA, it is customary to use all the available speakers in the speaker recognition development dataset. In this study, the authors investigate if it would be more beneficial to estimate the between-speaker discriminant information and the within-speaker variability using the most confusing samples and the most distant samples (from the target speaker mean), respectively, in the LDA-based channel compensation. The between-speaker variance is estimated using a pairwise approach where the most confusing non-target speaker samples are found based on the Euclidean distance between the speaker mean and adjacent speaker's samples. The within-speaker variance is estimated using the mean of each speaker and the furthermost samples in the speaker sessions. Experimental results demonstrate the proposed LDA approach for an x-vector-based speaker recognition system achieves over 17% relative improvement on equal error rate over standard LDA-based x-vector speaker recognition systems on the NIST2010 corext-corext condition.

+More

Cite this article
APA

APA

MLA

Chicago

A. KanagasundaramS. SridharanS. GanapathyC. Fookes,.Study on pairwise LDA for x-vector-based speaker recognition. 55 (14),813-816.

Disclaimer: The translated content is provided by third-party translation service providers, and IKCEST shall not assume any responsibility for the accuracy and legality of the content.
Translate engine
Article's language
English
中文
Pусск
Français
Español
العربية
Português
Kikongo
Dutch
kiswahili
هَوُسَ
IsiZulu
Action
Recommended articles

Report

Select your report category*



Reason*



By pressing send, your feedback will be used to improve IKCEST. Your privacy will be protected.

Submit
Cancel