HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Generalized End-to-End Loss for Speaker Verification

Li Wan; Quan Wang; Alan Papir; Ignacio Lopez Moreno

Generalized End-to-End Loss for Speaker Verification

Abstract

In this paper, we propose a new loss function called generalized end-to-end (GE2E) loss, which makes the training of speaker verification models more efficient than our previous tuple-based end-to-end (TE2E) loss function. Unlike TE2E, the GE2E loss function updates the network in a way that emphasizes examples that are difficult to verify at each step of the training process. Additionally, the GE2E loss does not require an initial stage of example selection. With these properties, our model with the new loss function decreases speaker verification EER by more than 10%, while reducing the training time by 60% at the same time. We also introduce the MultiReader technique, which allows us to do domain adaptation - training a more accurate model that supports multiple keywords (i.e. "OK Google" and "Hey Google") as well as multiple dialects.

Code Repositories

luomingshuang/GE2E-SV-TI-Timit-LMS
pytorch
Mentioned in GitHub
luomingshuang/GE2E-SV-TI-thchs30-LMS
pytorch
Mentioned in GitHub
hanqingguo/GE2E
pytorch
Mentioned in GitHub
yistLin/dvector
pytorch
Mentioned in GitHub
JeffT13/rd-diarization
pytorch
Mentioned in GitHub
Aurora11111/voiceprint
pytorch
Mentioned in GitHub
Aurora11111/speaker-recognition-pytorch
pytorch
Mentioned in GitHub
JeffT13/VoiceEncoder
pytorch
Mentioned in GitHub
resemble-ai/Resemblyzer
pytorch
Mentioned in GitHub
luomingshuang/GE2E-SV-TI-Chinese-LMS
pytorch
Mentioned in GitHub
HarryVolek/PyTorch_Speaker_Verification
pytorch
Mentioned in GitHub
icewing1996/baseline
pytorch
Mentioned in GitHub
tigthor/Voice-Cloning-AI
pytorch
Mentioned in GitHub
coqui-ai/TTS
pytorch
Mentioned in GitHub
yui-mhcp/base_dl_project
tf
Mentioned in GitHub
cvqluu/GE2E-Loss
pytorch
Mentioned in GitHub
JanhHyun/Speaker_Verification
tf
Mentioned in GitHub
aijianiula0601/ge2eloss-svf
tf
Mentioned in GitHub
luomingshuang/GE2E-SV-TI-Voxceleb-LMS
pytorch
Mentioned in GitHub
rf5/simple-speaker-embedding
pytorch
Mentioned in GitHub
gkv856/speaker_embedding_GE2E_loss
pytorch
Mentioned in GitHub
muskang48/Speaker-Diarization
tf
Mentioned in GitHub
piotrkawa/audio-deepfake-source-tracing
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
speaker-verification-on-callhome-
Cosine EER: 2.38
speaker-verification-on-callhomeGE2E
Cosine EER: 3.55

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp