Home Console Docs News Papers Tutorials Datasets Wiki SOTA LLM Models GPU Leaderboard Events

English

4 months ago

Generalized End-to-End Loss for Speaker Verification

View Paper Details

Li Wan; Quan Wang; Alan Papir; Ignacio Lopez Moreno

Generalized End-to-End Loss for Speaker Verification

Abstract

In this paper, we propose a new loss function called generalized end-to-end (GE2E) loss, which makes the training of speaker verification models more efficient than our previous tuple-based end-to-end (TE2E) loss function. Unlike TE2E, the GE2E loss function updates the network in a way that emphasizes examples that are difficult to verify at each step of the training process. Additionally, the GE2E loss does not require an initial stage of example selection. With these properties, our model with the new loss function decreases speaker verification EER by more than 10%, while reducing the training time by 60% at the same time. We also introduce the MultiReader technique, which allows us to do domain adaptation - training a more accurate model that supports multiple keywords (i.e. "OK Google" and "Hey Google") as well as multiple dialects.

Code Repositories

luomingshuang/GE2E-SV-TI-Timit-LMS

pytorch

Mentioned in GitHub

luomingshuang/GE2E-SV-TI-thchs30-LMS

pytorch

Mentioned in GitHub

hanqingguo/GE2E

pytorch

Mentioned in GitHub

yistLin/dvector

pytorch

Mentioned in GitHub

JeffT13/rd-diarization

pytorch

Mentioned in GitHub

PaddlePaddle/PaddleSpeech

paddle

Aurora11111/voiceprint

pytorch

Mentioned in GitHub

Aurora11111/speaker-recognition-pytorch

pytorch

Mentioned in GitHub

JeffT13/VoiceEncoder

pytorch

Mentioned in GitHub

resemble-ai/Resemblyzer

pytorch

Mentioned in GitHub

luomingshuang/GE2E-SV-TI-Chinese-LMS

pytorch

Mentioned in GitHub

HarryVolek/PyTorch_Speaker_Verification

pytorch

Mentioned in GitHub

Janghyun1230/Speaker_Verification

tf

Mentioned in GitHub

icewing1996/baseline

pytorch

Mentioned in GitHub

Suhee05/Text-Independent-Speaker-Verification

tf

Mentioned in GitHub

google/speaker-id/tree/master/lingvo

Official

CorentinJ/Real-Time-Voice-Cloning

tf

Mentioned in GitHub

tigthor/Voice-Cloning-AI

pytorch

Mentioned in GitHub

pytorch

Mentioned in GitHub

yui-mhcp/base_dl_project

tf

Mentioned in GitHub

cvqluu/GE2E-Loss

pytorch

Mentioned in GitHub

JanhHyun/Speaker_Verification

tf

Mentioned in GitHub

aijianiula0601/ge2eloss-svf

tf

Mentioned in GitHub

luomingshuang/GE2E-SV-TI-Voxceleb-LMS

pytorch

Mentioned in GitHub

rf5/simple-speaker-embedding

pytorch

Mentioned in GitHub

gkv856/speaker_embedding_GE2E_loss

pytorch

Mentioned in GitHub

muskang48/Speaker-Diarization

tf

Mentioned in GitHub

zhangmin4215/PyTorch_Speaker_Verification

pytorch

Mentioned in GitHub

piotrkawa/audio-deepfake-source-tracing

pytorch

Mentioned in GitHub

dalonlobo/diarization-experiments

tf

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
speaker-verification-on-callhome	-	Cosine EER: 2.38
speaker-verification-on-callhome	GE2E	Cosine EER: 3.55

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp