HyperAIHyperAI

Command Palette

Search for a command to run...

pyannote.audio: neural building blocks for speaker diarization

Hervé Bredin Ruiqing Yin Juan Manuel Coria Gregory Gelly Pavel Korshunov Marvin Lavechin Diego Fustes Hadrien Titeux Wassim Bouaziz Marie-Philippe Gill

Abstract

We introduce pyannote.audio, an open-source toolkit written in Python for speaker diarization. Based on PyTorch machine learning framework, it provides a set of trainable end-to-end neural building blocks that can be combined and jointly optimized to build speaker diarization pipelines. pyannote.audio also comes with pre-trained models covering a wide range of domains for voice activity detection, speaker change detection, overlapped speech detection, and speaker embedding -- reaching state-of-the-art performance for most of them.


Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing

HyperAI Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp