HyperAI
Home
News
Latest Papers
Tutorials
Datasets
Wiki
SOTA
LLM Models
GPU Leaderboard
Events
Search
About
English
HyperAI
Toggle sidebar
Search the site…
⌘
K
Home
SOTA
Speech Separation
Speech Separation On Whamr
Speech Separation On Whamr
Metrics
SI-SDRi
Results
Performance results of various models on this benchmark
Columns
Model Name
SI-SDRi
Paper Title
Repository
SepReformer-L + DM
17.1
Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation
Bi-LSTM-TASNET
9.2
WHAM!: Extending Speech Separation to Noisy Environments
DPTNET - SRSSN
12.3
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
-
TD-Confomer (S)
10.5
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
MossFormer (L) + DM
16.3
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions
TD-Conformer (L) + DM
13.4
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
VSUNOS
12.2
Voice Separation with an Unknown Number of Multiple Speakers
DPRNN - SRSSN
12.3
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
-
TF-Locoformer (M)
18.5
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
Deformable TCN + Dynamic Mixing
11.1
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
TD-Conformer (XL) + DM
14.6
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
Wavesplit
13.2
Wavesplit: End-to-End Speech Separation by Speaker Clustering
-
Improved Sudo rm -rf (U=36)
13.5
Compute and memory efficient universal sound source separation
Deformable TCN + Shared Weights + Dynamic Mixing
10.1
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
MossFormer2
17.0
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation
Sudo rm -rf (U=16)
12.1
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation
TF-Locoformer (S)
17.4
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
TD-Confomer (M) + DM
12
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
0 of 18 row(s) selected.
Previous
Next