ShiftySpeech Speech Distribution Evaluation Dataset

Date: a month ago
Size: 389.35 GB
Organization: Johns Hopkins University
Paper URL: 2502.05674
License: Apache 2.0

ShiftySpeech is a large-scale synthetic speech detection benchmark released by Johns Hopkins University in 2025. The related paper is titled "ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts". The benchmark is designed to study how well synthetic speech detection models generalize in real-world conditions under distribution shift, including changes in language, speaker, generation model, and recording conditions.

This dataset contains over 3,000 hours of synthesized speech covering seven source domains. The domains include read speech, podcasts, YouTube recordings, and other scenarios with background noise or non-standard recording conditions, and they vary in language, speaker age, accent, and gender. The data covers three languages (English, Chinese, and Japanese). Speech was generated using six TTS (text-to-speech) systems and twelve vocoders (waveform generators) to create varying degrees of distribution shift across generation systems.

ShiftySpeech.torrent
  • ShiftySpeech/
    • README.md (1.6 KB)
    • README.txt (3.2 KB)
    • data/
      • ShiftySpeech.zip (389.35 GB)
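
Because the archive is 389.35 GB, it can help to inspect its contents before extracting anything. The following is a minimal sketch using Python's standard `zipfile` module. The function name `summarize_zip` is hypothetical, the archive path is taken from the listing above, and the internal folder layout of the zip is not documented here, so the script only groups entries by their first path component.

```python
import zipfile
from collections import Counter
from pathlib import Path


def summarize_zip(zip_path):
    """Count files and uncompressed bytes per top-level entry, without extracting."""
    file_counts = Counter()
    byte_totals = Counter()
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            # Group by the first path component. The archive's internal layout
            # is an unknown here; a file at the root is grouped under its own name.
            top_level = info.filename.split("/", 1)[0]
            file_counts[top_level] += 1
            byte_totals[top_level] += info.file_size
    return file_counts, byte_totals


if __name__ == "__main__":
    # Path assumed from the torrent listing; adjust to your download location.
    archive_path = Path("ShiftySpeech/data/ShiftySpeech.zip")
    if archive_path.exists():
        counts, sizes = summarize_zip(archive_path)
        for name in sorted(counts):
            print(f"{name}: {counts[name]} files, {sizes[name] / 1e9:.2f} GB")
```

Reading only the zip's central directory keeps this fast even for a very large archive, since no audio data is decompressed.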
