HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Attention-Based Models for Speech Recognition

Jan Chorowski; Dzmitry Bahdanau; Dmitriy Serdyuk; Kyunghyun Cho; Yoshua Bengio

Attention-Based Models for Speech Recognition

Abstract

Recurrent sequence generators conditioned on input data through an attention mechanism have recently shown very good performance on a range of tasks in- cluding machine translation, handwriting synthesis and image caption gen- eration. We extend the attention-mechanism with features needed for speech recognition. We show that while an adaptation of the model used for machine translation in reaches a competitive 18.7% phoneme error rate (PER) on the TIMIT phoneme recognition task, it can only be applied to utterances which are roughly as long as the ones it was trained on. We offer a qualitative explanation of this failure and propose a novel and generic method of adding location-awareness to the attention mechanism to alleviate this issue. The new method yields a model that is robust to long inputs and achieves 18% PER in single utterances and 20% in 10-times longer (repeated) utterances. Finally, we propose a change to the at- tention mechanism that prevents it from concentrating too much on single frames, which further reduces PER to 17.6% level.

Code Repositories

mnm-rnd/elsa-voice-asr
pytorch
Mentioned in GitHub
sooftware/OpenSpeech
pytorch
Mentioned in GitHub
sooftware/End-to-end-Speech-Recognition
pytorch
Mentioned in GitHub
biyoml/End-to-End-Mandarin-ASR
pytorch
Mentioned in GitHub
jackjhliu/End-to-End-Mandarin-ASR
pytorch
Mentioned in GitHub
msalhab96/SpeeQ
pytorch
Mentioned in GitHub
s3prl/End-to-end-ASR-Pytorch
pytorch
Mentioned in GitHub
Alexander-H-Liu/End-to-end-ASR-Pytorch
pytorch
Mentioned in GitHub
neil-zeng/asr
pytorch
Mentioned in GitHub
CKRC24/Listen-and-Translate
tf
Mentioned in GitHub
biyoml/Pytorch-End-to-End-ASR-on-TIMIT
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
speech-recognition-on-timitBi-RNN + Attention
Percentage error: 17.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp