HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Deep Recurrent Neural Networks for Acoustic Modelling

William Chan; Ian Lane

Deep Recurrent Neural Networks for Acoustic Modelling

Abstract

We present a novel deep Recurrent Neural Network (RNN) model for acoustic modelling in Automatic Speech Recognition (ASR). We term our contribution as a TC-DNN-BLSTM-DNN model, the model combines a Deep Neural Network (DNN) with Time Convolution (TC), followed by a Bidirectional Long Short-Term Memory (BLSTM), and a final DNN. The first DNN acts as a feature processor to our model, the BLSTM then generates a context from the sequence acoustic signal, and the final DNN takes the context and models the posterior probabilities of the acoustic states. We achieve a 3.47 WER on the Wall Street Journal (WSJ) eval92 task or more than 8% relative improvement over the baseline DNN models.

Benchmarks

BenchmarkMethodologyMetrics
speech-recognition-on-wsj-eval92TC-DNN-BLSTM-DNN
Word Error Rate (WER): 3.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Deep Recurrent Neural Networks for Acoustic Modelling | Papers | HyperAI