Improved Sentence Modeling using Suffix Bidirectional LSTM

Siddhartha Brahma

Abstract

Recurrent neural networks have become ubiquitous in computing representations of sequential data, especially textual data in natural language processing. In particular, Bidirectional LSTMs are at the heart of several neural models achieving state-of-the-art performance in a wide variety of tasks in NLP. However, BiLSTMs are known to suffer from sequential bias: the contextual representation of a token is heavily influenced by tokens close to it in a sentence. We propose a general and effective improvement to the BiLSTM model which encodes each suffix and prefix of a sequence of tokens in both forward and reverse directions. We call our model Suffix Bidirectional LSTM or SuBiLSTM. This introduces an alternate bias that favors long-range dependencies. We apply SuBiLSTMs to several tasks that require sentence modeling. We demonstrate that using SuBiLSTM instead of a BiLSTM in existing models leads to improvements in performance in learning general sentence representations, text classification, textual entailment, and paraphrase detection. Using SuBiLSTM, we achieve new state-of-the-art results for fine-grained sentiment classification and question classification.
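
The idea of encoding every prefix and suffix in both directions can be illustrated with a small sketch. This is not the paper's implementation: `toy_encoder` is a hypothetical stand-in for an LSTM, the token at position i is assumed to belong to both its prefix and its suffix, and concatenating the four encodings is an assumed way to combine them, since the abstract does not say how they are merged.

```python
from typing import Callable, List, Sequence, Tuple

Vector = List[float]
Encoder = Callable[[Sequence[Vector]], Vector]


def toy_encoder(seq: Sequence[Vector]) -> Vector:
    """Hypothetical stand-in for an LSTM: an order-sensitive running average."""
    state = [0.0] * len(seq[0])
    for x in seq:
        # Each step halves the old state, so later inputs weigh more
        # and the reading direction changes the result.
        state = [0.5 * s + 0.5 * v for s, v in zip(state, x)]
    return state


def suffix_prefix_views(
    tokens: Sequence[Vector],
) -> List[Tuple[Sequence[Vector], ...]]:
    """For each position i, return (prefix, reversed prefix, suffix, reversed suffix).

    Assumption: token i is included in both its prefix and its suffix.
    """
    views = []
    for i in range(len(tokens)):
        prefix = tokens[: i + 1]
        suffix = tokens[i:]
        views.append((prefix, prefix[::-1], suffix, suffix[::-1]))
    return views


def encode_tokens(
    tokens: Sequence[Vector], encoder: Encoder = toy_encoder
) -> List[Vector]:
    """Encode each token from its four views.

    Assumption: the four encodings are simply concatenated.
    """
    representations = []
    for views in suffix_prefix_views(tokens):
        rep: Vector = []
        for view in views:
            rep.extend(encoder(view))
        representations.append(rep)
    return representations


if __name__ == "__main__":
    sentence = [[1.0], [2.0], [3.0]]  # three one-dimensional "token embeddings"
    for i, rep in enumerate(encode_tokens(sentence)):
        print(i, rep)
```

In this naive form every prefix and suffix is re-encoded from scratch, so the number of encoder steps grows quadratically with sentence length. The sketch is meant only to make the four views per token concrete, not to be efficient.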

Benchmarks

Benchmark                                   Methodology                    Metrics
sentiment-analysis-on-cr                    SuBiLSTM-Tied                  Accuracy: 86.5
sentiment-analysis-on-mr                    SuBiLSTM-Tied                  Accuracy: 81.6
sentiment-analysis-on-sst-2-binary          Suffix BiLSTM                  Accuracy: 91.2
sentiment-analysis-on-sst-5-fine-grained    BCN+Suffix BiLSTM-Tied+CoVe    Accuracy: 56.2
