3 months ago

Rethinking Complex Neural Network Architectures for Document Classification

{Ashutosh Adhikari Achyudh Ram Raphael Tang Jimmy Lin}

Abstract

Neural network models for many NLP tasks have grown increasingly complex in recent years, making training and deployment more difficult. A number of recent papers have questioned the necessity of such architectures and found that well-executed, simpler models are quite effective. We show that this is also the case for document classification: in a large-scale reproducibility study of several recent neural models, we find that a simple BiLSTM architecture with appropriate regularization yields accuracy and F1 that are either competitive or exceed the state of the art on four standard benchmark datasets. Surprisingly, our simple model is able to achieve these results without attention mechanisms. While these regularization techniques, borrowed from language modeling, are not novel, to our knowledge we are the first to apply them in this context. Our work provides an open-source platform and the foundation for future work in document classification.

Benchmarks

Benchmark	Methodology	Metrics
document-classification-on-imdb-m	LSTM-reg (single model)	Accuracy: 52.8
document-classification-on-reuters-21578	LSTM-reg (single model)	F1: 87.0
text-classification-on-yelp-5	LSTM-reg (single moedl)	Accuracy: 68.7%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning