Rethinking Complex Neural Network Architectures for Document Classification

Ashutosh Adhikari, Achyudh Ram, Raphael Tang, Jimmy Lin

Abstract

Neural network models for many NLP tasks have grown increasingly complex in recent years, making training and deployment more difficult. A number of recent papers have questioned the necessity of such architectures and found that well-executed, simpler models are quite effective. We show that this is also the case for document classification: in a large-scale reproducibility study of several recent neural models, we find that a simple BiLSTM architecture with appropriate regularization yields accuracy and F1 that either are competitive with or exceed the state of the art on four standard benchmark datasets. Surprisingly, our simple model achieves these results without attention mechanisms. While these regularization techniques, borrowed from language modeling, are not novel, to our knowledge we are the first to apply them in this context. Our work provides an open-source platform and the foundation for future work in document classification.

Benchmarks

Benchmark | Methodology | Metrics
document-classification-on-imdb-m | LSTM-reg (single model) | Accuracy: 52.8
document-classification-on-reuters-21578 | LSTM-reg (single model) | F1: 87.0
text-classification-on-yelp-5 | LSTM-reg (single model) | Accuracy: 68.7
