
Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms

Dinghan Shen; Guoyin Wang; Wenlin Wang; Martin Renqiang Min; Qinliang Su; Yizhe Zhang; Chunyuan Li; Ricardo Henao; Lawrence Carin


Abstract

Many deep learning architectures have been proposed to model the compositionality in text sequences, requiring a substantial number of parameters and expensive computations. However, there has not been a rigorous evaluation regarding the added value of sophisticated compositional functions. In this paper, we conduct a point-by-point comparative study between Simple Word-Embedding-based Models (SWEMs), consisting of parameter-free pooling operations, relative to word-embedding-based RNN/CNN models. Surprisingly, SWEMs exhibit comparable or even superior performance in the majority of cases considered. Based upon this understanding, we propose two additional pooling strategies over learned word embeddings: (i) a max-pooling operation for improved interpretability; and (ii) a hierarchical pooling operation, which preserves spatial (n-gram) information within text sequences. We present experiments on 17 datasets encompassing three tasks: (i) (long) document classification; (ii) text sequence matching; and (iii) short text tasks, including classification and tagging. The source code and datasets can be obtained from https://github.com/dinghanshen/SWEM.
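The parameter-free pooling operations described above can be sketched in a few lines of NumPy. This is an illustrative reconstruction, not the paper's official implementation: the function name `swem_pool`, the `mode` argument, and the default window size of 3 are assumptions for demonstration; consult the linked repository for the authors' code.

```python
import numpy as np

def swem_pool(embeddings, mode="concat", window=3):
    """Parameter-free pooling over a sequence of word embeddings.

    embeddings: array of shape (seq_len, emb_dim)
    mode: 'aver' | 'max' | 'concat' | 'hier'
    """
    avg = embeddings.mean(axis=0)   # SWEM-aver: element-wise average
    mx = embeddings.max(axis=0)     # SWEM-max: element-wise maximum
    if mode == "aver":
        return avg
    if mode == "max":
        return mx
    if mode == "concat":
        # SWEM-concat: concatenate the average- and max-pooled vectors
        return np.concatenate([avg, mx])
    if mode == "hier":
        # SWEM-hier: average-pool within local n-gram windows,
        # then max-pool across windows to keep spatial information
        n = len(embeddings)
        windows = [embeddings[i:i + window].mean(axis=0)
                   for i in range(max(1, n - window + 1))]
        return np.max(windows, axis=0)
    raise ValueError(f"unknown mode: {mode}")
```

Note that none of these operations introduce trainable parameters: the only learned quantities are the word embeddings themselves, which is what makes SWEMs so much cheaper than RNN/CNN encoders.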

Code Repositories

dinghanshen/SWEM (official; TensorFlow)
nyk510/scdv-python (mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
named-entity-recognition-ner-on-conll-2003 | SWEM-CRF | F1: 86.28
named-entity-recognition-on-conll-2000 | SWEM-CRF | F1: 90.34
natural-language-inference-on-multinli | SWEM-max | Matched: 68.2; Mismatched: 67.7
natural-language-inference-on-snli | SWEM-max | % Test Accuracy: 83.8
paraphrase-identification-on-msrp | SWEM-concat | Accuracy: 71.5; F1: 81.3
question-answering-on-quora-question-pairs | SWEM-concat | Accuracy: 83.03%
question-answering-on-wikiqa | SWEM-concat | MAP: 0.6788; MRR: 0.6908
sentiment-analysis-on-mr | SWEM-concat | Accuracy: 78.2
sentiment-analysis-on-sst-2-binary | SWEM-concat | Accuracy: 84.3
sentiment-analysis-on-sst-5-fine-grained | SWEM-concat | Accuracy: 46.1
sentiment-analysis-on-yelp-binary | SWEM-hier | Error: 4.19
sentiment-analysis-on-yelp-fine-grained | SWEM-hier | Error: 36.21
subjectivity-analysis-on-subj | SWEM-concat | Accuracy: 93
text-classification-on-ag-news | SWEM-concat | Error: 7.34
text-classification-on-dbpedia | SWEM-concat | Error: 1.43
text-classification-on-trec-6 | SWEM-aver | Error: 7.8
text-classification-on-yahoo-answers | SWEM-concat | Accuracy: 73.53
