4 months ago

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Zhilin Yang; Zihang Dai; Yiming Yang; Jaime Carbonell; Ruslan Salakhutdinov; Quoc V. Le

Abstract

With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling. However, relying on corrupting the input with masks, BERT neglects dependency between the masked positions and suffers from a pretrain-finetune discrepancy. In light of these pros and cons, we propose XLNet, a generalized autoregressive pretraining method that (1) enables learning bidirectional contexts by maximizing the expected likelihood over all permutations of the factorization order and (2) overcomes the limitations of BERT thanks to its autoregressive formulation. Furthermore, XLNet integrates ideas from Transformer-XL, the state-of-the-art autoregressive model, into pretraining. Empirically, under comparable experiment settings, XLNet outperforms BERT on 20 tasks, often by a large margin, including question answering, natural language inference, sentiment analysis, and document ranking.

Code Repositories

tomgoter/nlp_finalproject

Mentioned in GitHub

fanchenyou/transformer-study

pytorch

Mentioned in GitHub

SambhawDrag/XLNet.jl

pytorch

Mentioned in GitHub

2miatran/Natural-Language-Processing

Mentioned in GitHub

NathanDuran/Sentence-Encoding-for-DA-Classification

Mentioned in GitHub

graykode/xlnet-Pytorch

pytorch

Mentioned in GitHub

https-seyhan/BugAI

Mentioned in GitHub

MindCode-4/code-5/tree/main/xlnet

mindspore

facebookresearch/anli

pytorch

Mentioned in GitHub

pauldevos/python-notes

pytorch

Mentioned in GitHub

jonahwinninghoff/Text-Summarization

Mentioned in GitHub

pwc-1/Paper-9/tree/main/5/xlnet

mindspore

pwc-1/Paper-9/tree/main/1/xlnet

mindspore

lvyufeng/bert4ms/blob/master/bert4ms/models/xlnet.py

mindspore

zihangdai/xlnet

Official

Mentioned in GitHub

cuhksz-nlp/SAPar

pytorch

Mentioned in GitHub

listenviolet/XLNet

pytorch

Mentioned in GitHub

joshuaWang-bit/Textclassification-pytorch

pytorch

Mentioned in GitHub

huggingface/transformers

pytorch

Mentioned in GitHub

chesterdu/contrastive_summary

pytorch

Mentioned in GitHub

samwisegamjeee/pytorch-transformers

pytorch

Mentioned in GitHub

PaddlePaddle/PaddleNLP/tree/develop/examples/language_model/xlnet

paddle

MS-P3/code7/tree/main/xlnet

mindspore

kaushaltrivedi/fast-bert

pytorch

Mentioned in GitHub

utterworks/fast-bert

pytorch

Mentioned in GitHub

zaradana/Fast_BERT

pytorch

Mentioned in GitHub

huggingface/xlnet

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
document-ranking-on-clueweb09-b	XLNet	ERR@20: 20.28 nDCG@20: 31.10
humor-detection-on-200k-short-texts-for-humor-1	XLNet Large Cased	F1-score: 0.920
linguistic-acceptability-on-cola	XLNet (single model)	Accuracy: 69%
natural-language-inference-on-anli-test	XLNet (Large)	A1: 70.3 A2: 50.9 A3: 49.4
natural-language-inference-on-multinli	XLNet (single model)	Matched: 90.8
natural-language-inference-on-qnli	XLNet (single model)	Accuracy: 94.9%
natural-language-inference-on-rte	XLNet (single model)	Accuracy: 85.9%
natural-language-inference-on-wnli	XLNet	Accuracy: 92.5
paraphrase-identification-on-quora-question	XLNet-Large (ensemble)	Accuracy: 90.3 F1: 74.2
question-answering-on-quora-question-pairs	XLNet (single model)	Accuracy: 92.3%
question-answering-on-race	XLNet	RACE: 81.75 RACE-m: 85.45
question-answering-on-squad11	XLNet (single model)	EM: 89.898 F1: 95.080 Hardware Burden: 46449G
question-answering-on-squad11-dev	XLNet (single model)	EM: 89.7 F1: 95.1
question-answering-on-squad20	XLNet (single model)	EM: 87.926 F1: 90.689
question-answering-on-squad20-dev	XLNet (single model)	EM: 87.9 F1: 90.6
reading-comprehension-on-race	XLNet	Accuracy (High): 84.0 Accuracy (Middle): 88.6
semantic-textual-similarity-on-mrpc	XLNet (single model)	Accuracy: 90.8%
semantic-textual-similarity-on-senteval	XLNet-Large	MRPC: 93.0/90.7 SICK-E: - SICK-R: - STS: 91.6/91.1*
semantic-textual-similarity-on-sts-benchmark	XLNet (single model)	Pearson Correlation: 0.925
sentiment-analysis-on-imdb	XLNet	Accuracy: 96.21
sentiment-analysis-on-sst-2-binary	XLNet-Large (ensemble)	Accuracy: 96.8
sentiment-analysis-on-sst-2-binary	XLNet (single model)	Accuracy: 97
sentiment-analysis-on-yelp-binary	XLNet	Error: 1.37
sentiment-analysis-on-yelp-fine-grained	XLNet	Error: 27.05
text-classification-on-ag-news	XLNet	Error: 4.45
text-classification-on-amazon-2	XLNet	Error: 2.11
text-classification-on-amazon-5	XLNet	Error: 31.67
text-classification-on-dbpedia	XLNet	Error: 0.62
text-classification-on-yelp-2	XLNet	Accuracy: 98.63%
text-classification-on-yelp-5	XLNet	Accuracy: 72.95%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Zhilin Yang; Zihang Dai; Yiming Yang; Jaime Carbonell; Ruslan Salakhutdinov; Quoc V. Le

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters