HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Zhilin Yang; Zihang Dai; Yiming Yang; Jaime Carbonell; Ruslan Salakhutdinov; Quoc V. Le

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Abstract

With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling. However, relying on corrupting the input with masks, BERT neglects dependency between the masked positions and suffers from a pretrain-finetune discrepancy. In light of these pros and cons, we propose XLNet, a generalized autoregressive pretraining method that (1) enables learning bidirectional contexts by maximizing the expected likelihood over all permutations of the factorization order and (2) overcomes the limitations of BERT thanks to its autoregressive formulation. Furthermore, XLNet integrates ideas from Transformer-XL, the state-of-the-art autoregressive model, into pretraining. Empirically, under comparable experiment settings, XLNet outperforms BERT on 20 tasks, often by a large margin, including question answering, natural language inference, sentiment analysis, and document ranking.

Code Repositories

tomgoter/nlp_finalproject
tf
Mentioned in GitHub
fanchenyou/transformer-study
pytorch
Mentioned in GitHub
SambhawDrag/XLNet.jl
pytorch
Mentioned in GitHub
graykode/xlnet-Pytorch
pytorch
Mentioned in GitHub
https-seyhan/BugAI
Mentioned in GitHub
facebookresearch/anli
pytorch
Mentioned in GitHub
pauldevos/python-notes
pytorch
Mentioned in GitHub
zihangdai/xlnet
Official
tf
Mentioned in GitHub
cuhksz-nlp/SAPar
pytorch
Mentioned in GitHub
listenviolet/XLNet
pytorch
Mentioned in GitHub
huggingface/transformers
pytorch
Mentioned in GitHub
chesterdu/contrastive_summary
pytorch
Mentioned in GitHub
samwisegamjeee/pytorch-transformers
pytorch
Mentioned in GitHub
kaushaltrivedi/fast-bert
pytorch
Mentioned in GitHub
utterworks/fast-bert
pytorch
Mentioned in GitHub
zaradana/Fast_BERT
pytorch
Mentioned in GitHub
huggingface/xlnet
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
document-ranking-on-clueweb09-bXLNet
ERR@20: 20.28
nDCG@20: 31.10
humor-detection-on-200k-short-texts-for-humor-1XLNet Large Cased
F1-score: 0.920
linguistic-acceptability-on-colaXLNet (single model)
Accuracy: 69%
natural-language-inference-on-anli-testXLNet (Large)
A1: 70.3
A2: 50.9
A3: 49.4
natural-language-inference-on-multinliXLNet (single model)
Matched: 90.8
natural-language-inference-on-qnliXLNet (single model)
Accuracy: 94.9%
natural-language-inference-on-rteXLNet (single model)
Accuracy: 85.9%
natural-language-inference-on-wnliXLNet
Accuracy: 92.5
paraphrase-identification-on-quora-questionXLNet-Large (ensemble)
Accuracy: 90.3
F1: 74.2
question-answering-on-quora-question-pairsXLNet (single model)
Accuracy: 92.3%
question-answering-on-raceXLNet
RACE: 81.75
RACE-m: 85.45
question-answering-on-squad11XLNet (single model)
EM: 89.898
F1: 95.080
Hardware Burden: 46449G
question-answering-on-squad11-devXLNet (single model)
EM: 89.7
F1: 95.1
question-answering-on-squad20XLNet (single model)
EM: 87.926
F1: 90.689
question-answering-on-squad20-devXLNet (single model)
EM: 87.9
F1: 90.6
reading-comprehension-on-raceXLNet
Accuracy (High): 84.0
Accuracy (Middle): 88.6
semantic-textual-similarity-on-mrpcXLNet (single model)
Accuracy: 90.8%
semantic-textual-similarity-on-sentevalXLNet-Large
MRPC: 93.0/90.7
SICK-E: -
SICK-R: -
STS: 91.6/91.1*
semantic-textual-similarity-on-sts-benchmarkXLNet (single model)
Pearson Correlation: 0.925
sentiment-analysis-on-imdbXLNet
Accuracy: 96.21
sentiment-analysis-on-sst-2-binaryXLNet-Large (ensemble)
Accuracy: 96.8
sentiment-analysis-on-sst-2-binaryXLNet (single model)
Accuracy: 97
sentiment-analysis-on-yelp-binaryXLNet
Error: 1.37
sentiment-analysis-on-yelp-fine-grainedXLNet
Error: 27.05
text-classification-on-ag-newsXLNet
Error: 4.45
text-classification-on-amazon-2XLNet
Error: 2.11
text-classification-on-amazon-5XLNet
Error: 31.67
text-classification-on-dbpediaXLNet
Error: 0.62
text-classification-on-yelp-2XLNet
Accuracy: 98.63%
text-classification-on-yelp-5XLNet
Accuracy: 72.95%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp