HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Universal Sentence Encoder

Daniel Cer; Yinfei Yang; Sheng-yi Kong; Nan Hua; Nicole Limtiaco; Rhomni St. John; Noah Constant; Mario Guajardo-Cespedes; Steve Yuan; Chris Tar; Yun-Hsuan Sung; Brian Strope; Ray Kurzweil

Universal Sentence Encoder

Abstract

We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity, resource consumption, the availability of transfer task training data, and task performance. Comparisons are made with baselines that use word level transfer learning via pretrained word embeddings as well as baselines do not use any transfer learning. We find that transfer learning using sentence embeddings tends to outperform word level transfer. With transfer learning via sentence embeddings, we observe surprisingly good performance with minimal amounts of supervised training data for a transfer task. We obtain encouraging results on Word Embedding Association Tests (WEAT) targeted at detecting model bias. Our pre-trained sentence encoding models are made freely available for download and on TF Hub.

Code Repositories

facebookresearch/InferSent
pytorch
Mentioned in GitHub
ncbi-nlp/BioSentVec
Mentioned in GitHub
joseph-bongo-220/TV_NLP_Project
tf
Mentioned in GitHub
krisbukovi/document_similarity
tf
Mentioned in GitHub
textflint/textflint
Mentioned in GitHub
f-data/ADD
tf
Mentioned in GitHub
ppapalampidi/SUMMER
pytorch
Mentioned in GitHub
Alleansa/eluvio
tf
Mentioned in GitHub
facebookresearch/SentEval
pytorch
Mentioned in GitHub
ncbi-nlp/BioWordVec
Mentioned in GitHub
pathway/crosslang_embed
tf
Mentioned in GitHub
korymath/jann
tf
Mentioned in GitHub
contemn1/sentence_embeddings
tf
Mentioned in GitHub
ppapalampidi/GraphTP
pytorch
Mentioned in GitHub
weiyezhimeng/silent-guardian
pytorch
Mentioned in GitHub
jobdataexchange/competensor
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
conversational-response-selection-on-polyaiUSE
1-of-100 Accuracy: 47.7%
semantic-textual-similarity-on-sts-benchmarkUSE_T
Pearson Correlation: 0.782
sentiment-analysis-on-crUSE_T+CNN (w2v w.e.)
Accuracy: 87.45
sentiment-analysis-on-mpqaUSE_T+DAN (w2v w.e.)
Accuracy: 88.14
sentiment-analysis-on-mrUSE_T+CNN
Accuracy: 81.59
sentiment-analysis-on-sst-2-binaryUSE_T+CNN (lrn w.e.)
Accuracy: 87.21
subjectivity-analysis-on-subjUSE
Accuracy: 93.90
text-classification-on-trec-6USE_T+CNN
Error: 1.93

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp