HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Knowledge Enhanced Contextual Word Representations

Matthew E. Peters Mark Neumann Robert L. Logan IV Roy Schwartz Vidur Joshi Sameer Singh Noah A. Smith

Knowledge Enhanced Contextual Word Representations

Abstract

Contextual word representations, typically trained on unstructured, unlabeled text, do not contain any explicit grounding to real world entities and are often unable to remember facts about those entities. We propose a general method to embed multiple knowledge bases (KBs) into large scale models, and thereby enhance their representations with structured, human-curated knowledge. For each KB, we first use an integrated entity linker to retrieve relevant entity embeddings, then update contextual word representations via a form of word-to-entity attention. In contrast to previous approaches, the entity linkers and self-supervised language modeling objective are jointly trained end-to-end in a multitask setting that combines a small amount of entity linking supervision with a large amount of raw text. After integrating WordNet and a subset of Wikipedia into BERT, the knowledge enhanced BERT (KnowBert) demonstrates improved perplexity, ability to recall facts as measured in a probing task and downstream performance on relationship extraction, entity typing, and word sense disambiguation. KnowBert's runtime is comparable to BERT's and it scales to large KBs.

Code Repositories

allenai/kb
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
entity-linking-on-aida-conllPeters et al. (2019)
Micro-F1 strong: 73.7
relation-classification-on-tacred-1KnowBERT
F1: 71.5
relation-extraction-on-semeval-2010-task-8KnowBert-W+W
F1: 89.1
relation-extraction-on-tacredKnowBert-W+W
F1: 71.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp