A Simple and Effective Model for Answering Multi-span Questions

Elad Segal, Avia Efrat, Mor Shoham, Amir Globerson, Jonathan Berant

Abstract

Models for reading comprehension (RC) commonly restrict their output space to the set of all single contiguous spans from the input, in order to alleviate the learning problem and avoid the need for a model that generates text explicitly. However, forcing an answer to be a single span can be restrictive, and some recent datasets also include multi-span questions, i.e., questions whose answer is a set of non-contiguous spans in the text. Naturally, models that return single spans cannot answer these questions. In this work, we propose a simple architecture for answering multi-span questions by casting the task as a sequence tagging problem, namely, predicting for each input token whether it should be part of the output or not. Our model substantially improves performance on span extraction questions from DROP and Quoref by 9.9 and 5.5 EM points respectively.
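
To illustrate the sequence tagging formulation, the sketch below shows how per-token decisions could be turned into a multi-span answer. It is a minimal illustration, not the authors' implementation: the binary in/out tag scheme, the function name decode_spans, and the example sentence are all assumptions made for this sketch, and the tags are hard-coded rather than predicted by a model.

```python
from typing import List


def decode_spans(tokens: List[str], tags: List[int]) -> List[str]:
    """Group maximal runs of tokens tagged 1 into answer spans.

    Hypothetical helper: tag 1 means "part of the answer", tag 0 means
    "not part of the answer". This binary scheme is an assumption.
    """
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")

    spans: List[str] = []
    current: List[str] = []
    for token, tag in zip(tokens, tags):
        if tag == 1:
            # Extend the span that is currently open (or start a new one).
            current.append(token)
        elif current:
            # A 0 tag closes the open span.
            spans.append(" ".join(current))
            current = []
    if current:
        # Close a span that runs to the end of the input.
        spans.append(" ".join(current))
    return spans


# Made-up example; in practice the tags would come from a trained tagger.
tokens = "The game was won by Tom Brady and Julian Edelman in overtime".split()
tags = [0, 0, 0, 0, 0, 1, 1, 0, 1, 1, 0, 0]
answer = decode_spans(tokens, tags)
print(answer)  # ['Tom Brady', 'Julian Edelman']
```

Because each token is tagged independently of how many spans the answer has, the same decoding step returns one span, several non-contiguous spans, or none, which a single-span output space cannot express.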

Code Repositories

eladsegal/project-NLP-AML (mentioned in GitHub)
j30206868/numnet-chinese (PyTorch; mentioned in GitHub)
llamazing/numnet_plus (PyTorch; mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
question-answering-on-drop-test | TASE-BERT | F1: 80.7
