HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model

Alexander R. Fabbri; Irene Li; Tianwei She; Suyi Li; Dragomir R. Radev

Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model

Abstract

Automatic generation of summaries from multiple news articles is a valuable tool as the number of online publications grows rapidly. Single document summarization (SDS) systems have benefited from advances in neural encoder-decoder model thanks to the availability of large datasets. However, multi-document summarization (MDS) of news articles has been limited to datasets of a couple of hundred examples. In this paper, we introduce Multi-News, the first large-scale MDS news dataset. Additionally, we propose an end-to-end model which incorporates a traditional extractive summarization model with a standard SDS model and achieves competitive results on MDS datasets. We benchmark several methods on Multi-News and release our data and code in hope that this work will promote advances in summarization in the multi-document setting.

Code Repositories

Alex-Fabbri/Multi-News
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
multi-document-summarization-on-multi-newsHi-MAP
ROUGE-1: 43.47
ROUGE-2: 14.89
ROUGE-SU4: 17.41

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp