HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Learn to Copy from the Copying History: Correlational Copy Network for Abstractive Summarization

{BoWen Zhou Xiaodong He Youzheng Wu Yujia Wang Peng Yuan Song Xu Haoran Li}

Learn to Copy from the Copying History: Correlational Copy Network for Abstractive Summarization

Abstract

The copying mechanism has had considerable success in abstractive summarization, facilitating models to directly copy words from the input text to the output summary. Existing works mostly employ encoder-decoder attention, which applies copying at each time step independently of the former ones. However, this may sometimes lead to incomplete copying. In this paper, we propose a novel copying scheme named Correlational Copying Network (CoCoNet) that enhances the standard copying mechanism by keeping track of the copying history. It thereby takes advantage of prior copying distributions and, at each time step, explicitly encourages the model to copy the input word that is relevant to the previously copied one. In addition, we strengthen CoCoNet through pre-training with suitable corpora that simulate the copying behaviors. Experimental results show that CoCoNet can copy more accurately and achieves new state-of-the-art performances on summarization benchmarks, including CNN/DailyMail for news summarization and SAMSum for dialogue summarization. The code and checkpoint will be publicly available.

Benchmarks

BenchmarkMethodologyMetrics
abstractive-text-summarization-on-cnn-dailyCoCoNet + CoCoPretrain
ROUGE-1: 44.50
ROUGE-2: 21.55
ROUGE-L: 41.24
abstractive-text-summarization-on-cnn-dailyCoCoNet
ROUGE-1: 44.39
ROUGE-2: 21.41
ROUGE-L: 41.05

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Learn to Copy from the Copying History: Correlational Copy Network for Abstractive Summarization | Papers | HyperAI