5 months ago

Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization

Lei Xu; Mohammed Asad Karim; Saket Dingliwal; Aparna Elangovan

Abstract

Large language models (LLMs) can generate fluent summaries across domains using prompting techniques, reducing the need to train models for summarization applications. However, crafting effective prompts that guide LLMs to generate summaries with the appropriate level of detail and writing style remains a challenge. In this paper, we explore the use of salient information extracted from the source document to enhance summarization prompts. We show that adding keyphrases to prompts can improve ROUGE F1 and recall, making the generated summaries more similar to the reference and more complete. The number of keyphrases can control the precision-recall trade-off. Furthermore, our analysis reveals that incorporating phrase-level salient information is superior to word- or sentence-level information. However, the impact on hallucination is not universally positive across LLMs. To conduct this analysis, we introduce Keyphrase Signal Extractor (SigExt), a lightweight model that can be finetuned to extract salient keyphrases. By using SigExt, we achieve consistent ROUGE improvements across datasets and across open-weight and proprietary LLMs without any LLM customization. Our findings provide insights into leveraging salient information in building prompt-based summarization systems. We release our code at https://github.com/amazon-science/SigExt.
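To make the idea of keyphrase-augmented prompts concrete, the following is a minimal sketch of how extracted keyphrases might be appended to a summarization prompt, with the number of phrases `k` exposed as the knob the abstract associates with the precision-recall trade-off. The function name `build_prompt`, the prompt wording, and the `(phrase, score)` input format are all assumptions made for illustration; they are not taken from the SigExt repository.

```python
from typing import List, Tuple


def build_prompt(document: str,
                 scored_phrases: List[Tuple[str, float]],
                 k: int) -> str:
    """Build a summarization prompt that lists the k highest-scoring keyphrases.

    Hypothetical helper: `scored_phrases` stands in for the output of a
    keyphrase extractor (phrase text plus a salience score).
    """
    # Keep only the k most salient phrases; a larger k asks the LLM to
    # cover more content.
    top = sorted(scored_phrases, key=lambda p: p[1], reverse=True)[:k]
    phrases = "; ".join(phrase for phrase, _ in top)

    # Illustrative prompt template, not the one used in the paper.
    prompt = (
        f"Here is a document:\n{document}\n\n"
        "Write a concise summary of the document."
    )
    if phrases:
        prompt += f"\nMake sure the summary covers these key phrases: {phrases}"
    return prompt


if __name__ == "__main__":
    doc = "The city council approved the transit budget after a long debate."
    scored = [("transit budget", 0.9), ("long debate", 0.4), ("city council", 0.7)]
    print(build_prompt(doc, scored, k=2))
```

With `k=0` the sketch falls back to a plain summarization prompt, which gives a simple baseline to compare against when varying the number of keyphrases.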

Code Repositories

amazon-science/SigExt
Official
pytorch

Benchmarks

| Benchmark | Methodology | ROUGE-1 | ROUGE-L |
| --- | --- | --- | --- |
| abstractive-text-summarization-on-cnn-daily-2 | Claude Instant + SigExt | 42 | 26.6 |
| text-summarization-on-arxiv-summarization | Claude Instant + SigExt | 45.2 | 23.5 |
| text-summarization-on-meetingbank | Claude Instant + SigExt | 42.3 | 31.9 |
| text-summarization-on-samsum-corpus | Mistral 7B + SigExt | 44.1 | 33.9 |
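For readers unfamiliar with the metric, the sketch below shows how ROUGE-1 precision, recall, and F1 can be computed from unigram overlap. It is a simplified illustration that assumes lowercased whitespace tokenization and no stemming, so its values will not exactly match those of standard ROUGE toolkits or the scores listed above (which appear to be reported on a 0-100 scale). The function name `rouge1` is chosen here for illustration.

```python
from collections import Counter
from typing import Tuple


def rouge1(candidate: str, reference: str) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) based on clipped unigram overlap."""
    # Simplifying assumption: lowercase and split on whitespace.
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())

    # Multiset intersection clips each word's count to the smaller side.
    overlap = sum((cand & ref).values())

    precision = overlap / max(sum(cand.values()), 1)
    recall = overlap / max(sum(ref.values()), 1)
    f1 = 0.0 if overlap == 0 else 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    p, r, f = rouge1("the cat sat", "the cat sat on the mat")
    print(f"precision={p:.3f} recall={r:.3f} f1={f:.3f}")
```

In this example the short candidate is fully contained in the reference, so precision is high while recall is lower; a longer candidate that covers more of the reference would move the balance toward recall, which is the trade-off the abstract ties to the number of keyphrases.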
