HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information

Yen-Ting Lin Alexandros Papangelis Seokhwan Kim Sungjin Lee Devamanyu Hazarika Mahdi Namazifar Di Jin Yang Liu Dilek Hakkani-Tur

Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information

Abstract

This work focuses on in-context data augmentation for intent detection. Having found that augmentation via in-context prompting of large pre-trained language models (PLMs) alone does not improve performance, we introduce a novel approach based on PLMs and pointwise V-information (PVI), a metric that can measure the usefulness of a datapoint for training a model. Our method first fine-tunes a PLM on a small seed of training data and then synthesizes new datapoints - utterances that correspond to given intents. It then employs intent-aware filtering, based on PVI, to remove datapoints that are not helpful to the downstream intent classifier. Our method is thus able to leverage the expressive power of large language models to produce diverse training data. Empirical results demonstrate that our method can produce synthetic training data that achieve state-of-the-art performance on three challenging intent detection datasets under few-shot settings (1.28% absolute improvement in 5-shot and 1.18% absolute in 10-shot, on average) and perform on par with the state-of-the-art in full-shot settings (within 0.01% absolute, on average).

Benchmarks

BenchmarkMethodologyMetrics
intent-detection-on-banking77RoBERTa-Large + ICDA
Accuracy (%): 94.42
intent-detection-on-banking77-10-shotRoBERTa-Large + ICDA
Accuracy (%): 89.79
intent-detection-on-banking77-5-shotRoBERTa-Large + ICDA
Accuracy (%): 84.01
intent-detection-on-clinc150RoBERTa-Large + ICDA
Accuracy (%): 97.12
intent-detection-on-clinc150-10-shotRoBERTa-Large + ICDA
Accuracy (%): 94.84
intent-detection-on-clinc150-5-shotRoBERTa-Large + ICDA
Accuracy (%): 92.62
intent-detection-on-hwu64RoBERTa-Large + ICDA
Accuracy (%): 92.57
intent-detection-on-hwu64-10-shotRoBERTa-Large + ICDA
Accuracy (%): 87.41
intent-detection-on-hwu64-5-shotRoBERTa-Large + ICDA
Accuracy (%): 82.45
text-classification-on-banking77RoBERTa-Large + ICDA
Accuracy: 94.42

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp