Amalgamating Knowledge from Two Teachers for Task-oriented Dialogue System with Adversarial Training

Ruifeng Xu, Ying Shen, Chengming Li, Rui Yan, Min Yang, Wanwei He

Abstract

Task-oriented dialogue systems face a twofold challenge that is attracting increasing research attention: completing the task by querying the knowledge base (KB) and generating human-like responses. In this paper, we propose a "Two-Teacher One-Student" learning framework (TTOS) for task-oriented dialogue, with the goal of retrieving accurate KB entities and generating human-like responses simultaneously. TTOS amalgamates knowledge from two teacher networks that together provide comprehensive guidance for building a high-quality task-oriented dialogue system (the student network). Each teacher network is trained via reinforcement learning with a goal-specific reward, so it can be viewed as an expert on that goal that transfers its specialized knowledge to the student network. Instead of adopting classic student-teacher learning, which forces the output of the student network to exactly mimic the soft targets produced by the teacher networks, we introduce two discriminators, as in a generative adversarial network (GAN), to transfer knowledge from the two teachers to the student. The use of discriminators relaxes the rigid coupling between the student and the teachers. Extensive experiments on two benchmark datasets (i.e., CamRest and In-Car Assistant) demonstrate that TTOS significantly outperforms baseline methods.
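The abstract does not give the loss functions, so the following is only a minimal sketch of how a student objective with two discriminators could be assembled, assuming the standard GAN formulation in which each discriminator labels its teacher's output as real and the student's output as fake. The function names, the weights `weight_kb` and `weight_lang`, and the use of scalar discriminator logits are all hypothetical and are not taken from the paper.

```python
import math


def softplus(z):
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def discriminator_loss(teacher_logit, student_logit):
    """Standard GAN discriminator loss (assumed, not from the paper).

    The discriminator should score the teacher's output as real (high logit)
    and the student's output as fake (low logit).
    Equals -log(sigmoid(teacher_logit)) - log(1 - sigmoid(student_logit)).
    """
    return softplus(-teacher_logit) + softplus(student_logit)


def student_adversarial_loss(student_logit):
    """Non-saturating generator loss: the student is rewarded when the
    discriminator mistakes its output for the teacher's.
    Equals -log(sigmoid(student_logit)).
    """
    return softplus(-student_logit)


def student_total_loss(task_loss, kb_logit, lang_logit,
                       weight_kb=1.0, weight_lang=1.0):
    """Hypothetical combined objective: the student's own task loss plus one
    adversarial term per teacher (KB-entity teacher, language-quality teacher).
    """
    return (task_loss
            + weight_kb * student_adversarial_loss(kb_logit)
            + weight_lang * student_adversarial_loss(lang_logit))
```

Under these assumptions the student is never asked to match a teacher's soft targets exactly; it only has to produce outputs that each discriminator cannot tell apart from its teacher's, which is one way to read the "relaxed coupling" described in the abstract.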

Benchmarks

Benchmark | Methodology | Metrics
task-oriented-dialogue-systems-on-kvret | TTOS | BLEU: 17.35; Entity F1: 55.38
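The page does not say how Entity F1 is computed. As a rough illustration, the sketch below uses a common definition: a micro-averaged F1 over the sets of KB entities found in each generated response versus its reference response. The function name `entity_f1` and the example entities are hypothetical, and the paper's actual evaluation script may differ.

```python
def entity_f1(predicted, gold):
    """Micro-averaged entity F1 over a list of dialogue turns.

    predicted, gold: lists of equal length; each item is an iterable of the
    entity strings found in the generated / reference response for one turn.
    Returns a fraction in [0, 1].
    """
    tp = fp = fn = 0
    for pred_entities, gold_entities in zip(predicted, gold):
        pred_set, gold_set = set(pred_entities), set(gold_entities)
        tp += len(pred_set & gold_set)   # entities correctly produced
        fp += len(pred_set - gold_set)   # produced but not in the reference
        fn += len(gold_set - pred_set)   # in the reference but missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical two-turn example: one wrong area entity in the first turn.
score = entity_f1(
    predicted=[["pizza_hut", "north"], ["cheap"]],
    gold=[["pizza_hut", "centre"], ["cheap"]],
)
```

Scores listed in percentage style would correspond to this fraction multiplied by 100, assuming the same definition is in use.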
