5 months ago

A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations

Hui Ma; Jian Wang; Hongfei Lin; Bo Zhang; Yijia Zhang; Bo Xu

Abstract

Emotion recognition in conversations (ERC), the task of recognizing the emotion of each utterance in a conversation, is crucial for building empathetic machines. Existing studies focus mainly on capturing context- and speaker-sensitive dependencies on the textual modality but ignore the significance of multimodal information. Different from emotion recognition in textual conversations, capturing intra- and inter-modal interactions between utterances, learning weights between different modalities, and enhancing modal representations play important roles in multimodal ERC. In this paper, we propose a transformer-based model with self-distillation (SDT) for the task. The transformer-based model captures intra- and inter-modal interactions by utilizing intra- and inter-modal transformers, and learns weights between modalities dynamically by designing a hierarchical gated fusion strategy. Furthermore, to learn more expressive modal representations, we treat soft labels of the proposed model as extra training supervision. Specifically, we introduce self-distillation to transfer knowledge of hard and soft labels from the proposed model to each modality. Experiments on IEMOCAP and MELD datasets demonstrate that SDT outperforms previous state-of-the-art baselines.

Code Repositories

butterfliesss/sdt

Official

pytorch

Benchmarks

Benchmark	Methodology	Metrics
emotion-recognition-in-conversation-on	SDT	Accuracy: 73.95 Weighted-F1: 74.08
emotion-recognition-in-conversation-on-meld	SDT	Accuracy: 67.55 Weighted-F1: 66.60

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette