HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference

Boyuan Pan; Yazheng Yang; Zhou Zhao; Yueting Zhuang; Deng Cai; Xiaofei He

Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference

Abstract

Natural Language Inference (NLI), also known as Recognizing Textual Entailment (RTE), is one of the most important problems in natural language processing. It requires to infer the logical relationship between two given sentences. While current approaches mostly focus on the interaction architectures of the sentences, in this paper, we propose to transfer knowledge from some important discourse markers to augment the quality of the NLI model. We observe that people usually use some discourse markers such as "so" or "but" to represent the logical relationship between two sentences. These words potentially have deep connections with the meanings of the sentences, thus can be utilized to help improve the representations of them. Moreover, we use reinforcement learning to optimize a new objective function with a reward defined by the property of the NLI datasets to make full use of the labels information. Experiments show that our method achieves the state-of-the-art performance on several large-scale datasets.

Code Repositories

ZJULearning/DMP
Official
tf

Benchmarks

BenchmarkMethodologyMetrics
natural-language-inference-on-snli300D DMAN
% Test Accuracy: 88.8
% Train Accuracy: 95.4
Parameters: 9.2m
natural-language-inference-on-snli300D DMAN Ensemble
% Test Accuracy: 89.6
% Train Accuracy: 96.1
Parameters: 79m

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp