HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Identifying Well-formed Natural Language Questions

Manaal Faruqui; Dipanjan Das

Identifying Well-formed Natural Language Questions

Abstract

Understanding search queries is a hard problem as it involves dealing with "word salad" text ubiquitously issued by users. However, if a query resembles a well-formed question, a natural language processing pipeline is able to perform more accurate interpretation, thus reducing downstream compounding errors. Hence, identifying whether or not a query is well formed can enhance query understanding. Here, we introduce a new task of identifying a well-formed natural language question. We construct and release a dataset of 25,100 publicly available questions classified into well-formed and non-wellformed categories and report an accuracy of 70.7% on the test set. We also show that our classifier can be used to improve the performance of neural sequence-to-sequence models for generating questions for reading comprehension.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
query-wellformedness-on-query-wellformednessword-1, 2 POS-1, 2, 3
Accuracy: 70.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Identifying Well-formed Natural Language Questions | Papers | HyperAI