HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Efficient Passage Retrieval with Hashing for Open-domain Question Answering

Ikuya Yamada Akari Asai Hannaneh Hajishirzi

Efficient Passage Retrieval with Hashing for Open-domain Question Answering

Abstract

Most state-of-the-art open-domain question answering systems use a neural retrieval model to encode passages into continuous vectors and extract them from a knowledge source. However, such retrieval models often require large memory to run because of the massive size of their passage index. In this paper, we introduce Binary Passage Retriever (BPR), a memory-efficient neural retrieval model that integrates a learning-to-hash technique into the state-of-the-art Dense Passage Retriever (DPR) to represent the passage index using compact binary codes rather than continuous vectors. BPR is trained with a multi-task objective over two tasks: efficient candidate generation based on binary codes and accurate reranking based on continuous vectors. Compared with DPR, BPR substantially reduces the memory cost from 65GB to 2GB without a loss of accuracy on two standard open-domain question answering benchmarks: Natural Questions and TriviaQA. Our code and trained models are available at https://github.com/studio-ousia/bpr.

Code Repositories

studio-ousia/bpr
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
open-domain-question-answering-on-naturalBPR (linear scan; l=1000)
Exact Match: 41.6
open-domain-question-answering-on-tqaBPR (linear scan; l=1000)
Exact Match: 56.8
question-answering-on-natural-questions-longBPR (linear scan; l=1000)
EM: 41.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp