HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction

Keshav Santhanam Omar Khattab Jon Saad-Falcon Christopher Potts Matei Zaharia

ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction

Abstract

Neural information retrieval (IR) has greatly advanced search and other knowledge-intensive language tasks. While many neural IR methods encode queries and documents into single-vector representations, late interaction models produce multi-vector representations at the granularity of each token and decompose relevance modeling into scalable token-level computations. This decomposition has been shown to make late interaction more effective, but it inflates the space footprint of these models by an order of magnitude. In this work, we introduce ColBERTv2, a retriever that couples an aggressive residual compression mechanism with a denoised supervision strategy to simultaneously improve the quality and space footprint of late interaction. We evaluate ColBERTv2 across a wide range of benchmarks, establishing state-of-the-art quality within and outside the training domain while reducing the space footprint of late interaction models by 6--10$\times$.

Code Repositories

stanford-futuredata/ColBERT
Official
pytorch
Mentioned in GitHub
thakur-nandan/beir-colbert
pytorch
Mentioned in GitHub
stanford-futuredata/Baleen
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
passage-retrieval-on-peerqaColBERTv2
MRR: 0.4122
Recall@10: 0.6371

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp