HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

FooDI-ML: a large multi-language dataset of food, drinks and groceries images and descriptions

David Amat Olóndriz Ponç Palau Puigdevall Adrià Salvador Palau

FooDI-ML: a large multi-language dataset of food, drinks and groceries images and descriptions

Abstract

In this paper we introduce the FooDI-ML dataset. This dataset contains over 1.5M unique images and over 9.5M store names, product names descriptions, and collection sections gathered from the Glovo application. The data made available corresponds to food, drinks and groceries products from 37 countries in Europe, the Middle East, Africa and Latin America. The dataset comprehends 33 languages, including 870K samples of languages of countries from Eastern Europe and Western Asia such as Ukrainian and Kazakh, which have been so far underrepresented in publicly available visio-linguistic datasets. The dataset also includes widely spoken languages such as Spanish and English. To assist further research, we include benchmarks over two tasks: text-image retrieval and conditional image generation.

Code Repositories

glovo/foodi-ml-dataset
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-retrieval-on-foodi-ml-globalADAPT-I2T
A-R@1: 0.005
A-R@10: 0.05
A-R@5: 0.02
Re-R@1: 0.01
Re-R@10: 0.045
Re-R@5: 0.03
image-retrieval-on-foodi-ml-spainADAPT-I2T
A-R@1: 0.93
A-R@10: 5.8
A-R@5: 3.33
Re-R@1: 0.73
Re-R@10: 5.67
Re-R@5: 2.93

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp