HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Image-based table recognition: data, model, and evaluation

Xu Zhong; Elaheh ShafieiBavani; Antonio Jimeno Yepes

Image-based table recognition: data, model, and evaluation

Abstract

Important information that relates to a specific topic in a document is often organized in tabular format to assist readers with information retrieval and comparison, which may be difficult to provide in natural language. However, tabular data in unstructured digital documents, e.g., Portable Document Format (PDF) and images, are difficult to parse into structured machine-readable format, due to complexity and diversity in their structure and style. To facilitate image-based table recognition with deep learning, we develop the largest publicly available table recognition dataset PubTabNet (https://github.com/ibm-aur-nlp/PubTabNet), containing 568k table images with corresponding structured HTML representation. PubTabNet is automatically generated by matching the XML and PDF representations of the scientific articles in PubMed Central Open Access Subset (PMCOA). We also propose a novel attention-based encoder-dual-decoder (EDD) architecture that converts images of tables into HTML code. The model has a structure decoder which reconstructs the table structure and helps the cell decoder to recognize cell content. In addition, we propose a new Tree-Edit-Distance-based Similarity (TEDS) metric for table recognition, which more appropriately captures multi-hop cell misalignment and OCR errors than the pre-established metric. The experiments demonstrate that the EDD model can accurately recognize complex tables solely relying on the image representation, outperforming the state-of-the-art by 9.7% absolute TEDS score.

Code Repositories

ibm-aur-nlp/PubTabNet
Official
Mentioned in GitHub
Line290/EDD-third-party
pytorch
Mentioned in GitHub
JiaquanYe/TableMASTER-mmocr
pytorch
Mentioned in GitHub
namtuanly/MTL-TabNet
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
table-recognition-on-pubtabnetEDD
TEDS (all samples): 88.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp