Command Palette
Search for a command to run...
{Li-Rong Dai Yi-Zhe Song Yongxin Yang Si Wei Jun Du Jianshu Zhang}
Abstract
Recent encoder-decoder approaches typically employ string decoders to convert images into serialized strings for image-to-markup. However, for tree-structured representational markup, string representations can hardly cope with the structural complexity. In this work, we first show via a set of toy problems that string decoders struggle to decode tree structures, especially as structural complexity increases. We then propose a tree-structured decoder that specifically aims at generating a tree-structured markup. Our decoders works sequentially, where at each step a child node and its parent node are simultaneously generated to form a sub-tree. This sub-tree is consequently used to construct the final tree structure in a recurrent manner. Key to the success of our tree decoder is twofold, (i) it strictly respects the parent-child relationship of trees, and (ii) it explicitly outputs trees as oppose to a linear string. Evaluated on both math formula recognition and chemical formula recognition, the proposed tree decoder is shown to greatly outperform strong string decoder baselines.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| handwritten-mathmatical-expression | TD | ExpRate: 49.1 |
| handwritten-mathmatical-expression-1 | TD | ExpRate: 48.5 |
| handwritten-mathmatical-expression-2 | TD | ExpRate: 51.4 |
| handwritten-mathmatical-expression-3 | TD | ExpRate: 62.6 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.