Command Palette
Search for a command to run...
EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers
{Gahgene Gweon Sangkyu Rhim Kyung Seo Ki Bugeun Kim}

Abstract
In this paper, we propose a neural model EPT-X (Expression-Pointer Transformer with Explanations), which utilizes natural language explanations to solve an algebraic word problem. To enhance the explainability of the encoding process of a neural model, EPT-X adopts the concepts of plausibility and faithfulness which are drawn from math word problem solving strategies by humans. A plausible explanation is one that includes contextual information for the numbers and variables that appear in a given math word problem. A faithful explanation is one that accurately represents the reasoning process behind the model’s solution equation. The EPT-X model yields an average baseline performance of 69.59% on our PEN dataset and produces explanations with quality that is comparable to human output. The contribution of this work is two-fold. (1) EPT-X model: An explainable neural model that sets a baseline for algebraic word problem solving task, in terms of model’s correctness, plausibility, and faithfulness. (2) New dataset: We release a novel dataset PEN (Problems with Explanations for Numbers), which expands the existing datasets by attaching explanations to each number/variable.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| math-word-problem-solving-on-alg514 | EPT-X | Accuracy (%): 67.07 |
| math-word-problem-solving-on-alg514 | EPT | Accuracy (%): 73.91 |
| math-word-problem-solving-on-draw-1k | EPT-X | Accuracy (%): 56 |
| math-word-problem-solving-on-draw-1k | EPT | Accuracy (%): 63.5 |
| math-word-problem-solving-on-mawps | EPT | Accuracy (%): 88.7 |
| math-word-problem-solving-on-mawps | EPT-X | Accuracy (%): 84.57 |
| math-word-problem-solving-on-pen | EPT-X | Accuracy (%): 69.59 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.