HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Automatic Model Selection with Large Language Models for Reasoning

James Xu Zhao Yuxi Xie Kenji Kawaguchi Junxian He Michael Qizhe Xie

Automatic Model Selection with Large Language Models for Reasoning

Abstract

Chain-of-Thought (CoT) and Program-Aided Language Models (PAL) represent two distinct reasoning methods, each with its own strengths. CoT employs natural language, offering flexibility and interpretability, while PAL utilizes programming language, yielding more structured and rigorous logic. We introduce a model selection method to combine the best of both worlds by employing a large language model (LLM) to dynamically select between them. Our theoretical analysis underscores the feasibility of this method, which is further corroborated by empirical results. Our proposed method demonstrates significant performance improvements across eight reasoning datasets with Codex, ChatGPT, and GPT-4. Additionally, our method is complementary to self-consistency; when integrated, it can further enhance performance while significantly reducing computation costs. Moreover, we achieve new state-of-the-art results on GSM8K and SVAMP, with respective accuracies of 96.8% and 93.7%. Our code, data and prompts are available at https://github.com/XuZhao0/Model-Selection-Reasoning

Code Repositories

xuzhao0/model-selection-reasoning
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
math-word-problem-solving-on-svampGPT-4 (Model Selection)
Execution Accuracy: 93.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp