Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, Denny Zhou

Abstract

We explore how generating a chain of thought, a series of intermediate reasoning steps, significantly improves the ability of large language models to perform complex reasoning. In particular, we show how such reasoning abilities emerge naturally in sufficiently large language models via a simple method called chain-of-thought prompting, where a few chain-of-thought demonstrations are provided as exemplars in prompting. Experiments on three large language models show that chain-of-thought prompting improves performance on a range of arithmetic, commonsense, and symbolic reasoning tasks. The empirical gains can be striking. For instance, prompting a 540B-parameter language model with just eight chain-of-thought exemplars achieves state-of-the-art accuracy on the GSM8K benchmark of math word problems, surpassing even finetuned GPT-3 with a verifier.
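
The prompting method described in the abstract can be sketched as plain string assembly: each exemplar pairs a question with worked intermediate steps and a final answer, and the new question is appended with its answer left open for the model to complete. This is a minimal sketch, not the authors' code. The names `Exemplar`, `build_cot_prompt`, and `extract_answer`, the `Q:`/`A:` layout, the "The answer is" marker, and the two exemplars are all assumptions made for illustration, and no model is called.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Exemplar:
    """One few-shot demonstration (hypothetical structure for this sketch)."""
    question: str
    chain_of_thought: str  # the intermediate reasoning steps, written out
    answer: str


# Assumed marker used to separate the reasoning from the final answer.
ANSWER_MARKER = "The answer is "

# Illustrative exemplars written for this sketch, not taken from the paper.
EXEMPLARS = [
    Exemplar(
        question="A shelf holds 3 books and 4 more are added. "
                 "How many books are on the shelf?",
        chain_of_thought="The shelf starts with 3 books. "
                         "Adding 4 more gives 3 + 4 = 7.",
        answer="7",
    ),
    Exemplar(
        question="A box has 10 pens and 6 are removed. How many pens remain?",
        chain_of_thought="The box starts with 10 pens. "
                         "Removing 6 leaves 10 - 6 = 4.",
        answer="4",
    ),
]


def build_cot_prompt(exemplars: List[Exemplar], question: str) -> str:
    """Concatenate worked exemplars, then the new question with an open answer."""
    blocks = [
        f"Q: {ex.question}\nA: {ex.chain_of_thought} {ANSWER_MARKER}{ex.answer}."
        for ex in exemplars
    ]
    blocks.append(f"Q: {question}\nA:")
    return "\n\n".join(blocks)


def extract_answer(completion: str) -> Optional[str]:
    """Return the token after the last answer marker, or None if it is absent."""
    idx = completion.rfind(ANSWER_MARKER)
    if idx == -1:
        return None
    tokens = completion[idx + len(ANSWER_MARKER):].split()
    return tokens[0].rstrip(".,") if tokens else None


if __name__ == "__main__":
    print(build_cot_prompt(EXEMPLARS, "A jar has 8 coins and 5 more are added. "
                                      "How many coins are in the jar?"))
```

The resulting string would be sent to whichever language model is under test, and `extract_answer` would be applied to the completion. A standard few-shot baseline would use the same layout with the `chain_of_thought` text left out, so the two prompts differ only in the written-out reasoning.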

Code Repositories

thudm/chatglm2-6b (pytorch)
mbzuai-clear/ioe-prompting
thu-keg/korc (pytorch)
imnearth/coat
mrlab-ai/NL2Plan
scofield7419/thor-isa (pytorch)
srush/minichain (pytorch)
TianduoWang/MsAT (pytorch)
rlqja1107/torch-LLM4SGG (pytorch)
sunlab-osu/understanding-cot (pytorch)
infini-ai-lab/sirius (pytorch)
yinzhangyue/eot (pytorch)
microsoft/guidance
nicolay-r/thor-ecac (pytorch)
lupantech/chameleon-llm
guidance-ai/guidance
yinzhangyue/AoR

Benchmarks

Benchmark                                  Methodology               Metrics
common-sense-reasoning-on-commonsenseqa    Chain of thought ASDiv    Accuracy: 28.6
question-answering-on-webquestions         CoT                       EM: 42.5
