Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, Denny Zhou

Abstract
We explore how generating a chain of thought -- a series of intermediate reasoning steps -- significantly improves the ability of large language models to perform complex reasoning. In particular, we show how such reasoning abilities emerge naturally in sufficiently large language models via a simple method called chain of thought prompting, where a few chain of thought demonstrations are provided as exemplars in prompting. Experiments on three large language models show that chain of thought prompting improves performance on a range of arithmetic, commonsense, and symbolic reasoning tasks. The empirical gains can be striking. For instance, prompting a 540B-parameter language model with just eight chain of thought exemplars achieves state-of-the-art accuracy on the GSM8K benchmark of math word problems, surpassing even finetuned GPT-3 with a verifier.
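To make the prompting setup concrete, the sketch below shows how a few chain of thought exemplars can be assembled into a single few-shot prompt. The exemplar and test question follow the style of the paper's worked math word problems; the function and variable names (`build_cot_prompt`, `COT_EXEMPLARS`) are illustrative, not from the paper's code, and the resulting prompt string would be passed to whichever language model API you use.

```python
# Minimal sketch of few-shot chain-of-thought prompting (assumed setup,
# not the paper's code). Each exemplar pairs a question with the
# intermediate reasoning steps that lead to the final answer.

COT_EXEMPLARS = [
    {
        "question": (
            "Roger has 5 tennis balls. He buys 2 more cans of tennis balls. "
            "Each can has 3 tennis balls. How many tennis balls does he have now?"
        ),
        "reasoning": (
            "Roger started with 5 balls. 2 cans of 3 tennis balls each is "
            "6 tennis balls. 5 + 6 = 11."
        ),
        "answer": "11",
    },
    # ... in the paper's GSM8K setup, eight such exemplars are used
]


def build_cot_prompt(new_question: str) -> str:
    """Concatenate chain-of-thought exemplars, then append the new question."""
    parts = []
    for ex in COT_EXEMPLARS:
        parts.append(
            f"Q: {ex['question']}\n"
            f"A: {ex['reasoning']} The answer is {ex['answer']}.\n"
        )
    parts.append(f"Q: {new_question}\nA:")
    return "\n".join(parts)


if __name__ == "__main__":
    prompt = build_cot_prompt(
        "The cafeteria had 23 apples. If they used 20 to make lunch and "
        "bought 6 more, how many apples do they have?"
    )
    # Feed `prompt` to a large language model; with chain-of-thought
    # exemplars, it tends to emit reasoning steps before the final answer.
    print(prompt)
```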
Benchmarks
| Benchmark | Methodology | Metric | Value |
|---|---|---|---|
| Common Sense Reasoning on CommonsenseQA | Chain of thought ASDiv | Accuracy | 28.6 |
| Question Answering on WebQuestions | CoT | EM | 42.5 |