Large Language Models are typically used in a zero-shot or few-shot setting, that is, on novel tasks. When prompted to give only an answer, these models usually do not perform to their full potential. Chain-of-Thought prompting has been shown to successfully elicit reasoning capabilities from large models (>100B parameters). However, smaller models do not inherently show these capabilities, even when prompted in this manner. We examine whether such models can be finetuned in a few-shot Chain-of-Thought setting.
Our method requires only a small number of ground-truth Chain-of-Thought examples. Unlike other methods, we train on question-answer pairs only, while reusing the same static set of support examples for each forward pass. We show that Chain-of-Thought finetuning in this manner can improve performance on two common reasoning tasks (3-digit division and natural language inference) for models as small as 560M parameters. However, our technique does not outperform finetuning without CoT, indicating that the models do not learn to reason. We attribute most performance gains to finetuning on the task itself.
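To make the training setup concrete, the following is a minimal sketch of how examples could be constructed under the described scheme: a fixed Chain-of-Thought support prompt is prepended to every question, while the training target is the answer alone (no gold rationale). The support example, helper name, and formatting below are illustrative assumptions, not the paper's exact implementation.

```python
# Sketch of training-example construction for static-support CoT finetuning.
# The same support block is reused for every forward pass; only the
# question and final answer change per example.

STATIC_SUPPORT = (
    "Q: What is 512 / 4?\n"
    "A: 512 / 4 = (400 + 112) / 4 = 100 + 28 = 128. The answer is 128.\n\n"
)

def build_example(question: str, answer: str) -> dict:
    """Prepend the static CoT support to the question; the target
    contains only the final answer, not a reasoning chain."""
    prompt = f"{STATIC_SUPPORT}Q: {question}\nA:"
    return {"input": prompt, "target": f" The answer is {answer}."}

if __name__ == "__main__":
    example = build_example("What is 729 / 3?", "243")
    print(example["input"])
    print(example["target"])
```

Because the support block is identical across all training examples, the model sees worked Chain-of-Thought demonstrations in context while being supervised only on question-answer pairs.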