Learn Before
Dataset

GSM8K Benchmark

The GSM8K (Grade School Math 8K) dataset, introduced by Cobbe et al. in 2021, is a widely used benchmark for assessing the multi-step reasoning abilities of Large Language Models. It comprises roughly 8,500 grade-school-level math word problems, each paired with a step-by-step reference solution. To evaluate a model, one prompts it to generate a natural-language solution for each problem, and the final answer in that solution is compared with the reference answer.
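The evaluation procedure described above can be sketched in Python. This is a minimal illustration, not an official scoring script. It assumes each example is a mapping with "question" and "answer" fields and that the reference solution ends with a line of the form "#### <number>", as in the released dataset files. The `generate` callable stands in for whatever model is being tested and is hypothetical, as is the heuristic of treating the last number in the generated text as the model's answer.

```python
import re
from typing import Callable, Iterable, Mapping, Optional

# Matches integers and decimals, allowing thousands separators (e.g. "1,200").
_NUMBER = re.compile(r"-?\d[\d,]*(?:\.\d+)?")


def last_number(text: str) -> Optional[float]:
    """Return the last number that appears in `text`, or None if there is none."""
    matches = _NUMBER.findall(text)
    if not matches:
        return None
    return float(matches[-1].replace(",", ""))


def reference_number(solution: str) -> Optional[float]:
    """Read the final answer that follows the '####' marker (assumed format)."""
    return last_number(solution.split("####")[-1])


def accuracy(
    examples: Iterable[Mapping[str, str]],
    generate: Callable[[str], str],  # hypothetical model wrapper: question -> solution text
) -> float:
    """Fraction of problems whose extracted answer equals the reference answer."""
    correct = 0
    total = 0
    for example in examples:
        predicted = last_number(generate(example["question"]))
        expected = reference_number(example["answer"])
        if predicted is not None and predicted == expected:
            correct += 1
        total += 1
    return correct / total if total else 0.0


if __name__ == "__main__":
    # Made-up toy problem for illustration; not taken from the dataset.
    toy = [{"question": "Tom has 3 apples and buys 4 more. How many now?",
            "answer": "3 + 4 = 7\n#### 7"}]
    print(accuracy(toy, lambda q: "Tom has 3 + 4 = 7 apples. The answer is 7."))
```

Taking the last number in the output is a simple and common heuristic, but it can misread solutions that mention further numbers after the answer, so real evaluations often ask the model to state its answer in a fixed format.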

Updated 2026-04-30

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.2 Generative Models - Foundations of Large Language Models