BIG-Bench Benchmark

BIG-Bench (the Beyond the Imitation Game Benchmark) is an evaluation benchmark used to assess and quantify the capabilities of large language models across a diverse set of tasks. It serves as a testing ground for comparing model performance against human baselines. For example, the 540-billion-parameter PaLM (Pathways Language Model) outperformed average human performance on the BIG-Bench benchmark.
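To make the idea of comparing a model against a human baseline concrete, here is a minimal sketch in Python. It assumes each task yields one score for the model and one for average human raters, and it aggregates with a plain mean. The function name `compare_to_human_baseline`, the task names, and all numbers are hypothetical placeholders for illustration; they are not real BIG-Bench results, and this is not the benchmark's official aggregation procedure.

```python
from statistics import mean


def compare_to_human_baseline(model_scores, human_scores):
    """Compare per-task model scores with per-task human scores.

    Both arguments map a task name to a score in [0, 1].
    Returns (model_mean, human_mean, tasks_won), computed only over
    tasks that appear in both mappings.
    """
    shared = sorted(set(model_scores) & set(human_scores))
    if not shared:
        raise ValueError("no tasks in common")
    model_mean = mean(model_scores[t] for t in shared)
    human_mean = mean(human_scores[t] for t in shared)
    # Tasks on which the model strictly beats the human baseline.
    tasks_won = [t for t in shared if model_scores[t] > human_scores[t]]
    return model_mean, human_mean, tasks_won


# Placeholder scores for three hypothetical tasks (not real results).
model = {"task_a": 0.80, "task_b": 0.40, "task_c": 0.70}
human = {"task_a": 0.60, "task_b": 0.50, "task_c": 0.60}

model_mean, human_mean, won = compare_to_human_baseline(model, human)
print(f"model mean: {model_mean:.3f}, human mean: {human_mean:.3f}")
print("tasks where the model beats the baseline:", won)
```

Note that a model can exceed the human average in aggregate while still losing on individual tasks, as `task_b` does in this toy example, so per-task results are worth reporting alongside the mean.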

Updated 2026-05-15

Tags

D2L

Dive into Deep Learning @ D2L
