Learn Before
Concept

Chinchilla

Chinchilla is a 7070-billion-parameter language model that inherits the architecture and compute budget of Gopher but trains on a significantly larger dataset of 1.41.4 trillion tokens. It outperforms Gopher by placing greater emphasis on the number of training tokens rather than the sheer number of model parameters.

0

1

Updated 2026-05-15

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L

Related
Learn After