Learn Before
Concept
Chinchilla
Chinchilla is a -billion-parameter language model that inherits the architecture and compute budget of Gopher but trains on a significantly larger dataset of trillion tokens. It outperforms Gopher by placing greater emphasis on the number of training tokens rather than the sheer number of model parameters.
0
1
Updated 2026-05-15
Tags
D2L
Dive into Deep Learning @ D2L