LLaMA2
LLaMA2 is a family of large language models released by Meta in 2023, available in 7-billion-, 13-billion-, and 70-billion-parameter versions, each pre-trained on roughly 2 trillion tokens. (The figures sometimes quoted for this family, a 65-billion-parameter model trained on 1.0 to 1.4 trillion tokens, actually describe the largest model of the original LLaMA release from earlier in 2023.) The training data comes from a diverse mix of public sources, including webpages, software code, Wikipedia, books, academic papers, and question-and-answer content.
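The idea of a "mix" of pre-training sources can be made concrete with a small sketch: each source gets a sampling weight, the weights convert into per-source token budgets, and the next training document is drawn in proportion to those weights. The weights below are hypothetical, chosen only for illustration; LLaMA2's exact per-source proportions are not public.

```python
import random

# Hypothetical source-mix weights (fractions of the total token budget).
# These numbers are illustrative only, not LLaMA2's actual proportions.
SOURCE_MIX = {
    "web": 0.80,
    "code": 0.08,
    "wikipedia": 0.04,
    "books": 0.04,
    "papers": 0.02,
    "qa": 0.02,
}

TOTAL_TOKENS = 2_000_000_000_000  # ~2 trillion tokens, as for LLaMA2


def tokens_per_source(mix, total):
    """Convert mix fractions into per-source token budgets."""
    return {name: int(frac * total) for name, frac in mix.items()}


def sample_source(mix, rng=random):
    """Pick the source of the next training document, proportional to its weight."""
    names = list(mix)
    weights = [mix[n] for n in names]
    return rng.choices(names, weights=weights, k=1)[0]


budgets = tokens_per_source(SOURCE_MIX, TOTAL_TOKENS)
```

Under this sketch, a heavily web-weighted mix explains behaviors such as fluent conversational text alongside weaker long-form structure: the model simply sees far more of one kind of text than another.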
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Related
BERT
BART
T5
BERT (Bidirectional Encoder Representations from Transformers)
RoBERTa
GPT Series
DeepSeek-V3
Falcon
Mistral
PaLM-540B
Gemma-7B
Gemma2
A software development team is tasked with building a feature that can automatically generate a concise, one-paragraph summary from a long news article. The system needs to first comprehend the full context of the source article and then generate a new, coherent summary. Based on the typical strengths of different foundational model designs, which of the following models would be the most suitable choice for this specific task?
Match each pre-trained model with the description that best fits its architectural design and primary use case.
Evaluating Model Architecture Selection for a Classification Task
Data Volume vs. Quality in LLM Pre-training
GPT-3
Falcon
LLaMA2
PaLM-540B
Gemma-7B
Evaluating Data Sources for LLM Pre-training
Data Source Selection for a Specialized LLM
A newly developed large language model demonstrates high fluency and generates grammatically perfect, conversational text. However, it frequently provides outdated information, struggles to generate well-structured, long-form content like reports, and often fabricates details when asked about events from the last year. Based on these specific performance characteristics, which of the following descriptions most likely represents the composition of its pre-training dataset?
GPT-3
Falcon
LLaMA2
PaLM-540B
Gemma-7B
Learn After
A research team is evaluating foundational models for two distinct projects. Project A requires a model to perform complex text classification and sentiment analysis on legal documents. Project B requires a model to generate creative, long-form stories from a short prompt. Based on the typical design of large-scale, generative language models, which statement best analyzes the suitability of a model like the 70-billion-parameter LLaMA2 for these projects?
Analyzing Model Behavior Based on Pre-training Data
Evaluating the Impact of LLaMA2's Pre-training Data