Learn Before
GPT-4
As the successor to GPT-3, GPT-4 is a large-scale, multimodal model that significantly expands upon the capabilities of previous text-only architectures. Although its full technical details were not completely disclosed, its defining characteristic is the ability to process both text and images as input to generate text outputs.
0
1
Tags
D2L
Dive into Deep Learning @ D2L
Related
A research institution is planning to develop a new language model with approximately 175 billion parameters. Based on the characteristics of a model of this magnitude, which of the following represents the most significant trade-off the institution must evaluate?
A 2020 research paper by Brown et al. introduced a generative pre-trained transformer model that was particularly groundbreaking. What was the most defining characteristic of this model that set it apart from its direct predecessors?
The largest version of the generative pre-trained transformer model introduced in 2020 by Brown et al. is notable for its scale, containing ____ parameters.
Performance Scaling in GPT-3
GPT-4
InstructGPT