Learn Before
A machine learning engineer needs to deploy a large Transformer-based language model on a device with very limited memory. The primary objective is to significantly reduce the model's file size on disk. Which of the following strategies directly achieves this by changing the numerical precision of the model's parameters?
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Evaluating a Model Compression Strategy
A development team successfully reduces the size of a large Transformer-based language model by converting its 32-bit floating-point parameters into 8-bit integers. What is the primary trade-off they must evaluate to ensure the compressed model is still effective for its intended task?
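The conversion described in these questions (32-bit floating-point weights to 8-bit integers) can be sketched with a minimal symmetric linear quantization example. This is an illustrative NumPy sketch, not any specific library's implementation; the weight matrix is a hypothetical stand-in for one layer's parameters, and the 4x size reduction and rounding error it reports illustrate both the storage benefit and the accuracy trade-off the second question asks about.

```python
import numpy as np

# Hypothetical FP32 weight matrix standing in for one Transformer layer.
rng = np.random.default_rng(0)
weights_fp32 = rng.normal(0.0, 0.02, size=(1024, 1024)).astype(np.float32)

# Symmetric linear quantization: one FP32 scale maps the tensor onto [-127, 127].
scale = np.abs(weights_fp32).max() / 127.0
weights_int8 = np.clip(np.round(weights_fp32 / scale), -127, 127).astype(np.int8)

# Dequantize to measure the precision lost -- the trade-off to evaluate.
weights_dequant = weights_int8.astype(np.float32) * scale
max_error = np.abs(weights_fp32 - weights_dequant).max()

print(weights_fp32.nbytes // weights_int8.nbytes)  # -> 4 (4x smaller on disk)
```

Each INT8 value takes one byte instead of four, so the stored model shrinks by roughly 4x, while `max_error` (bounded by half the scale) quantifies the rounding noise that can degrade task accuracy.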