Evaluating Model Compression Strategies
A development team needs to deploy a large, pre-trained Transformer-based language model on a mobile device with limited processing power. Their primary goal is to significantly reduce inference time while minimizing the loss of accuracy on a sentiment analysis task. They are considering two different strategies for compressing the model. Evaluate the two proposals below and justify which one is more likely to achieve the team's goal.
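The card does not reproduce the two proposals themselves, but a canonical strategy in this setting is post-training quantization, which shrinks storage and speeds up inference at a small accuracy cost. The sketch below is illustrative only (it is not taken from the team's proposals): it shows symmetric int8 quantization of a weight vector in plain Python, with `quantize_int8` and `dequantize` as hypothetical helper names.

```python
def quantize_int8(weights):
    """Map float weights to int8 with one symmetric per-tensor scale."""
    scale = max(abs(w) for w in weights) / 127.0
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from the int8 codes."""
    return [qi * scale for qi in q]

weights = [0.42, -1.27, 0.05, 0.9, -0.33]
q, scale = quantize_int8(weights)
recovered = dequantize(q, scale)

# int8 storage is 4x smaller than float32, and the per-weight
# round-trip error is bounded by scale / 2, which is why a
# well-quantized model usually loses little task accuracy.
max_err = max(abs(w - r) for w, r in zip(weights, recovered))
assert max_err <= scale / 2 + 1e-9
```

The point of the exercise is to weigh exactly this kind of trade-off: how much each compression strategy reduces compute and memory on the mobile device versus how much sentiment-analysis accuracy it sacrifices.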
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Evaluating Model Compression Strategies
A development team needs to accelerate the inference speed of a large, pre-trained language model for a task requiring a deep understanding of long-range dependencies and complex sentence structures. Which of the following strategies for reducing the model's size is most likely to severely degrade performance on this specific task?
A machine learning team is exploring different methods to reduce the size and inference time of a large language model based on the Transformer architecture. Match each pruning technique with its most likely description or primary impact.