Learn Before
Choosing a Generation Combination Strategy
Based on the scenario provided, which method (A or B) directly implements the principle of averaging predictions at the token level? Justify your choice by explaining the fundamental difference in how the two methods combine model outputs to produce the final text.
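Since the scenario defining methods A and B is not reproduced on this card, the sketch below only illustrates the general distinction the question targets: one strategy merges models at every generation step by averaging their token-level probability distributions, while the other lets each model generate a complete text and combines only the finished outputs. The function names and the length-based scoring rule are illustrative assumptions, not part of the original scenario.

```python
def combine_token_level(dists):
    """Average per-step probability distributions, then pick the argmax token.

    This merges the models' beliefs *before* a token is committed to."""
    avg = {tok: sum(d[tok] for d in dists) / len(dists) for tok in dists[0]}
    return max(avg, key=avg.get)

def combine_output_level(candidates, score):
    """Each model generates a complete text independently; a scoring rule
    then selects among the finished outputs. No per-token mixing occurs."""
    return max(candidates, key=score)

# Token-level: decisions are merged at every generation step.
step_dists = [{"warm": 0.6, "sunny": 0.4},
              {"warm": 0.2, "sunny": 0.8}]
print(combine_token_level(step_dists))  # sunny

# Output-level: full candidate texts are compared only after generation
# (here scored by length purely for illustration).
candidates = ["The weather today is exceptionally warm.",
              "The weather today is exceptionally sunny."]
print(combine_output_level(candidates, score=len))
```

The key contrast: token-level averaging can produce a continuation that neither model would have generated on its own, whereas output-level combination can only return one of the texts the individual models actually produced.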
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Formula for Token-Level Model Averaging in Prompt Ensembling
Imagine two language models are tasked with completing the sentence: 'The weather today is exceptionally...'. At this specific step, they must choose the very next word. Their internal calculations produce the following probability scores for the top three candidate words:
- Model 1: warm (0.6), sunny (0.3), bright (0.1)
- Model 2: warm (0.2), sunny (0.7), bright (0.1)
If a system combines these models by averaging their token-level probability distributions to make a decision, which word will it select as the next word in the sequence, and why?
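The selection can be checked directly by averaging the two distributions from the question (a minimal sketch; the dictionaries simply restate the probabilities given above):

```python
# Token-level probabilities from the question.
model_1 = {"warm": 0.6, "sunny": 0.3, "bright": 0.1}
model_2 = {"warm": 0.2, "sunny": 0.7, "bright": 0.1}

# Average the two distributions token by token:
# warm -> (0.6 + 0.2) / 2 = 0.4
# sunny -> (0.3 + 0.7) / 2 = 0.5
# bright -> (0.1 + 0.1) / 2 = 0.1
averaged = {tok: (model_1[tok] + model_2[tok]) / 2 for tok in model_1}

# The token with the highest averaged probability is selected.
print(max(averaged, key=averaged.get))  # sunny
```

Although each model individually puts substantial mass on "warm", averaging gives "sunny" the highest combined probability (0.5 vs. 0.4), so the ensemble picks "sunny".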
Analysis of Text Generation Combination Methods
Choosing a Generation Combination Strategy