Learn Before
Model Initialization Strategy in RLHF
The first step in the RLHF training pipeline is to initialize the necessary models from existing checkpoints. Typically, the reward and value models are initialized from a pre-trained large language model, while the reference and target (policy) models are initialized from a model that has already undergone instruction fine-tuning. After initialization, the reference model's parameters are frozen and receive no updates during the subsequent training stages.
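The sketch below illustrates this initialization scheme using Hugging Face Transformers. It is a minimal illustration, not a full RLHF setup: the checkpoint names are placeholders, and giving the reward and value models a scalar output head via AutoModelForSequenceClassification with num_labels=1 is one common convention, assumed here for concreteness.

```python
# Minimal sketch of RLHF model initialization (hypothetical checkpoints).
from transformers import AutoModelForCausalLM, AutoModelForSequenceClassification

PRETRAINED_CKPT = "my-org/base-llm"       # placeholder: pre-trained LLM
SFT_CKPT = "my-org/base-llm-sft"          # placeholder: instruction fine-tuned LLM

# Reward and value models: initialized from the pre-trained LLM,
# with a single-output head that produces one scalar score per input.
reward_model = AutoModelForSequenceClassification.from_pretrained(
    PRETRAINED_CKPT, num_labels=1
)
value_model = AutoModelForSequenceClassification.from_pretrained(
    PRETRAINED_CKPT, num_labels=1
)

# Policy (target) and reference models: initialized from the
# instruction fine-tuned model.
policy_model = AutoModelForCausalLM.from_pretrained(SFT_CKPT)
reference_model = AutoModelForCausalLM.from_pretrained(SFT_CKPT)

# Freeze the reference model: it receives no gradient updates for the
# rest of training and serves only as a fixed comparison point.
reference_model.requires_grad_(False)
reference_model.eval()
```

Starting the policy and reference models from the same fine-tuned checkpoint means they begin identical; only the policy is updated afterward, so any divergence between the two reflects what RL training has changed.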

Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.4 Alignment - Foundations of Large Language Models
Related
Model Initialization Strategy in RLHF
A team is developing a large language model aligned with human preferences using a reinforcement learning approach. Arrange the following key phases of their training pipeline into the correct chronological order.
Diagnosing a Flawed Alignment Process
An AI development team is in the final stage of a three-part alignment process for their language model. They observe that the model's outputs are becoming increasingly nonsensical, even though the reward scores assigned during this final stage are consistently high. The team has already confirmed that the initial models were set up correctly and that the dataset of human preferences used in the second stage is high-quality. Based on this information, what is the most probable cause of the model's deteriorating performance?
Learn After
Data Collection for Reward Modeling in RLHF
A machine learning team is implementing a training process that uses human feedback to align a language model. They have access to two base models: a general-purpose pre-trained language model (Model A) and a version of that model that has been further fine-tuned on a set of instructions (Model B). For the first stage of their process, which of the following initialization plans is correct for the policy, reference, reward, and value models?
Rationale for Freezing the Reference Model in RLHF
Analyzing an RLHF Initialization Error