Essay

Evaluating LLM Training Strategies for a Tutoring Application

A software company is training a large language model to act as a programming tutor for beginners. It is considering two training approaches.

Approach A: The model is given a programming problem and is rewarded only if the final code it generates passes a set of predefined tests.

Approach B: The model is given the same problem, but the training data includes a detailed, step-by-step solution. The model is rewarded for correctly generating each logical step in the problem-solving process (e.g., defining variables, writing the main loop, handling edge cases).
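To make the difference between the two reward signals concrete, here is a minimal sketch in Python. It is illustrative only: the function names (`outcome_reward`, `process_reward`, `same_step`) and the idea of comparing generated steps to reference steps by normalized string equality are assumptions made for this example, not part of the scenario. A real system would need a far more robust step checker.

```python
from typing import Callable, List, Sequence


def outcome_reward(tests_passed: Sequence[bool]) -> float:
    """Approach A (sketch): reward 1.0 only if every predefined test passes."""
    # An empty test list earns no reward, so "no tests" never counts as success.
    return 1.0 if len(tests_passed) > 0 and all(tests_passed) else 0.0


def same_step(generated: str, reference: str) -> bool:
    """Hypothetical step checker: case- and whitespace-insensitive equality."""
    return generated.strip().lower() == reference.strip().lower()


def process_reward(
    generated_steps: Sequence[str],
    reference_steps: Sequence[str],
    step_matches: Callable[[str, str], bool] = same_step,
) -> List[float]:
    """Approach B (sketch): one reward per step of the reference solution."""
    rewards: List[float] = []
    for i, reference in enumerate(reference_steps):
        # A step the model never produced earns 0.0.
        if i < len(generated_steps) and step_matches(generated_steps[i], reference):
            rewards.append(1.0)
        else:
            rewards.append(0.0)
    return rewards


# Approach A collapses the whole attempt into a single number.
print(outcome_reward([True, True, False]))

# Approach B reports which steps were right and which were missing.
print(process_reward(
    ["Define variables", "Write the main loop"],
    ["define variables", "write the main loop", "handle edge cases"],
))
```

Under these assumptions, Approach A returns a single scalar for the whole attempt, while Approach B returns a list with one entry per reference step, so a missing edge-case step is visible in the signal itself.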

Evaluate the long-term effectiveness of these two approaches for creating a reliable and helpful programming tutor. In your evaluation, justify which approach is superior and explain the potential pitfalls of the less effective method.


Updated 2025-10-07


Tags

Ch.5 Inference - Foundations of Large Language Models

