Training Strategy for a Legal AI
A legal tech company wants to improve its powerful, general-purpose language model ('StrongLLM') for the specific task of identifying 'indemnity clauses' in contracts. They have a massive database of unlabeled contracts but lack the resources to have lawyers label them all. They also have a smaller, less capable model ('WeakLLM') that can identify these clauses with about 70% accuracy. They propose the following two-stage plan:
- Use 'WeakLLM' to scan the entire database of unlabeled contracts and generate a label ('contains indemnity clause' or 'does not contain indemnity clause') for each one.
- Take the dataset of contracts and their machine-generated labels and use it to fine-tune the 'StrongLLM', training it to predict the labels provided by 'WeakLLM'.
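The two-stage plan above can be sketched in code. This is a minimal illustration, not an actual implementation: `weak_label` is a hypothetical stand-in that simulates WeakLLM's ~70% accuracy with a coin flip, and the fine-tuning step is left as a placeholder for a standard supervised training loop.

```python
import random

def weak_label(contract_text, rng):
    """Hypothetical stand-in for WeakLLM: guesses whether a contract
    contains an indemnity clause, correct about 70% of the time."""
    truth = "indemnify" in contract_text.lower()  # toy ground truth
    return truth if rng.random() < 0.7 else not truth

def build_weak_dataset(contracts, seed=0):
    """Stage 1: WeakLLM scans every unlabeled contract and emits a
    machine-generated label, producing (text, label) training pairs."""
    rng = random.Random(seed)
    return [(text, weak_label(text, rng)) for text in contracts]

def finetune_strong(strong_model, dataset):
    """Stage 2 (placeholder): fine-tune StrongLLM to predict the labels
    WeakLLM produced -- in practice, a supervised fine-tuning loop
    minimizing cross-entropy against the weak labels."""
    ...
```

The key point the sketch makes concrete: StrongLLM never sees human labels, only WeakLLM's noisy ones, so the interesting question is whether it can generalize beyond its supervisor's 70% accuracy.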
Based on this scenario, explain the rationale behind this two-stage training strategy. Specifically, describe the role of the 'WeakLLM' in the first stage and the expected outcome for the 'StrongLLM' after the second stage.
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Objective Function for Fine-Tuning a Strong LLM with Weak Supervision
A research team is developing a powerful new language model for summarizing scientific papers. Lacking a large, human-curated dataset of summaries, they use an older, less accurate model to generate summaries for 100,000 papers. They then fine-tune their powerful new model on this machine-generated dataset, with the goal of teaching it to produce summaries that match the ones from the older model. What is the most significant inherent risk in this training strategy?
Training Strategy for a Legal AI
Visual Diagram of Weak-to-Strong Generalization via Data Selection
A team is implementing a strategy where a powerful language model learns from a less capable one. Arrange the following steps into the correct chronological order to describe this process.
Your company is rolling out an instruction-tuned L...
You lead an LLM enablement team building an instru...
You’re leading an LLM platform team building an in...
Your company is building an internal IT helpdesk a...
Deciding Whether (and How) to Use Weak-Model Synthetic Data for Instruction Fine-Tuning
Diagnosing and Fixing a Synthetic Instruction-Tuning Data Flywheel That Degrades Model Behavior
Designing a Synthetic Instruction Fine-Tuning Pipeline Under Budget and Quality Constraints
Stabilizing an Instruction-Tuned Support Assistant When Synthetic Data Conflicts with Human Policy
Selecting and Filtering Self-Generated Instruction Data When Bootstrapping a Strong Model from a Weak Supervisor
Choosing a Weak-Model + Self-Instruct Data Strategy for Instruction Fine-Tuning Without Regressions