1Cademy - Exploration vs. Exploitation in LLM Search

Learn Before

The Search Problem in LLM Inference

Concept

Exploration vs. Exploitation in LLM Search

Solving the search problem in LLM inference requires managing the fundamental trade-off between exploration and exploitation. Exploration involves searching broadly across the vast space of possible output sequences to discover novel, high-quality options. Exploitation, on the other hand, involves focusing on and refining the most promising sequences already found. The central challenge is to devise an efficient search strategy that balances these two aspects to produce high-quality outputs without conducting an exhaustive search.