1Cademy - A development team fine-tunes a large language model on a custom-built dataset of 50,000 technical support chat logs to improve its ability to resolve customer issues. The fine-tuned model achieves near-perfect accuracy on a test set composed of 5,000 additional logs from the same original source. However, when deployed to handle live customer chats, which include new and unforeseen types of user problems, the models performance is significantly worse. Based on this scenario, which challenge as

Learn Before

Challenges of Training-Based Methods for LLM Reasoning

Multiple Choice

A development team fine-tunes a large language model on a custom-built dataset of 50,000 technical support chat logs to improve its ability to resolve customer issues. The fine-tuned model achieves near-perfect accuracy on a test set composed of 5,000 additional logs from the same original source. However, when deployed to handle live customer chats, which include new and unforeseen types of user problems, the model's performance is significantly worse. Based on this scenario, which challenge as

Updated 2025-09-26

Contributors are:

Who are from:

Learn Before

Related