Learn Before
Evaluating an Agent's Action Set
An agent is being trained to play tic-tac-toe on a 3x3 grid. The agent's set of possible moves is defined as {'place X on square 1', 'place X on square 2', ..., 'place X on square 9'}. Critically evaluate this set of moves. Identify one significant flaw and explain why it is a problem for the agent's learning process.
0
1
Tags
Data Science
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Action in the Context of LLMs
A simple robotic arm is being trained to sort objects on a conveyor belt. The arm can perform only three distinct movements from its resting position: it can pick up an object, it can place an object in a bin, or it can do nothing and wait. In this learning scenario, what does the set {pick up, place, wait} represent?
Evaluating an Agent's Action Set
Smart Thermostat Agent Actions