Possible Limitations in Text Game RL Problems
Environment model: Because a text game provides only textual feedback, the agent's decision process is defined as a discrete-time partially observable Markov decision process (POMDP).
Game-generation tools can impose the following limitations to reduce the scope of the challenges the agent faces:
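For reference, a discrete-time POMDP is commonly written as a tuple. The symbols below follow the standard textbook convention and are an assumption here, since the note itself does not fix a notation:

```latex
% Standard POMDP tuple (notation assumed, not taken from the note)
\[
  (S,\; A,\; T,\; \Omega,\; O,\; R,\; \gamma)
\]
% S      : set of environment states (hidden from the agent)
% A      : set of actions (here, text commands)
% T      : state transition function, T(s' \mid s, a)
% \Omega : set of observations (here, textual feedback)
% O      : observation function, O(o \mid s', a)
% R      : reward function, R(s, a)
% \gamma : discount factor, \gamma \in [0, 1]
```

In a text game the agent never sees the state in S directly. It only receives a textual observation from \Omega, which is what makes the process partially observable.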
Partial observability - Can be controlled by improving the agent's observations.
Complexity of the language - Can be reduced by providing the in-game vocabulary to the agent and by using symbolic tokens.
Rewards - Can be simplified by rewarding the agent based on state transitions and the ground-truth winning policy.
Actions - Can be simplified by using word-level and syntax-constrained commands to reduce the complexity of the actions to be taken.
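As a rough illustration of the vocabulary, action, and reward limitations above, the following minimal sketch enumerates commands from a fixed in-game vocabulary and rewards the agent for following a known winning policy. The vocabulary, the two-word "verb noun" syntax, the winning policy, and all function names are hypothetical and chosen only for illustration. They do not come from any particular text-game framework.

```python
from itertools import product

# Hypothetical in-game vocabulary handed to the agent (assumption for illustration).
VERBS = ["go", "take", "open"]
NOUNS = ["north", "key", "door"]

# Hypothetical ground-truth winning policy: the ordered commands that solve the game.
WINNING_POLICY = ["take key", "open door", "go north"]


def candidate_commands(verbs, nouns):
    """Enumerate every two-word 'verb noun' command allowed by the fixed syntax."""
    return [f"{verb} {noun}" for verb, noun in product(verbs, nouns)]


def step_reward(progress, command):
    """Return (reward, new_progress).

    The reward is 1 when the command matches the next step of the winning
    policy; otherwise it is 0 and progress is unchanged.
    """
    if progress < len(WINNING_POLICY) and command == WINNING_POLICY[progress]:
        return 1, progress + 1
    return 0, progress


def run_episode(commands):
    """Play a list of commands and return (total_reward, game_won)."""
    total, progress = 0, 0
    for command in commands:
        reward, progress = step_reward(progress, command)
        total += reward
    return total, progress == len(WINNING_POLICY)
```

With this restriction the action space shrinks from free-form text to the 9 combinations of 3 verbs and 3 nouns, and the reward signal is dense because each correct step along the winning policy is rewarded rather than only the final win.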
Updated 2022-08-14
Tags
Natural language processing
Data Science