Concept

Possible Limitations in Text Game RL Problems

Environment Model: Because a text game provides only textual feedback, the agent's decision process is modeled as a discrete-time partially observable Markov decision process (POMDP).
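
A minimal sketch of what partial observability means here, assuming a hypothetical two-room game (the `State` fields, the commands, and the room names are invented for illustration and do not come from any particular game engine). The hidden state records whether a door is unlocked, but the textual observation does not mention it, so two different states can produce the same text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class State:
    """Hidden game state; the agent never sees this directly."""
    room: str
    door_unlocked: bool


def observe(state: State) -> str:
    """Textual feedback: describes the room, not whether the door is unlocked."""
    return f"You are in the {state.room}. There is a door to the north."


def step(state: State, command: str) -> State:
    """Deterministic transition for one discrete time step."""
    if command == "unlock door" and state.room == "hall":
        return State(state.room, True)
    if command == "go north" and state.room == "hall" and state.door_unlocked:
        return State("garden", state.door_unlocked)
    # Unrecognized or impossible commands leave the state unchanged.
    return state


locked = State("hall", False)
unlocked = step(locked, "unlock door")

# Distinct hidden states, identical observation: the agent cannot tell them apart.
same_text = observe(locked) == observe(unlocked)
```

Because the observation alone does not identify the state, the agent has to rely on its history of observations and commands, which is what separates a POMDP from a fully observed MDP.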

A game-generation tool can impose the following limitations to reduce the scope of the challenges the agent faces:

Partial observability - Can be controlled by improving the agent's observations.

Complexity of the language - Can be reduced by providing the in-game vocabulary to the agent and using symbolic tokens.

Rewards - Can be simplified by rewarding the agent based on state transitions and a ground-truth winning policy.

Actions - Can be simplified by using word-level and syntax-based commands to reduce the complexity of the actions to be taken.
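
The four controls above can be sketched roughly as follows. This is an illustrative toy, not the interface of any real game-generation tool: the vocabulary, the `WINNING_POLICY` list, and all function names are assumptions made up for the example.

```python
from itertools import product
from typing import List, Tuple

# Hypothetical in-game vocabulary handed to the agent up front (language complexity).
VERBS = ["take", "open", "go"]
NOUNS = ["key", "door", "north"]

# Assumed ground-truth winning policy: the command sequence that wins the game.
WINNING_POLICY = ["take key", "open door", "go north"]


def admissible_commands() -> List[str]:
    """Restrict the action space to two-word 'verb noun' commands (actions)."""
    return [f"{verb} {noun}" for verb, noun in product(VERBS, NOUNS)]


def reward(progress: int, command: str) -> Tuple[int, float]:
    """Return (new progress, reward); +1 when the command advances along the winning policy."""
    if progress < len(WINNING_POLICY) and command == WINNING_POLICY[progress]:
        return progress + 1, 1.0
    return progress, 0.0


def augmented_observation(text: str, progress: int) -> str:
    """Append otherwise hidden progress to the feedback (partial observability)."""
    return f"{text} [steps completed: {progress}/{len(WINNING_POLICY)}]"


# Usage: one wasted command, three useful ones.
progress, total = 0, 0.0
for command in ["take key", "go north", "open door", "go north"]:
    progress, r = reward(progress, command)
    total += r
```

With this vocabulary the agent chooses among nine fixed commands instead of generating free-form text, and it receives a reward signal at every step that makes progress rather than only at the end of the game.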

Updated 2022-08-14

Tags

Natural language processing

Data Science