Sign#
Description#
Sign environment from https://arxiv.org/abs/2008.02790.
If you use this environment, please cite the above paper (Liu et al., 2020).
Small U-shaped maze with 6 objects: (blue, red, green) x (key, box). A sign on the wall says “blue”, “green”, or “red.”
In addition to the normal state, accessible under state[“obs”], the state also includes a goal under state[“goal”] that specifies box or key.
The episode ends when any object is touched.
The sign and goal can be configured via the color_index and goal arguments to the constructor respectively.
Includes an action to end the episode.
Action Space#
Num |
Action |
---|---|
0 |
turn left |
1 |
turn right |
2 |
move forward |
3 |
end episode |
Observation Space#
The observation space is an ndarray
with shape (obs_height, obs_width, 3)
representing a RGB image of what the agents sees.
Rewards:#
+1 for touching the object where the color matches the sign and the shape matches the goal -1 for touching any other object
Arguments#
Sign(size=10, max_episode_steps=20, color_index=0, goal=0)
size
: size of the square room.
max_episode_steps
: number of steps before the episode ends.
color_index
: specifies whether the sign says blue (0), green (1), or red (2).
goal
: specifies box (0) or key (1).