LightsOut #490

Open
zdu863 wants to merge 4 commits into PufferAI:3.0 from zdu863:lightsout-pr

zdu863 commented Feb 21, 2026

This PR implements the Lights Out game as an environment.

  • Each action selects one cell, toggling that cell plus its orthogonal neighbors. The goal is to turn all lights off within a step limit.
  • Observations are binary grid states, actions are cell indices, and rewards encourage solving efficiently.
  • The environment supports vectorized training and raylib rendering.
  • The environment includes a curriculum mechanism driven by an EMA of recent outcomes.
  • This EMA adjusts scramble probability over time, so training can start easier and become harder as policy performance improves.
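
For readers unfamiliar with the mechanics, here is a minimal Python sketch of the toggle rule and an EMA-driven curriculum. It is not the code in this PR: the class name `LightsOutSketch`, the reward values, the `low`/`high` probability bounds, and the linear mapping from EMA to scramble probability are all assumptions chosen for illustration.

```python
import random


class LightsOutSketch:
    """Hypothetical, simplified Lights Out env (illustration only)."""

    def __init__(self, size=5, max_steps=50, ema_alpha=0.05, seed=0):
        self.size = size
        self.max_steps = max_steps
        self.ema_alpha = ema_alpha      # assumed EMA smoothing factor
        self.solve_ema = 0.0            # EMA of recent outcomes (1 = solved)
        self.rng = random.Random(seed)
        self.grid = [0] * (size * size)
        self.steps = 0

    def scramble_prob(self, low=0.1, high=0.5):
        # Assumed mapping: a higher solve rate gives a higher per-cell
        # press probability, so starting boards get harder over time.
        return low + (high - low) * self.solve_ema

    def toggle(self, action):
        # Flip the selected cell and its in-bounds orthogonal neighbors.
        row, col = divmod(action, self.size)
        for dr, dc in ((0, 0), (-1, 0), (1, 0), (0, -1), (0, 1)):
            r, c = row + dr, col + dc
            if 0 <= r < self.size and 0 <= c < self.size:
                self.grid[r * self.size + c] ^= 1

    def reset(self):
        # Scramble by pressing random cells from the solved board, which
        # keeps every starting position solvable.
        self.grid = [0] * (self.size * self.size)
        self.steps = 0
        p = self.scramble_prob()
        for cell in range(self.size * self.size):
            if self.rng.random() < p:
                self.toggle(cell)
        return list(self.grid)

    def step(self, action):
        self.toggle(action)
        self.steps += 1
        solved = not any(self.grid)
        done = solved or self.steps >= self.max_steps
        # Assumed reward shape: small per-step penalty, bonus on solve.
        reward = 1.0 if solved else -0.01
        if done:
            # Update the EMA only when an episode ends.
            self.solve_ema += self.ema_alpha * (float(solved) - self.solve_ema)
        return list(self.grid), reward, done
```

In this sketch, pressing the center of an all-off 3x3 board lights a plus shape, and pressing it again clears the board. Each solved episode nudges `solve_ema` toward 1, which raises `scramble_prob()` for later resets.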
