Hello, here is the post-jam version of my Ludum Dare 47 entry. The main change is the AI has a better reward function now and there are 2 additional players. It isn’t nearly as exploitable now. Here is the Jam version.
As before, you (the green agent) move with WASD.
NOTE: It may take a minute to load.
A bit more information on the AI – it uses AOgmaNeo (Arduino-compatible version of OgmaNeo2) but compiled with WebAssembly (with emscripten). Graphics are handled by the SFML-like (but WebAssembly-compatible) library SMK.
Each player has 18 columns with 16 cells in each column. Overall, there are 1,440 cells and almost 1 million synapses being simulated in the game.
The agents are trained via self-play, but they continue learning while you play (online learning).