Show HN: Watch a neural net learn to play Snake
15 points by c1b 2 days ago | 3 comments
In browser PPO training demo, made possible by tinygrad: TinyJit -> WebGPU kernels.

Requires WebGPU.


dole 47 seconds ago
Dunno if it's a bug or feature but it seems like the more you let it train, the more apt it is to fall back to going in circles and not scoring as a strategy. "The Only Winning Move."
reply
simedw 26 minutes ago
Cool project!

I noticed that if you go from training to watch and then back, the training temporarily drop significantly in score.

reply
neduma 17 minutes ago
More details and implementation notes please?
reply