As impractical as the idea is, reinforcement learning is so damn fun. I highly r...

orlp · on Dec 24, 2020

Eh, I'd say it's fun if you have a couple thousand TPUs lying around.

If you're just messing around with one GPU and a desktop PC, you should be happy to get Atari Breakout to work.

syntaxing · on Dec 24, 2020

Definitely the worst part of RL is that it takes so long to train. But it works surprisingly well on Google Colab or equivalent.

confuseshrink · on Dec 24, 2020

The published hyperparameters are usually ridiculously conservative. For simple games like Breakout and Pong, you can usually converge in far fewer frames than the papers report.

orlp · on Dec 24, 2020

I have tried reproducing the papers, with mixed success. I do not share your sentiment.