
Reinforcement Learning Made Simple: Build a Q-Learning Agent in Python
In March 2016, Go world champion Lee Sedol faced an opponent made not of flesh and blood but of lines of code. It soon became clear that the human was outmatched: Lee Sedol lost the five-game match 4:1. Last week I watched the documentary AlphaGo again, and …