Thoughts on AlphaGo beating Lee Sedol

This morning, DeepMind made history by beating Go world champion Lee Sedol.

(Four more games will be played in the coming days.)

On this HN thread, clickok made some great comments:

AlphaGo underwent a substantial amount of improvement since October, apparently. The idea that it could go from mid-level professional to world class in a matter of months is kinda shocking. Once you find an approach that works, progress is fairly rapid.

I don’t play Go, and so it was perhaps unsurprising that I didn’t really appreciate the intricacies of the match, but even being familiar with deep reinforcement learning didn’t help either. You can write a program that will crush humans at chess with tree-search + position evaluation in a weekend, and maybe build some intuition for how your agent “thinks” from that, plus maybe playing a few games. Can you get that same level of insight into how AlphaGo makes its decisions? Even evaluating the forward prop of the value network for a single move is likely to require a substantial amount of time if you did it by hand.

These sorts of results are amazing, but expect more of the same, more often, over the coming years. More people are getting into machine learning, better algorithms are being developed, and now that “deep learning research” constitutes a market segment for GPU manufacturers, the complexity of the networks we can implement and the datasets we can tackle will expand significantly.

It’s still early in the series, but I can imagine it’s an amazing feeling for David Silver of DeepMind. I read Hamid Maei’s thesis from 2009 a while back, and some of the results presented mentioned Silver’s implementation of the algorithms for use in Go. Seven years between trying some things and seeing how well they work and beating one of the best human Go players. Surreal stuff.

…

Since I’m linking papers, why not peruse the one in Nature that describes AlphaGo?

I totally agree with the first point. Going from mid-level professional to world-class in five months is hard to believe. That is a jump of roughly 500 to 600 Elo points.
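To get a feel for what a gap of that size means, here is a minimal sketch using the standard Elo expected-score formula. It assumes the ratings behind the 500 to 600 figure sit on the usual 400-point logistic scale, which may not hold exactly for Go rating systems; the function name `elo_expected_score` is just an illustrative choice.

```python
def elo_expected_score(rating_gap: float) -> float:
    """Expected score (win probability, ignoring draws) of the stronger
    player, given how many Elo points it leads by.

    Assumes the standard logistic Elo model with a 400-point scale.
    """
    return 1.0 / (1.0 + 10.0 ** (-rating_gap / 400.0))


if __name__ == "__main__":
    # Compare the stronger player's expected score at both ends of the range.
    for gap in (500, 600):
        print(f"{gap} Elo gap: expected score {elo_expected_score(gap):.1%}")
```

Under that assumption, a player 500 to 600 points stronger is expected to win roughly 95% to 97% of games against the weaker one, which is why gaining that much in a few months is so striking.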

What we learned today is that the AI train is moving really fast. Now we can be confident that once it reaches Humanville station, it will whoosh right by.

To learn more about what DeepMind plans next, I recommend this video: Demis Hassabis: The Future of Artificial Intelligence