AlphaGo Zero beats its old rival after 3 days; Ke Jie: "Humans are too superfluous"

見習騎士 | 2017-10-20 08:49:04

Post #1

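In May this year, the artificial intelligence "AlphaGo" was retired after defeating Chinese Go master Ke Jie. Its developer, DeepMind, has since created a new generation, "AlphaGo Zero", which quickly taught itself Go without any human input data and beat "AlphaGo" 100 to 0. More surprising still, this self-learning process took only 3 days. Today (the 19th), Ke Jie wrote dejectedly on Weibo: "As far as AlphaGo's self-improvement is concerned… humans are too superfluous (disappointed)".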

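According to The Paper (澎湃新聞), the DeepMind team published its research on "AlphaGo Zero" as a paper in the scientific journal Nature, stating that "after millions of games of self-play and training, AlphaGo Zero independently discovered principles of Go that took humans thousands of years to work out, and also developed new strategies, bringing new insights to this ancient game" and that "AlphaGo Zero's level has surpassed all previous versions of AlphaGo".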

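In response to Ke Jie's heavy-hearted post, mainland Chinese netizens left encouraging comments: "Pull its plug next time", "The appeal of competitive sport lies in pursuing the limits of humanity. Enjoy the process, young man; you are the chosen one among millions", "So you two should get married; you're the only human it doesn't find superfluous", and "Do you think a scene from the movies might happen now, with Arnold suddenly traveling through time to appear before us and telling us this is Skynet?" Ke Jie also replied to netizens: "I feel like Google is going to suddenly show up in front of me today".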

AlphaGo Zero is a version of DeepMind's Go software AlphaGo. AlphaGo's team published an article in the journal Nature on 19 October 2017, introducing AlphaGo Zero, a version trained without human data and stronger than any previous version.[1] By playing games against itself, AlphaGo Zero surpassed the strength of AlphaGo Lee in three days, winning 100 games to 0, reached the level of AlphaGo Master in 21 days, and exceeded all the old versions in 40 days.[2]

Training AIs without datasets derived from human experts has significant implications for the development of AIs with superhuman skills, because expert data is "often expensive, unreliable or simply unavailable."[3] Demis Hassabis, the co-founder and CEO of DeepMind, said that AlphaGo Zero was so powerful because it was "no longer constrained by the limits of human knowledge".[4]

AlphaGo Zero
Hardware: 4 TPUs[2] v2, single machine
Elo rating: 5,185[1]
Matches: 100:0 against AlphaGo Lee; 89:11 against AlphaGo Master