AlphaZero 完爆前輩AlphaGo Zero，還贏了西洋棋和象棋最強 ... | 寵物協尋網

...ChessandShogibySelf-PlaywithaGeneralReinforcementLearningAlgorithm」，它講述了團隊如何利用AlphaGo的機器學習系統，構建了新的項目AlphaZero。

本文獲合作媒體極客公園授權轉載。[1]

Google 旗下人工智慧公司 DeepMind 發布了一篇新論文「Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm[2]」，它講述了團隊如何利用 AlphaGo 的機器學習系統，構建了新的項目 AlphaZero。AlphaZero 使用了名為「強化學習」（reinforcement learning）的 AI 技術，它只使用了基本規則，沒有人的經驗，從零開始訓練，橫掃了棋類遊戲 AI。

AlphaZero 首先征服了圍棋，又完爆其他棋類遊戲：相同條件下，該系統經過 8 個小時的訓練，打敗了第一個擊敗人類的 AI——李世乭版 AlphaGo；經過 4 個小時的訓練，打敗了之前最強西洋棋 AI Stockfish，2 個小時打敗了最強象棋 AI Elmo。連最強圍棋 AlphaGo 也未能倖免，訓練 34 個小時的 AlphaZero 勝過了訓練 72 小時的 AlphaGo Zero。

AlphaZero 在比賽中贏，平局或輸的局數（來自 DeepMind 團隊論文）強化學習這麼強大，它是什麼？

知名 AI 部落格作者 Adit Deshpande 來自加州大學洛杉磯分校（UCLA），他曾在部落格中發表過「深度學習研究評論[3]」系列文章，解讀了 AlphaGo 勝利背後的力量。他在文章中介紹到，機器學習領域可以分為三大類：監督學習、無監督學習和強化學習。強化學習可以在不同的情景或者環境下學習採取不同的行動，以此來獲得最佳的效果。

Adit Deshpande 的《Deep Learning Research Review Week 2: Reinforcement Learning》

我們想像一個小...

AlphaZero | 寵物協尋網

From the moment it stepped onto the scene, AlphaZero has changed chess by spawning a new generation of neural network chess engines, by contributing to chess ... Read More

AlphaZero | 寵物協尋網

AlphaZero是DeepMind所開發的人工智能軟體。目录. 1 簡介; 2 與Stockfish以及elmo的比較; 3 訓練; 4 成績. 4.1 西洋棋; 4.2 將棋; 4.3 圍棋. 5 相關連結; 6 參考資料 ... Read More

Stockfish 15 (3880) Vs Alphazero (3872) 2022 new Game | 寵物協尋網

AlphaZero 完爆前輩AlphaGo Zero，還贏了西洋棋和象棋最強 ... | 寵物協尋網

... Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm」，它講述了團隊如何利用AlphaGo 的機器學習系統，構建了新的項目AlphaZero。 Read More

AlphaZero Chess: How It Works | 寵物協尋網

2022年5月5日 — In short, AlphaZero is a game-playing program that, through a combination of self-play and neural network reinforcement learning (more on that ... Read More

AlphaZero: Shedding new light on chess | 寵物協尋網

2018年12月6日 — In late 2017 we introduced AlphaZero, a single system that taught itself from scratch how to master the games of chess, shogi(Japanese ... Read More

相關資訊整理

混種犬-阿發遺失 - 黑 9.9E+14

於「南投縣國姓鄉北港村長北路」遺失的混種犬阿發，以下提供飼主姓名、電話、Email，以及晶片號碼、寵物外觀及特徵等資訊，...

AlphaZero 完爆前輩AlphaGo Zero，還贏了西洋棋和象棋最強 ... | 寵物協尋網

AlphaZero | 寵物協尋網

AlphaZero | 寵物協尋網

Stockfish 15 (3880) Vs Alphazero (3872) 2022 new Game | 寵物協尋網

AlphaZero 完爆前輩AlphaGo Zero，還贏了西洋棋和象棋最強 ... | 寵物協尋網

AlphaZero Chess: How It Works | 寵物協尋網

AlphaZero: Shedding new light on chess | 寵物協尋網

混種犬-阿發遺失 - 黑 9.9E+14

貴賓-乖乖遺失 - 黑色捲毛 000210D105

臘腸-小乖遺失 - 短毛 020002104F

混種貓-小三花遺失 - 花色短毛 9.00073E+14

馬爾濟斯-喬喬遺失 - 白長毛 125675167A

西施-皮皮遺失 - 黑白短毛 125235664A

柴犬-陳約翰遺失 - 9.00202E+14

混種狗-RUN-RUN遺失 - 黃白短毛 123271643A

哈士奇-哈哈遺失 - 黑白短毛 02000234C8

博美-揚揚遺失 - 白短毛 134474760A

法國鬥牛犬-遺失 - 9.9E+14

拉不拉多-遺失 - 短毛 0005FD9138

混種貓-海橘遺失 - 9.9E+14

西施-嘟嘟遺失 - 棕白長毛 123314715A

混種狗-遺失 - 花色短毛 000606590D

混種犬-大熊遺失 - 黃色 9.00138E+14