AlphaGo Zero – Wikipedia
Posted: October 17, 2020 at 10:54 am
Artificial intelligence that plays Go
AlphaGo Zero is a version of DeepMind's Go software AlphaGo. AlphaGo's team published an article in the journal Nature on 19 October 2017, introducing AlphaGo Zero, a version created without using data from human games, and stronger than any previous version.[1] By playing games against itself, AlphaGo Zero surpassed the strength of AlphaGo Lee in three days by winning 100 games to 0, reached the level of AlphaGo Master in 21 days, and exceeded all the old versions in 40 days.[2]
Training artificial intelligence (AI) without datasets derived from human experts has significant implications for the development of AI with superhuman skills because expert data is "often expensive, unreliable or simply unavailable."[3]Demis Hassabis, the co-founder and CEO of DeepMind, said that AlphaGo Zero was so powerful because it was "no longer constrained by the limits of human knowledge".[4]David Silver, one of the first authors of DeepMind's papers published in Nature on AlphaGo, said that it is possible to have generalised AI algorithms by removing the need to learn from humans.[5]
Google later developed AlphaZero, a generalized version of AlphaGo Zero that could play chess and Shgi in addition to Go. In December 2017, AlphaZero beat the 3-day version of AlphaGo Zero by winning 60 games to 40, and with 8 hours of training it outperformed AlphaGo Lee on an Elo scale. AlphaZero also defeated a top chess program (Stockfish) and a top Shgi program (Elmo).[6][7]
AlphaGo Zero's neural network was trained using TensorFlow, with 64 GPU workers and 19 CPU parameter servers. Only four TPUs were used for inference. The neural network initially knew nothing about Go beyond the rules. Unlike earlier versions of AlphaGo, Zero only perceived the board's stones, rather than having some rare human-programmed edge cases to help recognize unusual Go board positions. The AI engaged in reinforcement learning, playing against itself until it could anticipate its own moves and how those moves would affect the game's outcome.[8] In the first three days AlphaGo Zero played 4.9 million games against itself in quick succession.[9] It appeared to develop the skills required to beat top humans within just a few days, whereas the earlier AlphaGo took months of training to achieve the same level.[10]
For comparison, the researchers also trained a version of AlphaGo Zero using human games, AlphaGo Master, and found that it learned more quickly, but actually performed more poorly in the long run.[11] DeepMind submitted its initial findings in a paper to Nature in April 2017, which was then published in October 2017.[1]
The hardware cost for a single AlphaGo Zero system in 2017, including the four TPUs, has been quoted as around $25 million.[12]
According to Hassabis, AlphaGo's algorithms are likely to be of the most benefit to domains that require an intelligent search through an enormous space of possibilities, such as protein folding or accurately simulating chemical reactions.[13] AlphaGo's techniques are probably less useful in domains that are difficult to simulate, such as learning how to drive a car.[14] DeepMind stated in October 2017 that it had already started active work on attempting to use AlphaGo Zero technology for protein folding, and stated it would soon publish new findings.[15][16]
AlphaGo Zero was widely regarded as a significant advance, even when compared with its groundbreaking predecessor, AlphaGo. Oren Etzioni of the Allen Institute for Artificial Intelligence called AlphaGo Zero "a very impressive technical result" in "both their ability to do itand their ability to train the system in 40 days, on four TPUs".[8]The Guardian called it a "major breakthrough for artificial intelligence", citing Eleni Vasilaki of Sheffield University and Tom Mitchell of Carnegie Mellon University, who called it an impressive feat and an outstanding engineering accomplishment" respectively.[14]Mark Pesce of the University of Sydney called AlphaGo Zero "a big technological advance" taking us into "undiscovered territory".[17]
Gary Marcus, a psychologist at New York University, has cautioned that for all we know, AlphaGo may contain "implicit knowledge that the programmers have about how to construct machines to play problems like Go" and will need to be tested in other domains before being sure that its base architecture is effective at much more than playing Go. In contrast, DeepMind is "confident that this approach is generalisable to a large number of domains".[9]
In response to the reports, South Korean Go professional Lee Sedol said, "The previous version of AlphaGo wasnt perfect, and I believe thats why AlphaGo Zero was made." On the potential for AlphaGo's development, Lee said he will have to wait and see but also said it will affect young Go players. Mok Jin-seok, who directs the South Korean national Go team, said the Go world has already been imitating the playing styles of previous versions of AlphaGo and creating new ideas from them, and he is hopeful that new ideas will come out from AlphaGo Zero. Mok also added that general trends in the Go world are now being influenced by AlphaGos playing style. "At first, it was hard to understand and I almost felt like I was playing against an alien. However, having had a great amount of experience, Ive become used to it," Mok said. "We are now past the point where we debate the gap between the capability of AlphaGo and humans. Its now between computers." Mok has reportedly already begun analyzing the playing style of AlphaGo Zero along with players from the national team. "Though having watched only a few matches, we received the impression that AlphaGo Zero plays more like a human than its predecessors," Mok said.[18] Chinese Go professional, Ke Jie commented on the remarkable accomplishments of the new program: "A pure self-learning AlphaGo is the strongest. Humans seem redundant in front of its self-improvement."[19]
Future of Go Summit
89:11 against AlphaGo Master
On 5 December 2017, DeepMind team released a preprint on arXiv, introducing AlphaZero, a program using generalized AlphaGo Zero's approach, which achieved within 24 hours a superhuman level of play in chess, shogi, and Go, defeating world-champion programs, Stockfish, Elmo, and 3-day version of AlphaGo Zero in each case.[6]
AlphaZero (AZ) is a more generalized variant of the AlphaGo Zero (AGZ) algorithm, and is able to play shogi and chess as well as Go. Differences between AZ and AGZ include:[6]
An open source program, Leela Zero, based on the ideas from the AlphaGo papers is available. It uses a GPU instead of the TPUs recent versions of AlphaGo rely on.
Link:
- This 90's Japanese commercial for Street Fighter Alpha 2 doesn't make a ton of sense, but it somehow still makes us want to play some Alpha -... [Last Updated On: December 9th, 2019] [Originally Added On: December 9th, 2019]
- Artificial intelligence: How to measure the I in AI - TechTalks [Last Updated On: December 9th, 2019] [Originally Added On: December 9th, 2019]
- Doubting The AI Mystics: Dramatic Predictions About AI Obscure Its Concrete Benefits - Forbes [Last Updated On: December 9th, 2019] [Originally Added On: December 9th, 2019]
- From AR to AI: The emerging technologies marketers can explore to enable and disrupt - Marketing Tech [Last Updated On: December 13th, 2019] [Originally Added On: December 13th, 2019]
- MuZero figures out chess, rules and all - Chessbase News [Last Updated On: December 13th, 2019] [Originally Added On: December 13th, 2019]
- John Robson: Why is man so keen to make man obsolete? - National Post [Last Updated On: December 18th, 2019] [Originally Added On: December 18th, 2019]
- Artificial intelligence in the arms race: Commentary by Avi Ben Ezra - Augusta Free Press [Last Updated On: February 9th, 2020] [Originally Added On: February 9th, 2020]
- Explained: The Artificial Intelligence Race is an Arms Race - The National Interest Online [Last Updated On: February 9th, 2020] [Originally Added On: February 9th, 2020]
- Google's DeepMind effort for COVID-19 coronavirus is based on the shoulders of giants - Mashviral News - Mash Viral [Last Updated On: March 8th, 2020] [Originally Added On: March 8th, 2020]
- Fat Fritz 1.1 update and a small gift - Chessbase News [Last Updated On: March 8th, 2020] [Originally Added On: March 8th, 2020]
- Magnus Carlsen: "In my country the authorities reacted quickly and the situation is under control" - Sportsfinding [Last Updated On: April 6th, 2020] [Originally Added On: April 6th, 2020]
- ACM Prize in Computing Awarded to AlphaGo Developer - HPCwire [Last Updated On: April 6th, 2020] [Originally Added On: April 6th, 2020]
- AlphaZero Crushes Stockfish In New 1,000-Game Match ... [Last Updated On: October 17th, 2020] [Originally Added On: October 17th, 2020]
- AlphaZero: Shedding new light on chess, shogi, and Go ... [Last Updated On: October 17th, 2020] [Originally Added On: October 17th, 2020]
- AlphaZero - Wikipedia [Last Updated On: October 17th, 2020] [Originally Added On: October 17th, 2020]
- When 3 is greater than 5 - Chessbase News [Last Updated On: October 22nd, 2020] [Originally Added On: October 22nd, 2020]
- Facebook AI Introduces 'ReBeL': An Algorithm That Generalizes The Paradigm Of Self-Play Reinforcement Learning And Search To Imperfect-Information... [Last Updated On: December 14th, 2020] [Originally Added On: December 14th, 2020]
- AI has almost solved one of biologys greatest challenges how protein unfolds - ThePrint [Last Updated On: December 14th, 2020] [Originally Added On: December 14th, 2020]
- Scientists say dropping acid can help with social anxiety and alcoholism - The Next Web [Last Updated On: January 31st, 2021] [Originally Added On: January 31st, 2021]
- Toronto scientists help create AI-powered bot that can play chess like a human - blogTO [Last Updated On: January 31st, 2021] [Originally Added On: January 31st, 2021]
- This AI chess engine aims to help human players rather than defeat them - The Next Web [Last Updated On: January 31st, 2021] [Originally Added On: January 31st, 2021]
- Artificial Intelligence, and the Future of Work Should We Be Worried? - BBN Times [Last Updated On: October 21st, 2021] [Originally Added On: October 21st, 2021]
- What Happened in Reinforcement Learning in 2021 - Analytics India Magazine [Last Updated On: November 14th, 2021] [Originally Added On: November 14th, 2021]
- How AI is impacting the video game industry - ZME Science [Last Updated On: December 15th, 2021] [Originally Added On: December 15th, 2021]
- Quest Pro is here, Google and Valve report back - MIXED Reality News [Last Updated On: October 20th, 2022] [Originally Added On: October 20th, 2022]
- AI now not only debates with humans but negotiates and cajoles too - Mint [Last Updated On: November 26th, 2022] [Originally Added On: November 26th, 2022]
- Newspoll quarterly aggregates: July to December (open thread ... - The Poll Bludger [Last Updated On: December 29th, 2022] [Originally Added On: December 29th, 2022]
- MPL 59th National Senior R3: The Systematic Pawn Structure ... - ChessBase India [Last Updated On: December 29th, 2022] [Originally Added On: December 29th, 2022]
- Personality traits and decision-making styles among obstetricians ... - Nature.com [Last Updated On: April 6th, 2023] [Originally Added On: April 6th, 2023]
- What Brains of the Past Teach Us About the AI of the Future - Next Big Idea Club Magazine [Last Updated On: November 26th, 2023] [Originally Added On: November 26th, 2023]