What Happened in Reinforcement Learning in 2021 – Analytics India Magazine
Posted: November 14, 2021 at 1:45 am
One of the most exciting areas in machine learning right now is reinforcement learning. Its application is found in a diverse set of sectors like data processing, robotics, manufacturing, recommender systems, energy, and games, among others.
What makes reinforcement learning (RL) different from other kinds of algorithms is that it does not depend on historical data sets. It learns through trial and error like human beings.
Understanding its importance, the last few years have seen an accelerated pace in understanding and improving RL. Think of any big name in tech- be it Facebook, Google, DeepMind, Amazon, or Microsoft, they are all investing significant time, money and effort in bringing out innovations in RL.
For robots to be useful to mankind, they need to perform a variety of tasks. But, even training for one task using offline reinforcement learning will take a massive amount of time and huge computational expenditure.
To work on this issue, Google came out with MT-Opt and Actionable Models. While the first one is a multi-task RL system for automated data collection and multi-task RL training, the latter is a data collection mechanism to collect episodes of various tasks on real robots and demonstrates a successful application of multi-task RL. They also help robots to learn new tasks more quickly.
A leader in the reinforcement learning space, DeepMind gave us some unique innovations this year. It released RGB-stacking as a benchmark for vision-based robotic manipulation. Here, DeepMind used reinforcement learning to train a robotic arm to balance and stack objects of different shapes.
The diversity of objects used and the number of empirical evaluations performed made this reinforcement learning-based project unique. The learning pipeline was divided into three stages- training in simulation by using an off-the-shelf RL algorithm, training a new policy simulation with only realistic observations, and lastly, collecting data using this policy on real robots and bringing out an improved policy from this.
The implementation of sequential decision processes is crucial for those working in reinforcement learning. In order to simplify such a process, social media giant Facebook (now Meta) came out with SaLinA just a month back. It is built as an extension of PyTorch and can work in both supervised and unsupervised situations with compatibility options with multiple CPUs and GPUs. Such a method will see usage in systems where large-scale training use cases are involved.
IBM, too, has been active in the reinforcement learning segment in 2021. It released the text-based gaming environment called TextWorld Commonsense (TWC) to work on the problem of infusing RL agents with commonsense knowledge. This method was used to train and evaluate RL agents with a specific commonsense knowledge about objects, their attributes, and affordances. It worked on the issue of sequential decision making by introducing several baseline RL agents.
In the self-supervised learning area, we saw new methodologies coming out. Google released an approach called Reversibility-Aware RL, which adds a separate reversibility estimation component to the self-supervised RL procedure. Google said this method increases the performance of RL agents on several tasks, including the Sokoban puzzle game.
As reinforcement learning has a significant impact on games, in the middle of 2021, we saw DeepMind training agents playing games without intervention with the help of reinforcement learning mechanisms. Though previous innovations by DeepMind like AlphaZero beat world champion programs in Chess, Shogi and Go, they still trained separately on each game, unable to learn a new one without repeating the RL procedure from the beginning.
Through this method, however, the agents were able to react to new conditions with adaptation flexibility to new environments. The core part of this research relied on how deep RL can play a role in training neural networks of the agents.
Google has been working on using RL in the gaming domain. In early 2021, it released Evolving Reinforcement Learning Algorithms, which showed how to learn analytically interpretable and generalisable RL algorithms by using a graph representation and applying optimisation techniques from the AutoML community.
It used Regularized Evolution to evolve a population of the computational graphs over a set of simple training environments. This helped to better RL algorithms in complex environments with visual observations like Atari games.
With so much happening in the RL space, interest in this area is bound to grow among students and the professional community. To cater to the growing demand, Microsoft organised the Reinforcement Learning (RL) Open Source Fest to introduce students to open source reinforcement learning programs and software development.
Researchers from DeepMind teamed up with the University College London (UCL) to offer students a comprehensive introduction to modern reinforcement learning. It intended to give students a detailed understanding of topics like Markov Decision Processes, sample-based learning algorithms, deep reinforcement learning, etc.
Reinforcement learning and its advancements still have a long way to go, but there has been major progress in the last couple of years. Its usage can be a game-changer for certain industries. With more and more research coming in RL, we can expect to see major breakthroughs in the near future.
Sreejani Bhattacharyya is a journalist with a postgraduate degree in economics. When not writing, she is found reading on geopolitics, economy and philosophy. She can be reached at sreejani.bhattacharyya@analyticsindiamag.com
View original post here:
What Happened in Reinforcement Learning in 2021 - Analytics India Magazine
- This 90's Japanese commercial for Street Fighter Alpha 2 doesn't make a ton of sense, but it somehow still makes us want to play some Alpha -... [Last Updated On: December 9th, 2019] [Originally Added On: December 9th, 2019]
- Artificial intelligence: How to measure the I in AI - TechTalks [Last Updated On: December 9th, 2019] [Originally Added On: December 9th, 2019]
- Doubting The AI Mystics: Dramatic Predictions About AI Obscure Its Concrete Benefits - Forbes [Last Updated On: December 9th, 2019] [Originally Added On: December 9th, 2019]
- From AR to AI: The emerging technologies marketers can explore to enable and disrupt - Marketing Tech [Last Updated On: December 13th, 2019] [Originally Added On: December 13th, 2019]
- MuZero figures out chess, rules and all - Chessbase News [Last Updated On: December 13th, 2019] [Originally Added On: December 13th, 2019]
- John Robson: Why is man so keen to make man obsolete? - National Post [Last Updated On: December 18th, 2019] [Originally Added On: December 18th, 2019]
- Artificial intelligence in the arms race: Commentary by Avi Ben Ezra - Augusta Free Press [Last Updated On: February 9th, 2020] [Originally Added On: February 9th, 2020]
- Explained: The Artificial Intelligence Race is an Arms Race - The National Interest Online [Last Updated On: February 9th, 2020] [Originally Added On: February 9th, 2020]
- Google's DeepMind effort for COVID-19 coronavirus is based on the shoulders of giants - Mashviral News - Mash Viral [Last Updated On: March 8th, 2020] [Originally Added On: March 8th, 2020]
- Fat Fritz 1.1 update and a small gift - Chessbase News [Last Updated On: March 8th, 2020] [Originally Added On: March 8th, 2020]
- Magnus Carlsen: "In my country the authorities reacted quickly and the situation is under control" - Sportsfinding [Last Updated On: April 6th, 2020] [Originally Added On: April 6th, 2020]
- ACM Prize in Computing Awarded to AlphaGo Developer - HPCwire [Last Updated On: April 6th, 2020] [Originally Added On: April 6th, 2020]
- AlphaZero Crushes Stockfish In New 1,000-Game Match ... [Last Updated On: October 17th, 2020] [Originally Added On: October 17th, 2020]
- AlphaGo Zero - Wikipedia [Last Updated On: October 17th, 2020] [Originally Added On: October 17th, 2020]
- AlphaZero: Shedding new light on chess, shogi, and Go ... [Last Updated On: October 17th, 2020] [Originally Added On: October 17th, 2020]
- AlphaZero - Wikipedia [Last Updated On: October 17th, 2020] [Originally Added On: October 17th, 2020]
- When 3 is greater than 5 - Chessbase News [Last Updated On: October 22nd, 2020] [Originally Added On: October 22nd, 2020]
- Facebook AI Introduces 'ReBeL': An Algorithm That Generalizes The Paradigm Of Self-Play Reinforcement Learning And Search To Imperfect-Information... [Last Updated On: December 14th, 2020] [Originally Added On: December 14th, 2020]
- AI has almost solved one of biologys greatest challenges how protein unfolds - ThePrint [Last Updated On: December 14th, 2020] [Originally Added On: December 14th, 2020]
- Scientists say dropping acid can help with social anxiety and alcoholism - The Next Web [Last Updated On: January 31st, 2021] [Originally Added On: January 31st, 2021]
- Toronto scientists help create AI-powered bot that can play chess like a human - blogTO [Last Updated On: January 31st, 2021] [Originally Added On: January 31st, 2021]
- This AI chess engine aims to help human players rather than defeat them - The Next Web [Last Updated On: January 31st, 2021] [Originally Added On: January 31st, 2021]
- Artificial Intelligence, and the Future of Work Should We Be Worried? - BBN Times [Last Updated On: October 21st, 2021] [Originally Added On: October 21st, 2021]
- How AI is impacting the video game industry - ZME Science [Last Updated On: December 15th, 2021] [Originally Added On: December 15th, 2021]
- Quest Pro is here, Google and Valve report back - MIXED Reality News [Last Updated On: October 20th, 2022] [Originally Added On: October 20th, 2022]
- AI now not only debates with humans but negotiates and cajoles too - Mint [Last Updated On: November 26th, 2022] [Originally Added On: November 26th, 2022]
- Newspoll quarterly aggregates: July to December (open thread ... - The Poll Bludger [Last Updated On: December 29th, 2022] [Originally Added On: December 29th, 2022]
- MPL 59th National Senior R3: The Systematic Pawn Structure ... - ChessBase India [Last Updated On: December 29th, 2022] [Originally Added On: December 29th, 2022]
- Personality traits and decision-making styles among obstetricians ... - Nature.com [Last Updated On: April 6th, 2023] [Originally Added On: April 6th, 2023]
- What Brains of the Past Teach Us About the AI of the Future - Next Big Idea Club Magazine [Last Updated On: November 26th, 2023] [Originally Added On: November 26th, 2023]