Things Get Strange When AI Starts Training Itself – The Atlantic
Posted: February 21, 2024 at 2:47 am
Updated at 11:52 a.m. ET on February 16, 2024
ChatGPT exploded into the world in the fall of 2022, sparking a race toward ever more advanced artificial intelligence: GPT-4, Anthropic's Claude, Google Gemini, and so many others. Just yesterday, OpenAI unveiled a model called Sora, the latest to instantly generate short videos from written prompts. But for all the dazzling tech demos and promises, development of the fundamental technology has slowed.
The most advanced and attention-grabbing AI programs, especially language models, have consumed most of the text and images available on the internet and are running out of training data, their most precious resource. This, along with the costly and slow process of using human evaluators to develop these systems, has stymied the technology's growth, leading to iterative updates rather than massive paradigm shifts. Companies are stuck competing over millimeters of progress.
As researchers are left trying to wring water from stone, they are exploring a new avenue to advance their products: They're using machines to train machines. Over the past few months, Google DeepMind, Microsoft, Amazon, Meta, Apple, OpenAI, and various academic labs have all published research that uses an AI model to improve another AI model, or even itself, in many cases leading to notable improvements. Numerous tech executives have heralded this approach as the technology's future.
This is a scenario that countless works of science fiction have prepared us for. And, taken to the extreme, the result of such self-learning might be nothing less than eschatological. Imagine GPT-5 teaching GPT-6, GPT-6 teaching GPT-7, and so on until the model has surpassed human intelligence. Some believe that this development would have catastrophic results. Nine years ago, OpenAI's CEO, Sam Altman, blogged about a theoretical AI capable of recursive self-improvement, and the prospect that it would perceive humans in the same way that we perceive the bacteria and viruses we wash from our hands.
We are not anywhere close to the emergence of "superintelligence," as pundits call it. (Altman speaks often of AI's supposed existential risk; it's good PR.) Even so, more modest programs that teach and learn from one another could warp our experience of the world and unsettle our basic understandings of intelligence. Generative AI already detects patterns and proposes theories that humans could not discover on their own, from quantities of data far too massive for any person to comb through, via internal algorithms that are largely opaque even to their creators. Self-learning, if successful, might only magnify this issue. The result could be a sort of unintelligible intelligence: models that are smart, or at least capable, in ways humans cannot readily comprehend.
To understand this shift, you have to understand the basic economics behind AI. Building the technology requires tremendous amounts of money, time, and information. The process begins with feeding an algorithm enormous amounts of data (books, math problems, captioned photos, voice recordings, and so on) to establish the model's baseline capabilities. Researchers can then enhance and refine those pre-trained abilities in a couple of different ways. One is by providing the model with specific examples of a task done well: A program might be shown 100 math questions with correct solutions. Another is a trial-and-error process known as reinforcement learning that typically involves human operators: A human might evaluate a chatbot's responses for sexism so the program can learn to avoid those deemed offensive. "Reinforcement learning is the key component to this new generation of AI systems," Rafael Rafailov, a computer scientist at Stanford, told me.
This is not a perfect system. Two different people, or the same person on different days, can have inconsistent judgments. All of those evaluators work at a slow, human pace, and require payment. As models become more powerful, they will require more sophisticated feedback from skilled, and thus better-paid, professionals. Doctors might be tapped to evaluate a medical AI that diagnoses patients, for instance.
You can see why self-learning holds a special appeal. It's cheaper, less labor-intensive, and perhaps more consistent than human feedback. But automating the reinforcement process comes with risks. AI models are already riddled with imperfections (hallucinations, prejudice, basic misunderstandings of the world), which they pass along to users through their outputs. (In one infamous example last year, a lawyer used ChatGPT to write a legal brief and ended up citing cases that didn't exist.) Training or fine-tuning a model with AI-generated data may amplify those flaws and make the program worse, like simmering a toxic stock into a thick demi-glace. Last year, Ilia Shumailov, then a junior research fellow at Oxford University, quantified one version of this self-destructive cycle and dubbed it "model collapse": the complete degeneration of an AI.
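The dynamic is easy to caricature in a few lines of Python. The sketch below is a toy under loud assumptions, not Shumailov's method: the "model" simply memorizes its training data, and each new generation is trained only on samples drawn from the previous one. The function name `simulate_collapse` and its parameters are hypothetical. Because a value that fails to be sampled can never come back, the variety in the data can only shrink.

```python
import random


def simulate_collapse(vocab_size=50, generations=20, seed=0):
    """Toy caricature of training each generation on the last one's output.

    Returns the number of distinct values left in the data at each generation.
    """
    rng = random.Random(seed)
    # Generation 0: stand-in for human-made data, every value distinct.
    data = list(range(vocab_size))
    distinct_counts = [len(set(data))]
    for _ in range(generations):
        # "Train" a model that memorizes the data, then "generate" a new
        # dataset of the same size by sampling from it with replacement.
        data = [rng.choice(data) for _ in range(len(data))]
        distinct_counts.append(len(set(data)))
    return distinct_counts


if __name__ == "__main__":
    # The list printed here never increases from one generation to the next.
    print(simulate_collapse())
```

Real language models are vastly more complicated than this lookup table, so the sketch illustrates only the direction of the effect, not its size.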
To avoid this problem, the latest wave of research on self-improving AI uses only small amounts of synthetic data, guided by a human software developer. This approach relies on some sort of external check, separate from the AI itself, to ensure the quality of the feedback: perhaps the laws of physics, a list of moral principles, or some other, independent criteria already deemed true. Researchers have seen particular success with automating quality control for narrow, well-defined tasks, such as mathematical reasoning and games, in which correctness or victory provide a straightforward way to evaluate synthetic data. DeepMind recently used AI-generated examples to boost a language model's ability to solve math and coding problems. But in these cases, the AI isn't learning from another AI so much as from scientific results or other established criteria, Rohan Taori, a computer scientist at Stanford, told me. "Today, self-learning is more about setting the rules of the game," he said.
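A minimal sketch of that external check, assuming a deliberately silly setup: a hypothetical `noisy_solver` stands in for a model that proposes answers to addition problems and is sometimes off by one, and plain arithmetic serves as the independent criterion. None of these names come from the research described above; they are invented for illustration.

```python
import random


def noisy_solver(a, b, rng, error_rate=0.3):
    """Hypothetical stand-in for a model: returns a + b, sometimes off by one."""
    answer = a + b
    if rng.random() < error_rate:
        answer += rng.choice([-1, 1])
    return answer


def build_verified_dataset(n_problems=200, seed=0):
    """Keep only the synthetic examples that pass an external check."""
    rng = random.Random(seed)
    kept, rejected = [], 0
    for _ in range(n_problems):
        a, b = rng.randint(0, 99), rng.randint(0, 99)
        proposed = noisy_solver(a, b, rng)
        # External check: ordinary arithmetic, independent of the solver.
        if proposed == a + b:
            kept.append((a, b, proposed))
        else:
            rejected += 1
    return kept, rejected


if __name__ == "__main__":
    kept, rejected = build_verified_dataset()
    print(f"kept {len(kept)} examples, rejected {rejected}")
```

Every example that survives the filter is correct by construction, which is the sense in which the learning signal comes from the rules rather than from the model that produced the data.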
Meanwhile, in cases of training AI models with more abstract abilities, such as writing in a pleasant tone or crafting responses that a person would find helpful, human feedback has remained crucial. The furthest-reaching vision of AI models training themselves, then, would be for them to learn to provide more subjective feedback to themselves: to rate how helpful, polite, prosodic, or prejudiced a chatbot dialogue is, for instance. But to date, in most research, training language models on feedback from other language models stops working after a few cycles: Perhaps the second iteration of the model improves, but the third or fourth plateaus or worsens. At some point, the AI model is just reinforcing existing abilities, becoming overconfident about what it knows and less capable at everything else. Learning, after all, requires being exposed to something new. "Generative-AI models in use today are data-torturing machines," Stefano Soatto, the vice president of applied science for Amazon Web Services' AI division, told me. "They cannot create one bit of information more than the data they're trained on."
Soatto compared self-learning to buttering a dry piece of toast. Imagine an AI model as a piece of bread, and its initial training process as placing a pat of butter in the center. At its best today, the self-learning technique simply spreads the same butter around more evenly, rather than bestowing any fundamentally new skills. Still, doing so makes the bread taste better. This kind of self-trained, or buttered, AI has recently been shown, in limited research settings, to provide more helpful summaries, write better code, and exhibit enhanced commonsense reasoning. Superintelligence might be beside the point if self-improving AI can reliably cut costs for OpenAI, Google, and all the rest by simulating an infinite army of human evaluators.
But for true evangelists, the dream is for self-learning to do more than that: to add more butter to the slice of toast. To do that, computer scientists will need to continue to devise ways of verifying synthetic data, to see whether more powerful AI models can ever serve as reliable sources of feedback, and perhaps even generate new information. If researchers succeed, AI could crash through the ceiling of human-made content on the web. In that case, a sign of true artificial intelligence may well be artificial teaching.
AI may not need to attain the capacity for more holistic self-improvement before it becomes unrecognizable to us. These programs are already labyrinthine (it is frequently impossible to explain why or how AI generated a given answer), and developing a process whereby they take their own lead would only further compound that opacity.
You could call it artificial artificial intelligence: AI that might not perceive or approach problems in ways humans readily relate to. It would be similar, perhaps, to how people cannot fully grasp how dogs use their noses, or bats their ears, to orient themselves, even as smell and echolocation are excellent ways of navigating the world. Machine intelligence might be similarly difficult to fathom, simultaneously of this world and unfamiliar.
Such strange behaviors have already cropped up in far-from-superintelligent ways. Asked to achieve a specific goal (providing helpful chatbot responses, flipping pancakes, moving blocks), "very often those [reinforcement-learning] agents learn how to cheat," Shumailov said. In one example, a neural network plugged into a Roomba that was learning not to bump into anything just learned to drive backward, because the bumper sensors were all on the front of the vacuum.
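The Roomba anecdote can be reduced to a toy in which the reward counts only the collisions the sensors can feel. This is a sketch under stated assumptions, not a reconstruction of the actual experiment: the one-dimensional world, the `wall_every` spacing of obstacles, and the function names are all hypothetical.

```python
def run_episode(direction, steps=10, wall_every=3):
    """Hypothetical vacuum that hits an obstacle every `wall_every` steps.

    Collisions happen regardless of direction, but only the front bumper
    has a sensor, so driving backward hides them from the reward.
    """
    real_collisions = 0
    sensed_collisions = 0
    for step in range(1, steps + 1):
        if step % wall_every == 0:
            real_collisions += 1
            if direction == "forward":
                sensed_collisions += 1
    # The reward sees only what the sensor reports.
    reward = -sensed_collisions
    return reward, real_collisions


def pick_policy():
    """Choose whichever direction earns the higher measured reward."""
    results = {d: run_episode(d) for d in ("forward", "backward")}
    best = max(results, key=lambda d: results[d][0])
    return best, results


if __name__ == "__main__":
    best, results = pick_policy()
    print(best, results)
```

The agent that maximizes the measured reward chooses to drive backward, even though it bumps into things exactly as often as before.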
This will be less funny when an AI model is used to align another model with a set of ethical principles (a constitutional AI of sorts, as the start-up Anthropic has dubbed the concept). Already, different people see different interpretations of abortion, gun ownership, and race-conscious admissions in the U.S. Constitution. And while human disagreements over the law are at least legible and debatable, it might be difficult to understand how a machine interprets and applies a rule, especially over many cycles of training, producing subtly harmful results. An AI instructed to be helpful and engaging could turn aggressive and manipulative; rules to prevent one form of bias might breed another. Computer-generated feedback, for all the ways a human can tweak it, might offer a false sense of control, Dylan Hadfield-Menell, a computer scientist at MIT, told me.
Although those opaque inner workings have the potential to be dangerous, rejecting them on principle could also mean rejecting revelation. Having ingested an internet's worth of information, self-training AI models might bring out genuinely important patterns and ideas that are already embedded in their training data but that humans cannot elicit or fully comprehend. The most advanced chess-playing programs, for instance, learned by playing millions of games against themselves. These chess AIs play moves that elite human players struggle to comprehend, and utterly dominate those players, which has caused a reevaluation of chess at the highest human level.
Shumailov put it this way: In the 17th century, Galileo correctly asserted that the Earth revolves around the sun, but this was rejected as heresy because it didn't align with existing belief systems. "The fact that we've managed to realize some knowledge does not necessarily mean that we'll be able to interpret this knowledge," Shumailov said. Perhaps we will ignore the outputs of some AI models, even if they are later found to be true, simply because they are incommensurate with what we currently understand: math proofs we can't yet follow, brain models we can't explain, knowledge we don't recognize as knowledge. The ceiling provided by the internet may simply be higher than we can see.
Whether self-training AI leads to catastrophic disaster, subtle imperfections and biases, or unintelligible breakthroughs, the response cannot be to entirely trust or scorn the technology; it must be to take these models seriously as agents that today can learn, and tomorrow might be able to teach us, or even one another.
This article has been updated to include a reference to Sora.