The Association for Computing Machinery (ACM) has honoured Andrew Barto and Richard Sutton with the prestigious Turing Award for their groundbreaking contributions to reinforcement learning, a fundamental concept in artificial intelligence. Often regarded as the “Nobel Prize of computing,” the award comes with a $1 million prize, which the two researchers will share.
Barto, a researcher at the University of Massachusetts Amherst, first began exploring the idea in 1977, proposing that neurons function similarly to hedonists—seeking pleasure and avoiding pain. In 1978, Sutton joined him in further developing this theory, which laid the foundation for reinforcement learning. This AI training method enables machines to learn from experiences through a system of rewards and penalties, mirroring human and animal learning behaviors.
Their pioneering work has significantly influenced modern AI advancements, including Google’s AlphaGo and OpenAI’s ChatGPT. Their 1998 book, Reinforcement Learning: An Introduction, remains a cornerstone in the field, continuing to shape AI research and development.
Psychologists and scientists had previously explored the ways in which humans and animals learn from experience, and as early as the 1940s, Alan Turing speculated that machines could be trained similarly. However, Barto and Sutton were instrumental in formalizing the mathematical principles behind this approach. Their research built upon earlier theories by A. Harry Klopf, leading them to establish dedicated research labs—Barto at UMass Amherst and Sutton at the University of Alberta in Canada.
Their contributions have been instrumental in advancing artificial intelligence, and experts believe that reinforcement learning is only beginning to unlock its full potential.