homehome Home chatchat Notifications


Google's AlphaZero surpassed the sum of human chess knowledge -- in 4 hours

Feeling outdated yet?

Alexandru Micu
December 7, 2017 @ 6:52 pm

share Share

Google’s latest AI, AlphaZero, just defeated the world’s champion chess program Stockfish — after only four hours of learning, by itself, without any human input beyond the game’s rules.

Chess Game.

Image via Pixabay.

Mastering chess can take us a lifetime, but with a big enough brain, it will hardly keep you occupied for one afternoon. At least, that’s the case with Google‘s newest AI installment AlphaZero. The program showcased a “superhuman performance” with the game, beating the world’s champion program Stockfish after only four hours’ practice.

To be blunt, this AI managed to surpass the highest peaks of human achievement in chess in half your shift.

Castle and rule

AlphaZero was instructed only on the ruleset of chess and nothing more. Starting without any strategy to use as a crutch, the AI needed only four hours to master the game to such an extent that it destroyed Stockfish — the highest-rated chess-playing program today.

The firm’s DeepMind division says that it played 100 games against Stockfish 8. Each program was given one minute’s worth of thinking time per move. AlphaZero won 25 games in which it played with white (gaining the first-move advantage) and a further three in which it played black. The two programs drew the remaining 72 games.

Stockfish 8 had previously won 2016’s Top Chess Engine Championship. The software was first released in 2008 and has been improved on by volunteers in the years since.

“We now know who our new overlord is,” quipped chess researcher David Kramaley, CEO of chess science website Chessable. “It will no doubt revolutionise the game, but think about how this could be applied outside chess. This algorithm could run cities, continents, universes.”

AlphaZero was developed at Google’s DeepMind labs and is a more generic version of AlphaGo Zero, the AI that ousted the human champion of Go, a Chinese board game considered to be the most difficult strategy game in the world. The Go victory was, so far, considered the bleeding edge of its ability, but DeepMind has kept working on and refining this AI, culminating in a startling success in October: a new, fully autonomous version of the AI, which only learned by playing against itself, never humans, bested all its previous incarnations.

By contrast, AlphaGo Zero’s predecessors learned how to play the game, in part, by watching moves made by human players. This was believed to help the fledgling software improve its game. However, in a slight blow to the human ego, it might have actually hindered the AI, considering that AlphaGo Zero’s fully self-reliant learning was so much more effective in a one-on-one competition.

“What we’re seeing here is a model free from human bias and presuppositions. It can learn whatever it determines is optimal, which may indeed be more nuanced that our own conceptions of the same,” MIT computer scientist Nick Hynes told Gizmodo following the October victory.

“It’s like an alien civilisation inventing its own mathematics.”

But it took AlphaZero less than two months to best even that achievement. In their new paper, the team showcases how the very latest AlphaZero AI takes this self-playing method — called reinforcement learning — and mixes it with a much more generally-applicable frame of thought. All in all, this allows the AI to understand and solve a broader range of problems. It doesn’t play just chess, but also Shogi (Japanese chess) as well as Go — and it took only two and eight hours respectively to master these games.

For now, Google’s scientists aren’t publicly commenting on the research, and the paper is still awaiting peer-review. But for now, one thing is certain: AlphaZero made a lot of waves in the chess community.

“I always wondered how it would be if a superior species landed on Earth and showed us how they played chess,” grandmaster Peter Nielsen told BBC.

“Now I know.”

Who knows, maybe AlphaZero will be the computer to finally crack chess forever.

The paper “Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm” has been published on Cornell University’s site arXiv.

share Share

How Hot is the Moon? A New NASA Mission is About to Find Out

Understanding how heat moves through the lunar regolith can help scientists understand how the Moon's interior formed.

This 5,500-year-old Kish tablet is the oldest written document

Beer, goats, and grains: here's what the oldest document reveals.

A Huge, Lazy Black Hole Is Redefining the Early Universe

Astronomers using the James Webb Space Telescope have discovered a massive, dormant black hole from just 800 million years after the Big Bang.

Did Columbus Bring Syphilis to Europe? Ancient DNA Suggests So

A new study pinpoints the origin of the STD to South America.

The Magnetic North Pole Has Shifted Again. Here’s Why It Matters

The magnetic North pole is now closer to Siberia than it is to Canada, and scientists aren't sure why.

For better or worse, machine learning is shaping biology research

Machine learning tools can increase the pace of biology research and open the door to new research questions, but the benefits don’t come without risks.

This Babylonian Student's 4,000-Year-Old Math Blunder Is Still Relatable Today

More than memorializing a math mistake, stone tablets show just how advanced the Babylonians were in their time.

Sixty Years Ago, We Nearly Wiped Out Bed Bugs. Then, They Started Changing

Driven to the brink of extinction, bed bugs adapted—and now pesticides are almost useless against them.

LG’s $60,000 Transparent TV Is So Luxe It’s Practically Invisible

This TV screen vanishes at the push of a button.

Couple Finds Giant Teeth in Backyard Belonging to 13,000-year-old Mastodon

A New York couple stumble upon an ancient mastodon fossil beneath their lawn.