How AI beat human experts at poker revealed

Libratus collectively amassed more than USD 1.8 million in chips

Updated - December 18, 2017 02:03 pm IST

Published - December 18, 2017 02:00 pm IST

 AI programs have defeated top humans in checkers, chess and Go.

Libratus, the artificial intelligence (AI) that defeated four top professional poker players earlier this year, uses a three-pronged approach to master a game with more decision points than atoms in the universe, scientists say.

In a study published in the journal Science, researchers from Carnegie Mellon University in the US detailed how their AI achieved superhuman performance by breaking the game into computationally manageable parts and, based on its opponents’ game play, fixing potential weaknesses in its own strategy during the competition.

AI programs have defeated top humans in checkers, chess and Go - all challenging games, but ones in which both players know the exact state of the game at all times. Poker players, by contrast, contend with hidden information - what cards their opponents hold and whether an opponent is bluffing.

In a 20-day competition involving 120,000 hands at Rivers Casino in Pittsburgh in January, Libratus became the first AI to defeat top human players at heads-up no-limit Texas Hold’em poker - the primary benchmark and long-standing challenge problem for AIs.

Libratus beat each of the players individually in the two-player game and collectively amassed more than USD 1.8 million in chips.

“The techniques in Libratus do not use expert domain knowledge or human data and are not specific to poker. Thus they apply to a host of imperfect-information games,” researchers said. Such hidden information is ubiquitous in real-world strategic interactions, including business negotiation, cybersecurity, finance, strategic pricing and military applications.

Libratus includes three main modules. The first computes an abstraction of the game that is smaller and easier to solve than the full game with all of its possible decision points. It then creates its own detailed strategy for the early rounds and a coarse strategy for the later rounds. This is called the blueprint strategy. In the final rounds of the game, a second module constructs a new, finer-grained abstraction based on the state of play. The third module is designed to improve the blueprint strategy as the competition proceeds. Typically, AIs use machine learning to find mistakes in the opponent’s strategy and exploit them; Libratus instead uses its opponents’ game play to find and fix potential weaknesses in its own strategy.
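The idea of an abstraction can be illustrated with a toy example. The Python sketch below is not Libratus's code, and the names and bet sizes in it (ABSTRACT_BETS, abstract_bet, count_sequences) are assumptions made up for illustration. It treats every real bet as the nearest of a few allowed sizes, and shows how sharply that shrinks the number of betting sequences a solver would have to consider.

```python
# Hypothetical coarse set of allowed bet sizes, as fractions of the pot.
# These values are illustrative assumptions, not taken from Libratus.
ABSTRACT_BETS = [0.5, 1.0, 2.0]


def abstract_bet(bet, pot, sizes=ABSTRACT_BETS):
    """Map a real bet to the nearest allowed size (as a fraction of the pot)."""
    fraction = bet / pot
    return min(sizes, key=lambda s: abs(s - fraction))


def count_sequences(num_actions, depth):
    """Count betting sequences with num_actions choices at each of depth decisions."""
    return num_actions ** depth


if __name__ == "__main__":
    # A bet of 130 chips into a pot of 100 is treated as a pot-sized bet.
    print(abstract_bet(130, 100))
    # 200 distinct bet sizes versus 3 abstract sizes, over 4 decisions.
    print(count_sequences(200, 4), "sequences versus", count_sequences(3, 4))
```

A coarse abstraction like this is cheap to solve but loses detail, which is why, as the researchers describe, a finer-grained abstraction is built for the final rounds.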

In addition to beating the human pros, Libratus was evaluated against the best prior poker AIs. “Due to the ubiquity of hidden information in real-world strategic interactions, we believe the paradigm introduced in Libratus will be critical to the future growth and widespread application of AI,” researchers said.
