Participants: 2016

The 2016 heads-up no-limit Texas hold'em competition had 9 different agents. As in previous years, agents were submitted by a mix of universities and individual hobbyists from 5 different countries around the world.

Competitors in the 2016 Annual Computer Poker Competition were not required to supply detailed information about their submission(s) in order to compete, but many provided information about team members, affiliation, location, and high-level technique, along with occasional pointers to relevant papers. This page presents that information.

Heads-up No-Limit Texas Hold'em

Act1

  • Team Name: Act1
  • Team Members: Tim Reiff
  • Affiliation: unfoldpoker
  • Location: Las Vegas, USA
  • Non-dynamic Agent
  • Technique:

Act1 was trained by an experimental distributed implementation of the Pure CFR algorithm.  A heuristic was added to occasionally avoid some game tree paths, reducing the time spent per training iteration.  To compensate for imperfect recall, a distance metric that considers features from all postflop streets was used to construct the card abstraction on the river.  Several bet sizes were omitted because they offer little benefit against other equilibrium opponents while requiring a disproportionate amount of resources to train and store.

The strategy comprises 159 billion information sets (430 billion information set-action pairs); training ran for 5.15 trillion iterations.
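Act1's actual trainer is not public, but the defining idea of Pure CFR is to sample a single pure strategy profile each iteration, so that all utilities, and therefore all regrets, stay integers. Below is a minimal, hypothetical sketch of that idea on Kuhn poker, a toy stand-in for the abstracted hold'em game (none of this is Act1's code):

    import random

    ACTIONS = ["p", "b"]   # p = check/fold, b = bet/call
    regrets = {}           # infoset -> [int, int]; Pure CFR keeps integer regrets
    strategy_sum = {}      # infoset -> [int, int]; counts of sampled pure actions

    def terminal_util(history, cards, player):
        # Kuhn poker payoff for `player` at a terminal history, else None.
        if history in ("pp", "bb", "pbb"):                 # showdown
            pot = 1 if history == "pp" else 2
            return pot if cards[player] > cards[1 - player] else -pot
        if history in ("bp", "pbp"):                       # a fold ended the hand
            bettor = 0 if history == "bp" else 1
            return 1 if player == bettor else -1
        return None

    def regret_match(r):
        pos = [max(x, 0) for x in r]
        total = sum(pos)
        return [x / total for x in pos] if total else [0.5, 0.5]

    def sample(probs):
        return 0 if random.random() < probs[0] else 1

    def traverse(history, cards, traverser):
        util = terminal_util(history, cards, traverser)
        if util is not None:
            return util
        player = len(history) % 2
        infoset = str(cards[player]) + history
        r = regrets.setdefault(infoset, [0, 0])
        a_star = sample(regret_match(r))                   # the sampled pure action
        if player != traverser:
            return traverse(history + ACTIONS[a_star], cards, traverser)
        # Traverser evaluates every action against the sampled pure profile.
        utils = [traverse(history + a, cards, traverser) for a in ACTIONS]
        for i in range(len(ACTIONS)):
            r[i] += utils[i] - utils[a_star]               # integer regret update
        strategy_sum.setdefault(infoset, [0, 0])[a_star] += 1
        return utils[a_star]

    for _ in range(200000):
        cards = random.sample([0, 1, 2], 2)                # sampled chance outcome
        for traverser in (0, 1):
            traverse("", cards, traverser)

    for infoset in sorted(strategy_sum):                   # print average strategy
        total = sum(strategy_sum[infoset])
        print(infoset, [round(c / total, 2) for c in strategy_sum[infoset]])

Integer regrets are part of what makes the approach memory-friendly at the scale reported above; a production system would add the card abstraction, path-avoidance heuristic, and distribution that Act1 describes.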

Hugh

  • Team Name: Hugh
  • Team Members: Stan Sulsky
  • Affiliation: Independent
  • Location: New York, USA
  • Non-dynamic Agent
  • Technique: Just a rule-based engine.

KEmpfer_cfr

  • Team Name: KEmpfer
  • Team Members: Julian Prommer, Patryk Hopner, Suiteng Lu, Eneldo Loza Mencia
  • Affiliation: Knowledge Engineering Group, TU Darmstadt
  • Location: Darmstadt, Germany
  • Non-dynamic Agent
  • Technique: 

This bot implements a CFR strategy. To train the policy, we used the Open Pure CFR implementation and adapted it to heads-up no-limit. In addition, we implemented some more advanced techniques such as card and bucket clustering.
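The clustering itself is not described; one common construction (an assumption here, not KEmpfer's documented method) is to represent each hand by rollout-equity features and group similar hands into buckets with k-means, so that CFR indexes information sets by bucket rather than by exact cards. A small sketch:

    import numpy as np

    def kmeans(points, k, iters=25, seed=0):
        # Tiny k-means: assign each point to its nearest center, then
        # recompute each center as the mean of its assigned points.
        rng = np.random.default_rng(seed)
        centers = points[rng.choice(len(points), size=k, replace=False)]
        for _ in range(iters):
            dists = np.linalg.norm(points[:, None] - centers[None, :], axis=2)
            labels = dists.argmin(axis=1)
            for j in range(k):
                if (labels == j).any():
                    centers[j] = points[labels == j].mean(axis=0)
        return labels

    # Hypothetical per-hand features: [mean rollout equity, equity std].
    features = np.array([
        [0.82, 0.05],   # strong made hand
        [0.55, 0.30],   # volatile draw
        [0.51, 0.28],   # another draw
        [0.20, 0.07],   # weak made hand
        [0.79, 0.06],   # strong made hand
    ])
    buckets = kmeans(features, k=3)
    print(buckets)      # hands in the same bucket share one information set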

Nyx

  • Team Name: Nyx
  • Team Members: Martin Schmid, Matej Moravcik
  • Affiliation: Charles University
  • Location: Prague, Czech Republic
  • Non-dynamic Agent
  • Technique: 
  • Equilibrium approximating agent
  • Small computational resources
  • Very compact strategy representation (only 2 GB for the uncompressed strategy)
  • Imperfect recall action abstraction with up to 16 possible bets in an information state (an action-translation sketch follows below)
  • Both the abstraction and the strategy are continuously learned during self-play
  • Heavily modified CFR utilizing dynamic programming to handle a non-stationary imperfect-recall action abstraction with many actions

Automatic public card abstraction for the flop round: Schmid, M., Moravcik, M., Hladik, M., & Gaukroder, S. J. (2015). Automatic Public State Space Abstraction in Imperfect Information Games. In Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence.
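Nyx does not say how opponent bets outside its 16-size menu are handled, but any agent with a bounded action abstraction needs some translation rule. A standard option (an assumption here, not something Nyx states) is the pseudo-harmonic mapping of Ganzfried & Sandholm (IJCAI 2013), sketched below with bet sizes expressed as fractions of the pot:

    import random

    def pseudo_harmonic_prob(x, a, b):
        # Probability of mapping an observed size x (a <= x <= b) down to a,
        # with all sizes expressed as pot fractions.
        return ((b - x) * (1 + a)) / ((b - a) * (1 + x))

    def translate(x, menu, rng=random):
        # Clamp bets outside the menu; otherwise randomize between the two
        # neighboring abstract sizes.
        menu = sorted(menu)
        if x <= menu[0]:
            return menu[0]
        if x >= menu[-1]:
            return menu[-1]
        for a, b in zip(menu, menu[1:]):
            if a <= x <= b:
                return a if rng.random() < pseudo_harmonic_prob(x, a, b) else b

    menu = [0.5, 1.0, 2.0]          # hypothetical abstract pot-fraction sizes
    print(translate(0.75, menu))    # maps to 0.5 or 1.0 at random

Randomizing between the neighboring sizes, rather than always rounding to the nearest one, makes the mapping considerably harder to exploit.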

Proteus

  • Team Name: Queen's Automated Poker Team (QAPT)
  • Team Members: Chris Barnes, Spencer Evans, Austin Attah, Robert Sun
  • Affiliation: Queen's University
  • Location: Kingston, Canada
  • Mostly static, non-equilibrium Agent
  • Technique: 

We have attempted to create a generic model by mining the logs of previous matches, especially those of well-performing bots. In future work we plan to implement in-game modelling of opponents.
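A concrete first step in this kind of log mining is tabulating how often a bot folds, calls, or raises on each street. The sketch below assumes the standard ACPC log format, where each hand is a STATE line whose third field is the betting string (the demo line's card and result fields are illustrative):

    import re
    from collections import Counter

    STREETS = ["preflop", "flop", "turn", "river"]

    def action_frequencies(log_lines):
        # Count f (fold), c (check/call), and r (bet/raise) tokens per street
        # from the betting string in each STATE line.
        counts = Counter()
        for line in log_lines:
            if not line.startswith("STATE:"):
                continue
            betting = line.split(":")[2]        # e.g. "cr300c/cc/cr900f"
            for street, seq in enumerate(betting.split("/")):
                for act in re.findall(r"f|c|r\d*", seq):
                    counts[(STREETS[street], act[0])] += 1
        return counts

    demo = ["STATE:0:cr300c/cc/cr900f:7h6s|KdQc/2c8h4s/Tc:-300|300:hero|villain"]
    print(action_frequencies(demo))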

Rembrant5

  • Team Name: Rembrant5
  • Team Members: Gregor Vohl
  • Affiliation: FERI
  • Location: Maribor, Slovenia
  • Static Agent
  • Technique: 

Historical games are used to calculate the equity of the current hand given the current board cards. The bot makes randomized decisions, choosing actions with the probabilities observed in the historical games. Because pure randomness is not very effective, there are also a couple of hardcoded rules the bot must consider before making an action.
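Read literally, the policy has roughly the following shape: bucket the hand by equity, sample an action from the frequencies observed in the history games, and let hardcoded rules veto bad choices. The table and thresholds below are purely illustrative, not Rembrant5's actual values:

    import random

    # Hypothetical action frequencies mined from history games, by equity bucket.
    FREQ = {
        "weak":   {"fold": 0.55, "call": 0.35, "raise": 0.10},
        "medium": {"fold": 0.20, "call": 0.60, "raise": 0.20},
        "strong": {"fold": 0.05, "call": 0.40, "raise": 0.55},
    }

    def bucket(equity):
        return "weak" if equity < 0.35 else "strong" if equity > 0.65 else "medium"

    def act(equity, facing_bet, rng=random):
        freqs = FREQ[bucket(equity)]
        action = rng.choices(list(freqs), weights=list(freqs.values()))[0]
        # Hardcoded rule: never fold when checking is free.
        if action == "fold" and not facing_bet:
            action = "call"                      # i.e. check
        return action

    print(act(equity=0.70, facing_bet=True))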

Slumbot

  • Team Name: Slumbot
  • Team Members: Eric Jackson
  • Affiliation: Independent Researcher
  • Location: Menlo Park, USA
  • Static Agent
  • Technique: 

Slumbot is a large Counterfactual Regret Minimization (CFR) implementation. It uses the external sampling variant of MCCFR (Monte Carlo CFR) and employs a symmetric abstraction.  Some statistics about the size of the abstraction:

  • 4.5x10^11 information sets
  • 1.1x10^12 information-set-action pairs
  • 1.5x10^6 betting sequences

We used a distributed implementation of CFR running on eleven r3.4xlarge Amazon EC2 instances.
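External sampling differs from the Pure CFR sketch under Act1 in two places: the traverser explores all of its actions and updates regrets against the expected value of its current mixed strategy (so regrets are real-valued), while opponent and chance events are sampled, and the average strategy is accumulated at the sampled opponent nodes. A sketch of the traversal, reusing the Kuhn poker helpers (ACTIONS, terminal_util, regret_match) from that earlier sketch:

    import random

    regrets, strategy_sum = {}, {}      # infoset -> [float, float]

    def walk(history, cards, traverser):
        util = terminal_util(history, cards, traverser)
        if util is not None:
            return util
        player = len(history) % 2
        infoset = str(cards[player]) + history
        r = regrets.setdefault(infoset, [0.0, 0.0])
        strat = regret_match(r)
        if player == traverser:
            # Explore every action; the baseline is the mixed strategy's EV.
            utils = [walk(history + a, cards, traverser) for a in ACTIONS]
            ev = sum(p * u for p, u in zip(strat, utils))
            for i in range(len(ACTIONS)):
                r[i] += utils[i] - ev
            return ev
        # Opponent node: accumulate the average strategy, sample one action.
        s = strategy_sum.setdefault(infoset, [0.0, 0.0])
        for i in range(len(ACTIONS)):
            s[i] += strat[i]
        a = 0 if random.random() < strat[0] else 1
        return walk(history + ACTIONS[a], cards, traverser)

    for _ in range(200000):
        cards = random.sample([0, 1, 2], 2)     # chance is sampled as well
        for traverser in (0, 1):
            walk("", cards, traverser)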

More details can be found in my paper to be presented at the 2016 Computer Poker Workshop at AAAI.

BabyTartanian8

  • Team Name: Tartanian
  • Team Members: Noam Brown, Tuomas Sandholm
  • Affiliation: Carnegie Mellon University
  • Location: Pittsburgh, USA
  • Static Agent
  • Technique: 

BabyTartanian8 plays an approximate Nash equilibrium that was computed on the San Diego Comet supercomputer. For equilibrium finding, we used a new Monte Carlo CFR variant that leverages the recently introduced regret-based pruning (RBP) method [Brown & Sandholm NIPS-15] to sample actions with negative regret less frequently, which dramatically speeds up convergence. Our agent uses an asymmetric action abstraction, which required conducting two separate equilibrium-finding runs.
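The observation behind RBP is that once an action's cumulative regret is negative, regret matching assigns it zero probability, and it cannot become positive again for a computable number of iterations, so its subtree can safely be skipped until then. A minimal sketch of that bookkeeping (max_regret_delta, a per-iteration bound on regret growth, is assumed to be known for the game; names are illustrative):

    import math

    next_visit = {}     # (infoset, action) -> first iteration worth revisiting

    def schedule(infoset, action, regret, t, max_regret_delta):
        # With cumulative regret R < 0 and at most max_regret_delta regret
        # gained per iteration, the action keeps zero probability for at
        # least ceil(-R / max_regret_delta) iterations; skip it until then.
        if regret >= 0:
            next_visit[(infoset, action)] = t
        else:
            next_visit[(infoset, action)] = t + math.ceil(-regret / max_regret_delta)

    def should_traverse(infoset, action, t):
        return t >= next_visit.get((infoset, action), 0)

    schedule("Qd|cr300", "fold", regret=-40.0, t=100, max_regret_delta=8.0)
    print(should_traverse("Qd|cr300", "fold", t=102))    # False until t >= 105

In the Monte Carlo variant the paragraph above describes, negative-regret actions are sampled less frequently rather than skipped outright, but the savings come from the same source: the pruned subtrees dominate traversal cost.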

Noam Brown and Tuomas Sandholm. Regret-Based Pruning in Extensive-Form Games. In Neural Information Processing Systems (NIPS), 2015.

Noam Brown, Sam Ganzfried, and Tuomas Sandholm. Hierarchical Abstraction, Distributed Equilibrium Computation, and Post-Processing, with Application to a Champion No-Limit Texas Hold'em Agent. In Proceedings of the International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2015.