{"id":2568,"date":"2025-10-01T13:29:03","date_gmt":"2025-10-01T18:29:03","guid":{"rendered":"https:\/\/sites.imsa.edu\/hadron\/?p=2568"},"modified":"2025-11-15T21:28:09","modified_gmt":"2025-11-16T03:28:09","slug":"game-theory-optimal-texas-hold-em-poker","status":"publish","type":"post","link":"https:\/\/sites.imsa.edu\/hadron\/2025\/10\/01\/game-theory-optimal-texas-hold-em-poker\/","title":{"rendered":"Game Theory Optimal Texas Hold Em\u2019 Poker"},"content":{"rendered":"<p style=\"text-align: center\"><span style=\"font-weight: 400\">Written By: Shihan Cheng<\/span><\/p>\n<p><span style=\"font-weight: 400\">Poker is a game of incomplete information. Players must make decisions with uncertainty, ultimately relying on probabilistic outcomes to determine the winner of a round. Unlike popular games such as chess or Go, Poker has the element of hidden cards and not knowing what cards will show up on the table, making it extremely viable to research in fields such as artificial intelligence (AI) and game theory. Game Theory Optimal (GTO) Poker refers to a methodology of playing Poker with a Nash equilibrium strategy so that no opposing player can deviate their playing style to improve their expected value of winnings. GTO Poker consists of choosing to bet (putting more money at stake), call (matching someone else\u2019s bet), and fold (giving up a hand), at certain frequencies such that the opposition cannot capitalize and profit from counter-strategies in the long run.<\/span><\/p>\n<p><span style=\"font-weight: 400\">First we introduce the basic rules of Poker. Each player attempts to make the best possible five-card hand, which has a hierarchy, given their two hole-cards, which are personal to them and not shared by anybody, along with the five community cards, which all players share on the table. There exists four betting rounds: pre-flop, before any of the community cards are shown, flop, after the first three community cards are shown, turn, after the fourth community card is shown, and river, after the fifth and final community card is shown. Players attempt to maximize their profits if they have the optimal five-card hand, and minimize their losses if they do not.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">GTO Poker is rooted in non-cooperative game theory, which was pioneered by John von Neumann and Oskar Morgenstern in <\/span><i><span style=\"font-weight: 400\">Theory of Games and Economic Behavior<\/span><\/i><span style=\"font-weight: 400\">, published in 1944. In a two-player zero-sum game, which means that every positive to one player directly yields a negative to the other, a Nash equilibrium is the pair of strategies where each player\u2019s choice is the optimal response to the other\u2019s. Mathematically, let <\/span><i><span style=\"font-weight: 400\">S<\/span><\/i><span style=\"font-weight: 400\">1<\/span> <span style=\"font-weight: 400\">and <\/span><i><span style=\"font-weight: 400\">S<\/span><\/i><span style=\"font-weight: 400\">2<\/span> <span style=\"font-weight: 400\">denote the set of possible strategies, or actions, that player 1 and player 2 has, respectively, and <\/span><i><span style=\"font-weight: 400\">u<\/span><\/i><span style=\"font-weight: 400\">1<\/span><span style=\"font-weight: 400\">(<\/span><i><span style=\"font-weight: 400\">s<\/span><\/i><span style=\"font-weight: 400\">1<\/span><i><span style=\"font-weight: 400\">, s<\/span><\/i><span style=\"font-weight: 400\">2<\/span><span style=\"font-weight: 400\">) be the expected utility, or winnings, or player 1 given strategy pair (<\/span><i><span style=\"font-weight: 400\">s<\/span><\/i><span style=\"font-weight: 400\">1<\/span><i><span style=\"font-weight: 400\">, s<\/span><\/i><span style=\"font-weight: 400\">2<\/span><span style=\"font-weight: 400\">) for some strategy <\/span><i><span style=\"font-weight: 400\">s<\/span><\/i><span style=\"font-weight: 400\">1<\/span> <span style=\"font-weight: 400\">in <\/span><i><span style=\"font-weight: 400\">S<\/span><\/i><span style=\"font-weight: 400\">1<\/span> <span style=\"font-weight: 400\">and <\/span><i><span style=\"font-weight: 400\">s<\/span><\/i><span style=\"font-weight: 400\">2<\/span> <span style=\"font-weight: 400\">in <\/span><i><span style=\"font-weight: 400\">S<\/span><\/i><span style=\"font-weight: 400\">2<\/span><span style=\"font-weight: 400\">. Then a Nash Equilibrium must satisfy that<\/span><\/p>\n<p style=\"text-align: center\"><i><span style=\"font-weight: 400\">u<\/span><\/i><span style=\"font-weight: 400\">1<\/span><span style=\"font-weight: 400\">(<\/span><i><span style=\"font-weight: 400\">s1<\/span><\/i><span style=\"font-weight: 400\">\u2217<\/span><i><span style=\"font-weight: 400\">, s2<\/span><\/i><span style=\"font-weight: 400\">\u2217<\/span><span style=\"font-weight: 400\">) <\/span><i><span style=\"font-weight: 400\">\u2265 <\/span><\/i><i><span style=\"font-weight: 400\">u<\/span><\/i><span style=\"font-weight: 400\">1<\/span><span style=\"font-weight: 400\">(<\/span><i><span style=\"font-weight: 400\">s<\/span><\/i><span style=\"font-weight: 400\">1<\/span><i><span style=\"font-weight: 400\">, s2<\/span><\/i><span style=\"font-weight: 400\">\u2217<\/span><span style=\"font-weight: 400\">)<\/span><i><span style=\"font-weight: 400\">,<\/span><\/i> <i><span style=\"font-weight: 400\">u<\/span><\/i><span style=\"font-weight: 400\">2<\/span><span style=\"font-weight: 400\">(<\/span><i><span style=\"font-weight: 400\">s1<\/span><\/i><span style=\"font-weight: 400\">\u2217<\/span><i><span style=\"font-weight: 400\">, s2<\/span><\/i><span style=\"font-weight: 400\">\u2217<\/span><span style=\"font-weight: 400\">) <\/span><i><span style=\"font-weight: 400\">\u2265 <\/span><\/i><i><span style=\"font-weight: 400\">u<\/span><\/i><span style=\"font-weight: 400\">2<\/span><span style=\"font-weight: 400\">(<\/span><i><span style=\"font-weight: 400\">s1<\/span><\/i><span style=\"font-weight: 400\">\u2217<\/span><i><span style=\"font-weight: 400\">, s<\/span><\/i><span style=\"font-weight: 400\">2<\/span><span style=\"font-weight: 400\">)<\/span><\/p>\n<p><span style=\"font-weight: 400\">for all possible <\/span><i><span style=\"font-weight: 400\">s<\/span><\/i><span style=\"font-weight: 400\">1<\/span> <span style=\"font-weight: 400\">and <\/span><i><span style=\"font-weight: 400\">s<\/span><\/i><span style=\"font-weight: 400\">2<\/span><span style=\"font-weight: 400\">. Note that <\/span><i><span style=\"font-weight: 400\">s1<\/span><\/i><span style=\"font-weight: 400\">\u2217<\/span> <span style=\"font-weight: 400\">and <\/span><i><span style=\"font-weight: 400\">s2<\/span><\/i><span style=\"font-weight: 400\">\u2217<\/span><span style=\"font-weight: 400\">, which are deviations in strategy, often include <\/span><span style=\"font-weight: 400\">mixed strategies, where certain actions are played at specific frequencies. This Nash equilibrium <\/span><span style=\"font-weight: 400\">ensures that each player\u2019s expected win rate is maximized when playing against perfectly reasonable play.<\/span><\/p>\n<p style=\"text-align: center\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-2570 aligncenter\" src=\"http:\/\/sites.imsa.edu\/hadron\/files\/2025\/10\/GTO-Preflop-Poker-Chart-300x261.png\" alt=\"\" width=\"341\" height=\"297\" srcset=\"https:\/\/sites.imsa.edu\/hadron\/files\/2025\/10\/GTO-Preflop-Poker-Chart-300x261.png 300w, https:\/\/sites.imsa.edu\/hadron\/files\/2025\/10\/GTO-Preflop-Poker-Chart.png 586w\" sizes=\"auto, (max-width: 341px) 100vw, 341px\" \/><span style=\"font-weight: 400\">Figure 1: A GTO Chart demonstrating the specific frequencies of actions that a player should have pre-flop in a certain position.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Completely solving GTO strategies for Poker is extremely computationally challenging, as there exists around 10<\/span><span style=\"font-weight: 400\">161<\/span> <span style=\"font-weight: 400\">possible states, more than the number of atoms in the observable universe. Due to this enormous number, researchers often use abstraction techniques in order to reduce this number, such as grouping similar hands and similar situations into clusters. They utilize counterfactual regret minimization (CFR) algorithms. CFR essentially uses the question \u201cIf I could go back in time and play differently, how much better off would I have been?\u201d The \u201chow much better off \u201d in the above hypothetical scenario is defined as regret. Through millions of practice games and iterations, CFR minimizes this regret, ultimately converging to a Nash Equilibrium strategy profile.<\/span><\/p>\n<p><span style=\"font-weight: 400\">While a GTO Poker strategy is theoretically unexploitable, it\u2019s designed to minimize one\u2019s losses against good players instead of maximizing wins against suboptimal opponents. Many professional players adopt the concept of a GTO baseline to avoid being exploited, but they also adjust their strategy to exploit their opponent when their opponent deviates from GTO. This dynamic balance between equilibrium and exploitation is an active area of study in poker trading software and multi-agent reinforcement learning.<\/span><\/p>\n<p><span style=\"font-weight: 400\">GTO formalizes optimal Poker play using ideas from game theory and algorithmic computation. It has not only advanced the beautiful game of Poker, but it has calso ontributed to the rapidly growing field of AI and decision-making. Applications include operations management for large corporations, resource allocation, path planning for robotics, and so much more. Researchers continuously study GTO, both refining human strategy in Poker games, but also developing better strategic reasoning in imperfect-information games.<\/span><\/p>\n<p>&nbsp;<\/p>\n<h1 style=\"text-align: center\"><b>References<\/b><\/h1>\n<ol>\n<li><span style=\"font-weight: 400\">Cornell University INFO 2040 Course Blog. Game theory optimal (gto) texas holdem poker theory, November 2021. Accessed: 2025-09-13.<\/span><\/li>\n<li><span style=\"font-weight: 400\">Jingyu Li. Exploitability and game theory optimal play in poker. Undergraduate seminar paper, course 18.204 (discrete mathematics), Massachusetts Institute of Technology, May 2018. Accessed: 2025-09-13.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">John von Neumann and Oskar Morgenstern. <\/span><i><span style=\"font-weight: 400\">Theory of Games and Economic Behavior<\/span><\/i><span style=\"font-weight: 400\">. Princeton University Press, Princeton, NJ, 1944.<\/span><\/li>\n<li><span style=\"font-weight: 400\">William Poundstone and The Editors of Encyclopaedia Britannica. John von neumann \u2014 biography, accomplishments, inventions, &amp; facts, July 2025. Accessed: 2025-09-13.<\/span><\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>Written By: Shihan Cheng Poker is a game of incomplete information. Players must make decisions with uncertainty, ultimately relying on probabilistic outcomes to determine the winner of a round. Unlike popular games such as chess or Go, Poker has the element of hidden cards and<\/p>\n","protected":false},"author":1100,"featured_media":2571,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ngg_post_thumbnail":0,"footnotes":""},"categories":[101,11],"tags":[],"class_list":["post-2568","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-game-theory","category-math"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/posts\/2568","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/users\/1100"}],"replies":[{"embeddable":true,"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/comments?post=2568"}],"version-history":[{"count":3,"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/posts\/2568\/revisions"}],"predecessor-version":[{"id":2615,"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/posts\/2568\/revisions\/2615"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/media\/2571"}],"wp:attachment":[{"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/media?parent=2568"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/categories?post=2568"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sites.imsa.edu\/hadron\/wp-json\/wp\/v2\/tags?post=2568"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}