A symmetric continuous poker modeloptimal strategy obtained 9 by "passing to the limit" in the...

6
JOURNAL OF RESEARCH of t he National Bureau of Standards- B. Mathemati cs and Mathematical Physics Vol. 64B, No. I , January·March 1960 A Symmetric Continuous Poker Model A. J. Goldman and J. J. Stone I (August 18, H) 59) Beg innin g with von Kc umann, m at hcm >lt i cians conccrncd with tile rati ollal tl ilalrsi ,; of confli ct situ ations have r eali zc d th at iny csti gat i on of acc ura tc or simplified \'c n; ions of com- mo n ca rd games leads to tec hniqu es and insights a ppli cablc to situat ions of mili ta n' or economic in tc r est .. . In the present paper, a sy mm etric pokcr model onc s tagc morc conlpli- cated than the o:'lgll1 aI von Ne umann ga mc is soh ·cd . T hcr e arc l wo bct lc\'cls, a, b, a nd an a nt e of umt (t;t > b> 1).; no ralscs arc perml ttc d. Th c g'lmc has a uniquc optimal st ra tegy, whJCh forbid s bluffing on a low ha nd. Th e limi t in g rasc b= I is s l10w n to vicld the von Ne umann mode l. 1. Introdu ct io n Th e Leclll1i calli Le ra Lure of t he' J1 wL bemat ieal th eory of gam es of s LraLegy can be rou ghly subdivided in Lo (i ) ge neral paper s, which deal wiLh concepts, exisLenee Lheo rems and compu LaL ional me LllOd s relev an t for broa,d classes of gam es, and (ii) specific papers, in which lhe solu Lions of parLicular gll mes are d eLer- mined. Although many of Lh e st udies in Lh e sec ond category hav e dealt with probl ems suggested by eco- nomic or milit ar y conflict s, e.g. [1 , 2]2, a sub sLanLial fra cL ion hav e dealt with accm ate or simplifiecl ver- sions of common eard ga mes, the b es t known exampl e being v on Ne umann 's anal ysis of a simple poker model ([3 ], sec. 19.4 to 19 .10). Bridge and ba ccarat have be en Lr eated [4, 5], b ut poke r has remained the favo riL e Lopi c for Lhese st udies [3, 6 to 10], p er hap s because th e poker models inv esLi gate d have pr e- senL ed just the ri ght degr ee of challenge ("not easy, bu t noL impossibly hard ") to Lh e ingenui ty of the analyst. Of cour se Lhe techniq ll es and insight s de- veloped in these inv estig ation s arc then applicable Lo sit ua t ion s of great er pract ical importan ce . Th e pres en t paper deals wiLh a symme Lri c pok er game one s tage mor e complicat ed than von Neumann's original mod el. The game is continuous in Lhe sense that th e difficulties [6] du e to the enor- mous but finite number of possible hand s have b een avoided by assuming (as von N o umann did) an infinite continuum of possible hand s. The ( mor e difficult ) asymmetric version ha s also b een solved by the authors; the results will appear el sewh e re [11] . 2. Description of the Game The rules of th e game are as follows. Th e two players Rand S first ante 1 unit. Th ey then receive ind ependen t random numb er s (hands) from th e in- te rval [0 , 1]. Each player knows his own hand , but no t tha t of his opponent . The players act simul- t an eously , each ei ther dr opping, be t ting an additioUllI 1 Present address . Stanford Universi ty. 2 It alicized fig llres in brackets indicate the lit era ture references at the end of this paper. 532245- 60--4 35 b- 1 unit s, or beLLing an additioual a- l unit . If both drop, then pl ay ends ancl no payment is made. If bo th b et b- 1 wli Ls or both beL a- l wli Ls, Lhen play end s and 1 he player with Lhe hi gher h an d 3 wins the ante a nd bet of his oppon en t. If one player b eLs a- I umt s a ncl Lhe oLher b ets b- 1 Lhen Lhe laLter can e iLh er f old (in wluc ll case he his an te and previoLls beL Lo his opponen L) or see 4 by betLing an addit ional a- b units (in wlli ch case the pl ayer wi Lh Lhe higher hand wins Lhe an Le and pr evious bet or beLs of his opponen L) ; in eiLher case pl ay end s. N aL urally we assume a> b> l. E ach player ha s four C Ollrses of act ion for any giv en ha nd .. Th ese are (1 ) Lo dr op, (2) Lo b eL b- l uniLs wiLh th e intenL ion of foldi.ng if Lh e oppon ent bets a- I uniLs, (3 ) Lo beL b- l uni Ls wi Lh the in Len t ion of see in g if the opponenL beLs a- I un iLs, and (4 ) Lo beL a- I uni ts. Figur es 1 and 2 show the al gebraic pay off from player S Lo player R fo r any choices of courses of acLion by Lh e playe rs. A strategy (for eiLher player) i a quadrupl e 1'( z) = {r; ( z)} (i = 1,2,3,4) of fun ct ions deuned for O:::; z :::; l. Since 1' i( z) repr esenLs the probabiliLy 5 tha t the pl ayer will choose the i Lh C Ollrse of a cL ion when hi s hand is z, Lh e r es Lriclion s ar e imposed. Th e "smoothn es " eoncliLions on Lh ese functions are (i) th at they be inLegrabl e, ( ii ) Lila t if ri(z/» ° th en , for any inLervllJ J of \\,hich z' is an in terior 6 point , J ri(t)dt> O, and finally 7 (iii) thaL So' [& 1'2(t) -a1'4( t)]dt= 0 Oil [(&- 1 )/&, (a- 1)/a] 3 '1' he possibility ofeqnalhandq has zero probabili ty aL ld so can be disregarded. ' This possibility distiuguishes our model from the case 111= 3 of the versiou studied in part II of [6J. , More precisely, T,(Z) is a pro bability density function. 6 'r he words 'j interior" and <, open') are to be understood relative to tbe interval [O,I J. T This very weak bu t ad hoc condition is precisely wh at is needed to prove (b) anel (e) of l emma 8. .

Transcript of A symmetric continuous poker modeloptimal strategy obtained 9 by "passing to the limit" in the...

Page 1: A symmetric continuous poker modeloptimal strategy obtained 9 by "passing to the limit" in the optimal strategy for b> 1. The von Neumann poker game of sections 19.4 to 19.10 of [3]

JOURNAL OF RESEARCH of the National Bureau of Standards- B. Mathematics and Mathematica l Physics Vol. 64B, No. I , January·March 1960

A Symmetric Continuous Poker Model A. J. Goldman a nd J. J. Stone I

(August 18, H)59)

Beginning wi th von Kcuma nn, m athcm >lticians con ccrn cd with ti le rati ollal tl ila lr s i,; of conflict sit uations ha ve reali zcd t hat inycstigation of acc ura tc o r simplifi ed \'cn; ions of com­mon card games leads to techniques a nd insigh t s applicablc to sit uations of mili ta n ' or economic intcrest .. . I n the p rese nt paper, a symm etric pokc r model onc s tagc morc co nl pli­cated t han t he o:'lgll1aI von Neuma nn gamc is soh ·cd . T hcre a rc l wo bct lc \'cls, a, b, a nd an ante of ~ umt (t;t > b> 1).; no ra lscs arc perml ttcd . Thc g'lmc has a uniq uc op t ima l stra tegy, whJCh forbids bluffing on a low ha nd . The limi t ing rasc b= I is s l10w n to vicld the von Neuma nn mode l. ~

1. Introduction

The Leclll1icalliLe raLure of the' J1 wLbematieal theory of games of sLraLegy can be roughly subdivided in Lo (i) general papers, which deal wiLh concep ts, exisLenee Lheorems and compuLa Lional meLllOds relevan t for broa,d classes of games, and (ii) specific papers, in which lhe solu Lions of parLicular gll mes are deLer­mined. Al though many of Lhe studies in Lhe second category have dealt with problems suggested by eco­nomic or military conflicts, e.g. [1 , 2]2, a subsLanLial fracLion have deal t wi th accm ate or simplifiecl ver ­sions of common eard games, the b est known example being von Neumann's analysis of a simple poker model ([3], sec. 19.4 to 19.10). Bridge and baccarat ha ve been Lrea ted [4, 5], b ut poke r has remained t he favo riLe Lopic for Lhese studies [3 , 6 to 10], perhaps because the poker models invesLigated have pre­senLed just t he righ t degree of challenge ("no t easy, but noL impossibly hard") to Lhe ingenui ty of the analyst. Of course Lhe techniq lles and insights de­veloped in these investigations arc then applicable Lo situa t ions of greater practical importance.

The presen t paper deals wiLh a symmeLric poker game one stage more complicated than von Neumann's original model. The game is continuous in Lhe sense that the difficulties [6] due to the enor­mous but finite number of possible hands have been avoided by assuming (as von N oumann did) an infinite continuum of possible hands. The (more difficult) asymmetric version has also been solved by the authors; the results will appear elsewh ere [11] .

2. Description of the Game

The rules of the game are as follows. The two players Rand S first an te 1 unit. They t hen r eceive independen t random numbers (hands) from the in­terval [0, 1]. Each player knows his own hand, but no t that of his opponent. The players act simul­taneously, each ei ther dropping, be t ting an additioUllI

1 Present address . Stanford Uni versi ty. 2 Italicized figllres in brackets ind icate the literature references at the end of

this paper.

532245- 60--4 35

b- 1 units, or beLLing an additioual a - l unit . If bo th drop, t hen play ends ancl no paymen t is made. If bo t h bet b- 1 wli Ls or both beL a- l wliLs, Lhen play ends and 1 he player wi t h Lhe higher hand 3 wins the an te and bet of his opponen t. If one player beLs a- I umts ancl Lhe oLher bets b- 1 Lhen Lhe laLte r can eiLher f old (in wlucll case he lo s~s his an te and previoLls beL Lo his opponen L) or see 4 by betLing an additional a- b units (in wllich case the player wiLh Lhe higher hand win s Lhe an Le and previous bet or be Ls of h is opponen L) ; in eiLher case play ends. N aL urally we assume a> b> l.

E ach player has four COllrses of action for any given hand . . These are (1 ) Lo drop , (2) Lo beL b- l uniLs wiLh the in ten Lion of foldi.ng if Lhe opponen t bets a- I uniLs, (3) Lo beL b- l uni Ls wiLh the inLen t ion of seeing if the opponenL beLs a- I uniLs, and (4) Lo b eL a- I units. Fig ures 1 and 2 show the algebraic payoff from player S Lo player R for any choices of courses of acLion by Lhe players.

A strategy (for eiLher player) i a quadruple 1'(z) = {r; (z)} (i = 1,2,3,4) of functions deun ed for O:::; z :::; l. Since 1' i (z) represenLs the probabiliLy 5

tha t t he player will choose t he i Lh COllrse of acLion when his hand is z, Lhe resLriclion s

are imposed . The "smoothnes " eoncliLions on Lhese functions are (i) tha t they be inLegrable, (ii) Lila t if r i (z/» ° then , for any inLervllJ J of \\,hich z' is an in terior 6 point,

J r i(t )dt> O,

and finally 7 (iii) thaL

So' [&1'2(t) - a1'4(t )]dt= 0 Oil [(&- 1)/&, (a- 1)/a]

3 '1'he possibility of eqnalhandq has zero probabili ty aLld so can be disregard ed. ' T his possibility distiuguishes our model from the case 111= 3 of the versiou

studied in part II of [6J. , More precisely, T,(Z) is a probability density fu nction. 6 'r he words 'j interior" and <, open') are to be understood relative to tbe interval

[O,IJ. T This very weak bu t ad hoc condition is precisely what is needed to prove

( b) anel (e) of lemma 8. .

Page 2: A symmetric continuous poker modeloptimal strategy obtained 9 by "passing to the limit" in the optimal strategy for b> 1. The von Neumann poker game of sections 19.4 to 19.10 of [3]

implies

on [(b- 1)jb, (a - 1)ja]

It is convenient to define , for any r(z) ,

R ;= .f r ;(t )dt.

For any r( z) , Gi(z) will denote the expected payoff to player R, using r(z), against a fixed hand z and a fixed course of action i of his opponent. Thus the expected payoff ftmction of the game is given by

K (r , r*) = .f 'L,r i*(z) Gi(z)dz (2.1)

The explicit formulas 8 for the Gi(z ) are readily found from figures 1 and 2:

~~s R ~

1 2 3 4 ~ S

R~~ 1 2 3 4

1 ° - 1 - ] - 1 1 ° - 1 - 1 - 1

2 1 b b - b 2 1 - b - b - b

3 1 b b a 3 1 - b - b - a

4 1 b a a 4 1 b - a -a

FIGURE 1 FIGU RE 2

R has the higher hand. S has the higher hand.

G1 (Z) = l " [1'z(t) + 1'3 (t) + 1'4( t) ]elt

+ i 1h(t) + 1'3(t) + 1'4(t)]dt, (2.2)

G2 (z)= l " [- 1'I(t) - b1'2(t) - b1'3(t) + br4(t)]dt

+ i l [- l\(t) + b1'2(t )+ br3(t ) + b1'4(t )]elt, (2.3)

G3 (z) = 1Z [-1'1 (t) - br2(t) - br3(tl - ar4(t) ]elt

+ i l [-rl(t) + br2(t)+ br3(t) + a1'4(t)]dt, (2.4)

G4 ( z) = l ' [- 1'1 (t) - b1'2(t ) - ar3(t) - a1'4(t) ]dt

+ i l [-1'I(t) - b1'2(t)+w'3( t) + a1'4(t )]dt. (2.5)

8 Note that tile G,(z) ' s arc shown hy these.formnlas to be continuous fnnctions.

36

3 . Plan of Attack

It would not be very instructive simply to state the solution of the game and demonstrate its correct­ness. Instead, a slightly polished version of the process actually used to solve the game will be pl'e- !

sented. This is done in sec tions 4 and 5, in which : it is proved (subject to tbe plausible assumntion stated in sec. 4) that the only possible optimal strat­egy is the one obeying the conditions (a) to (el ) of lemma 8, which deal with the behavior of the strat­egyon three intervals [O,g ), (g,g'), and (g' , I] which might be thought of as the sets of low hands, inter­m ediate hands, and high hands. It is then relatively easy to verify that this particular strategy is indeed the unique optimal strategy; this is done in section 6, primarily by inspection of the expected payoff func­tion (2.1). In section 7 the limi ting case b = 1 is dealt with, and this game is found to have a unique optimal strategy obtained 9 by "passing to the limit " in the optimal strategy for b> 1. The von Neumann poker game of sec tions 19.4 to 19.10 of [3] is sllOwn to arise as a "reduced game" from this limiting case.

In section 4 it is shown that r3(z) = 0 on [0,1] for every optimal r(z). The points g, g' are located, and I

the unique optimal r(z) is isolated in section 5. The arguments of these sections are essentially all appli­cations of the following dominati on criterion:

The stmtegy r(z) is optimal only if the jollowing is tl'ue: For each z, i jor which Gi(z) (as a. junction oj i ) does not assume its minimum, 1'i(Z) = O.

The correctness of this criterion follows from the smoothness condition (ii) and the fact that 1'(z) is optimal only if

K (1',r) = min K(1' ,r*) . 1'*

4. Elimina hon of One Cou rse of Action

The elimination is effected as follows . LEMMA 1. For every optimal 1'(z), 1'3(Z) = 0 on

[0 ,1]. Otherwise, for some optimal 1'(z) , Bz'> 0 and

r3(z» ° for some z> 0. If z ' is the least upper bound of the set of all such z, then by (2.4) and (2.5)

so that by continuity and the domination criterion 1'3(Z) = ° in a neighborhood of z ' , contradicting the definition of z'.

The following plausible assumption (which will be verified in sec. 6) is now made:

ASSUMP'l' ION: For every optimal 1'(z) , 1'4(1»0

(anel thus R4> 1).

9 This is not quite the case for the asymmetric version ;see: 11J.

Page 3: A symmetric continuous poker modeloptimal strategy obtained 9 by "passing to the limit" in the optimal strategy for b> 1. The von Neumann poker game of sections 19.4 to 19.10 of [3]

5. Derivation of the Necessary Conditions

'inco applicaLion of t lte domination crilerion . req lIires cxam ining li tO signs of LllC pairwisc clifYcr­I ('n('cs of LIl(' Oi(Z) " , t he formulas for t hcse difr('f"cJl("cS

(based o n (2 .2) Lo (2.5) and tlte eo nditi o n /'3(2) = 0 Ion [0, 11) wi ll firs t be g iven : ' z

G2(Z)- 01 (.Z)=(b- l ) - bHI- 2bJ T2(t)dt (5. 1)

I J' z 0

G3(z)- 0) (z) = bIl2+ aR4- 1- 2b o I'2(t) dt

(S.2)

(5.6)

The firsL Jour of Llll'se con Linuous fu ndiOJl s nrl' d ead y monoton e nonulcrea ing. The followi ng table 10 defines the maximal open inle rvals in which these funcLion s are pos iLive (+), zero (0), an d n egative (-). For each in terval in lhe (+) and (-) columns, the resu lL of apply ing lhe dominalion criterion is given . Insp ecLion of (5. 1) lo (5 .4) reveals condiLion s on Lhe intcrYHls in lh e (0) column

!O Of course lemma 1 renclers su perflu ous tile "ppearanee of '3( Z) =0 in tbe table.

-

Difforence (+ )

02(Z)- 0)(z) [O,(I':!)'? 1'2(Z)= 0

03(Z)- 01(Z) [O,g3!)? 1'3(Z)= 0

-03(Z)- 02(Z) [O ,g3!)

1'3(Z) = 0

04(Z)- 01(Z) [O ,g~)? 1'4(2)= 0

L- --

wllich are g iH'1l belo\\' the intt'l·val. A ques tion mflL"k m cans thaL thc in Lcl'\" al may (so far ns is known aL the momcn t) b c vac uo us or dcgenerat e to a s in gle poinL ; t ltc co ndition IV]"illcn belo \\" s uch an intc rnll is to npply only if Lhe inLerval is 11 0 11 -

dcgcneraLc. Th e llcxL t hree lClllmn s y ie ld Lli (' valu cs o f HI, H2

nndR. foroptin11l1 1'(z) (lrmm }[ 1 sbol\"s thnt R3= 0).

L J:.; ~1'1A 2. Fol' CCfI'!! ojitimal I"(z) :

(a) 0 2(0)- G)(0)= (b- I)-bHI? O.

(b) G4 (0)- G1(O)=- I - bRz-1 aH4'5, O.

If (:\ ) \\" c r(' f:\[s(', th(, 11 t he J1lo l1 ololli('i(~" of G2 (z) - GJ z) :l lld til(' dom i 11 H t io n crite l'ioll \\' 0 uld imp l ~" ill= O, so tha t (a) wo uld be tru('. I f (b) \\' (' 1'0

fnls(' , (hell s ince 04(1) - G1( 1)< 0 011(' would haw

G4(Z')- GI(Z') = 0

Jor SOJlle z' in (0,1); by (5. \ 0) T.)(Z) = () 0 11 [0,':' ); so by (5.4)

L Eml.\ 3. Fo/' aery optimal /" (z):

(a ) G2(0)- GI(0)= (b- l )- bh'I= O.

(b) (,\(O)- GI(O)=- l - bHd aH2= O.

If 02( 1)-G1(1 » 0. Lhen the mOllotonicity of 02(Z)- 01 (Z) ,md lhe domination critcrion ,,,ould imply R2= 0, so thaL Oll th e onc 11I111 d Lhe i.nit"iHl nssumpt iOll " 'ould rE'Hd

-

(0) (- )

(g':! ,g~)? (g~, IF 1'2(2)= 0 I'I(Z) = O (G.7 )

----(g3! ,g3j.) '? (g3j., 1] I'I (Z)= \ I'I (Z)= O (5.8)

-

(g3!,g3J-) ? (g3J-, l ] 1'4(Z) = 0 1'2(Z)= 0 (.5.9 )

(g4! ,g4f)? (g4f, l] 1'4(2) = 0 I'I(Z)= O (S. lO)

37

Page 4: A symmetric continuous poker modeloptimal strategy obtained 9 by "passing to the limit" in the optimal strategy for b> 1. The von Neumann poker game of sections 19.4 to 19.10 of [3]

and 0 11 Lhe oLher hand R I= 1- R4 (by lemm a 1, using R2= O). Combining these statements yield s R4> 1/b, which together with (5.4) and R 2= 0 yields

G4(0) - G1(0)= - 1 + aR4> 0,

cOlltradicLing (b) of lemma 2. Thus G2(1) - G1 (1)::; 0, which togeth er with (a) of lemma 2 implies that

G2(z') - G1 (z' ) = ° for some z' in [0,1]. By (5. 7) 1'2(Z)= 0 on [O, z' ); so by (5.1),

G2(0)- G1(0) = G2(z ' ) - G1 (z ' ) = 0,

and (a) is proved. If (b) were false, then by (b ) of lemma 2, the monotonicity of G4(Z)- G1(z), and the domina tion cri terion, we wo uld h ave R 1 = 0, con­tradicting (a).

Ll~MMA 4. Fol' every oplimall'(z) :

(a)

(b )

(c)

R J= (b- J)/b.

R2=(a- b)lb(a I b).

R4= 2/ (a+ b).

This follows from lemma :) /lml tllC corolla)'), R1+ R2 + R4 = 1 of lemma 1.

N rxt the points g, U' will be located, and the behavior of an optimal 1'(z) on [O ,g) , (g,g' ) and (g' , 1] determined.

L BMM.\ 5 . For every optcimall'( z):

(,t ) u2). = (J~ = (J = (b - J) lb.

(b)

(c)

/', (z)= 1

1', (z) = 0

on [O,(J).

on (U, I ].

By (5 .5) and lemma 4,

G4 (Z) - G2(z)= 2b J:Z1'2 (t)dt - 2aJ:zl'4(t)dt (5. 11)

If g2).< g~, then by (5. 7) and (5. 10) one has 1'2(Z) = 1 on (g2).; g4f,), 1'1(z)= l on [0, g2f,), and 1';(z)= O on [0, g4f,), so that the last equation y ields

G4 (!/~) - Gz(!)").) > 0;

continuity and the domination criterion show that 1'4(Z)= 0 in a neighborhood of g4f, but (5.4) and (b) of lemma 3 show that th is contradicts the definition of g~. An entirely s imil,ar ar~umen t refu tes the pos­sibility q4f<g2f" and so g2f, = g~. B y (5. 7) and (5.10) we have 1' ,(z)= 1 on [0, g2f,) and1'l(z)= O on (g2f" J], so tlmL g2). = H,; an appli ca,tion of (a) of lemma 4 compl r tes the proof.

38

L E VU]\ G. For aery optimal l'(z):

(a) g3J,= p' =(a - 1)/a.

(b) 1'4(Z) = 1 on (g', 1] .

Note first that 1'4 (Z) = 1 ou (g3J" 1] si llce (i) r2(Z) = 0 on (g3J" 1] by (5 .9) , (ii) 1',(z)= O on (g, 1] by lemma 5, and (iii ) g< g3J, because G3(g)- G2(g» 0 by (5 .3) and th e fact (sec (5. 10)) UlI1 t 1'4(Z) = 0 on [0 , g). It follows that

The defillition of g3J, together with (5.3 ) shows that

(5. 12)

/l,lle! combining' Lhe last cq lIa,tions yields the asscr( ed value of g3J,.

Fol' every optimal I'(z),

on [g, g'l.

Sillce 1'2 (Z)= 0 on (y' , 1], (5. 11 ) and (5. 12 ) imp ly

where t llC last eq u a1iL~' follows from lemma J. L et z* b e a point at which G4(Z)- G2(z) attains its m axi­mum M on [g,g'J ; the last equation shows that .LH'2, O. If Al> O, t ben b)' cont inui ty and the domina tion cri­terion one has 1',(z)= O (and thus 1'2(Z) = 1, since r,(z)= O on (g, l ]) on some in terval (z*,z'); by (5 .5) G4 (Z)- G2 (z) is strictl~' incrcasing on (z*,z'), which contradicts Lite definition of Z* unless z*= g' , in which case M = O by t he last equation. Thus M = O; an entirely similar argumenl shows that tbe mini­mum value of G4 (Z) - G2(z) on [g,g'J is m= O, and so the lemma is proved.

The completion and summal'~- of the preceding' material is as follows.

L EMMA 8. L et g= (b- 1)lb and g'=(a- 1) /a. Ij the assumption at the end oj section 4 is correct, then the only possible optimal strategy is the strategy d{'fined I

by (a)

(b)

on [O,g ) .

7'2(z)= a/(a+ b) on (g,g' ).

(c) r4(z)= b/(a+ b) on (g,g' ).

(d) on (g', l ].

From lemma 1, lemma 5, and lemma 6 O llP ha s (>I ) , (cl ), and also

on (g,g'). (5.1:))

Page 5: A symmetric continuous poker modeloptimal strategy obtained 9 by "passing to the limit" in the optimal strategy for b> 1. The von Neumann poker game of sections 19.4 to 19.10 of [3]

FI'OIll lrmlllH i , (5.5), and Ir l1unn 4 0 11(' hn:-;

011 (g.y '),

",bieh by smooLhn ess condiLion (1'ii) (ser sret iOIl 2) implies '

on (g,g') ;

Lhis fi nd (5.] 3) iJllPl~' (b ) fmd (c) .

6 . Proof Tha t the Conditions Are Necessary a n d Sufficient for O p timality

In lhis secLion the noLation 1' (z) is ]"cselTed for Lhe pal'Lieular sLraLeg~' described in lemmn R, \\"hile nrbi ­lrary s tritlegies nrc dell oted /' *(z).

L EMM.\ 9. [{(r, ,.~) = m i l1 [((r,I'*) if alld on/y 1:1 1'~(z) ObfYS r'

(n)

(b)

(' )

r t(.:) - O

1'1'(=) = /,H=) 0

1"1(2) = I"i(:) - ()

011 [O.fl ) .

011 ((I ,fI ').

on (f/ , I].

\\'hich (sillcc the gamc is symmeLrie) implics LllfiL 1'(.:) is optimH I. T hus any op timftl r t(z) mll s t s ilti sf~ '

J((r,rt) ~ mill 1((1',1'*) r'

and thus , by lemma 1 llllcl (c) of lemnw 9, musL saLisfy r! (J )= 1. H ence Lhe ltSS umpti.on at the rnd of section 4 is corr ecL, and so, b~' Ir mm fi 8, r (:::) IS the mJ ly opt imn l s tl'H Cog?

7, Discussion of a Limiting Case

The l imiLing cnse b= ] \\"illIIO\\' be bridly discli Rbed . (Recn II thaL b> t prcviollsl)' ,) I L is ras il~ " vC l'i fi rd LhaL th e l'rsults of Lhe previous sec Lions r rmnin valid (ma ll)" of Lhe proofs C1Ui br considerably simpli­fi ed), so L11l1L thr game \\'ith b= 1 ]Ias a unique opti­mal s tmLeg)T which is obLain eci from LhaL 1'0 1' b> l (i.c. , from (a) to (d) of jemmil 8) by seLlillg b= 1. Sill ce g- O for b- I , the IIniq ll(' optimil l s lrntrgy isgi\' (,l1 b~"

(:1 )

(b)

(r)

/'4(=) - I / (a -j- 1)

/'4(=) = 1

on [O, (a- J) jn ).

Oil [O, (n- I )jn )

Oil ((n-l)jn ,ll . The K (r ,r *) or (2.1 ) is firs t \\'I'i t tC'l1 ;IS (hl' S tllll or

Lhrsr int C'grals : L'pon Cil ll llgi llg

J og [&\ (z) -j r;(=) (0 2 (=) - 0 1 (02)) + r!(z) (0 3 (,:::)

- (,\ (z)) j rt(z) (0 4(z) - Olz))l<1 ,:.

JY' [0 2(Z) 1'1(z) (01 (z) - 0 2(Z)) + l'~(z) (O~(Z') - 0 2(Z)) + 1'tcZ) (0 4 (2) -02(~')) ]rl ,z .

.L [0 4 (,:) -+ I'f(Z)(0 1(Z) - 04 (Z)) rt (Z)(0 2 (z)

- 0 4(Z)) +rrc=) (03(Z) - 0 4(=)) ]d.:' .

] t is casi ly \rerified LhaL I' (z) ha l itr pro]J<'rtirs described in lemma 4 , and since 7'3(Z)= 0 on 10,1l, r(z) obeys (5.1)-(5.6). Using thes(' facts , one find s that on (O,g) the coefficiel1 ts of 1'nz) and l'! (z) in t.he fiTSL inLegrftl \Tanish , while thaL of r! (z) is positive . Hence r~ ~z) minimizes the :first. i.nLegral if an d onl\­if it satisfies (a). Similarly, r~(z) minimi zes Lhe second integral if and only if it saLis fi es (b), and minimizes the third inLegral if and only if iL sftt. isfi eR (c) .

THEOREl\1. 7' (z) 18 th e vniq1l e opti mal 8Irn ff {m .

Since 7' (z) obeys (a) Lo (c) of ]r ll1mn, 9,

K (r, r)= min 1«1',1' *) r'

39

to

respeeli vel:\', olle find s LhaL Lil is sLrateg)- is precisrly Lhe unique opLimal s traLeg~' of Lhe conLinuous von ~ elllnan il poke'r game discussed in seeLions 19.4 to 19. 10 of [3]. To explnin (itis , noLc Lhat (2 .2) nnd (2.:n , \\' ith b= I , y iC'ld

for all stnllegies I' (z), which on Lhe one hand shows LhaL Lhefirst cOlll 'se of acLion is dominated, and on Lilr oLher hand )·ields a quick proofll LhaL I'I (Z)= O Oll [0 ,1] for all opLimal r (z). Thus, wiLhout loss of opLillMl sLraLegies, the game can be " reducrd " by "s Lriking ouL" Lhe first co mse of acLion roJ' both players. Tbis red ucecl game is easily secn Lo be identical with Lhe vo n Ncum~1 n n game, excepL for some change of bnguage and ,L mul Liplicat i ve far Lor b in Lhe expecLed payo ff fUlH'Li oJl . Thus Lhe von N eUlllann gmnc i1ppcnrs ~lS ,1 r educed form of Lhe limiting ca se b= l of thr gamr solvrd in Lhis paper.

I) If TI(2'»0 for some .', tbr n R ,> O; lite fortnltia and t ile dom ill aLion r ril rr ion yipld r!(z)=O fo r all t,:l rontradictioll .

Page 6: A symmetric continuous poker modeloptimal strategy obtained 9 by "passing to the limit" in the optimal strategy for b> 1. The von Neumann poker game of sections 19.4 to 19.10 of [3]

8 . References

ll) L. E . DubillS, A diHcrl'lc e\'fl ~ioll ?;I1 I11C , Anll . :\[ath. Study 39, 231-255 ( 1957).

12] 1'1. Shubik, Edgeworth ml1rkr t gall1 cs, A11n . 2\fath. Study 40, 267-278 (1959).

l :~l J . \ ' 00 N eumann and O. Morgenstern , Th eOl·.\· of games and eCOIlOmic bc\tlwior (3d cd ., Prince ton Univ. Pre~s, Princeton , N.J. , J 95:3) .

l~] G. L. Thompson , B ridge a nd s ignallin g, Ann . Math . Study 28, 279-289 (195;3).

[.'"1] .Y . G. K emeny, L . Snell , and G. L. Thompson , Game­t heoretic solution of baccarat , Am . ;VIath . :--Ion th ly 64, 465-469 (1957).

40

[G] D . B . Gillie's, J . P. l\Iay brJTY, a nd .T. \'011 Neumann , Two variants of p oker, Ann. Math . Study 28, 1:3-50 (J953).

[7) H . ICuhn , A simplified t \\'o-person poker, Ann. l\Iath. Study 24, 97-103 (1950).

[8] J . Nash and L. S. Shapley, A s imple t hr('e-person poker game, Ann . Math. Study 24, 105-ll6 (1950).

[9] R . Bellm an , On games involving bluffill g, Rend . e irc. Mat . Pal ermo [2)1, 139-156 (1952).

[10] S. l-': a rlin and R. R estrepo, Multistage poker models, Ann . Math. Study 39, 337-363 (1957).

[ ll) A. J. Goldman an d .T. J . Stonr, A contim low; poker glUM, to apprar in Duke Mat h. J . (1960).

'YAsr-II NG 'I'ON, D.C. (Paper 64Bl- 18)