Consider the following one-player game: Initially, there is a pot of $\$0$. On each turn, the player may opt to end the game and collect the contents of the pot, or roll a single die. In the latter case, if the roll is $1$, the game is over and the player collects nothing. Otherwise, the amount of the die roll (from $\$2–\$6$) is added to the pot, and the player gets another move. The player continues rolling until they either get a $1$, and collect nothing, or opt to collect the pot. We would like to find the optimal strategy for this game, and the expected payoff under the optimal strategy.
A simple (but not quite correct) analysis goes like this: Any rational strategy must take the form “Stop if and only if the pot has value at least $\$S$.” If the pot contains $\$P$, then the expected gain from the next die roll is $\frac16(-P+2+3+4+5+6 )$ dollars. This is positive whenever $P\ge 20$, so the player should roll until the pot reaches $\$20$ or more, and then stop and collect it.
This is in fact an optimal strategy, but the analysis is not exactly correct, because it assumes, incorrectly, that the expected gain from a successful die roll is exactly $\$4$. But actually the expectation is a little bit more than this, because a successful die roll not only adds $\$4$ to the pot, it enables further die rolls that may add more. In fact strategies $S=20$ and $S=21$ yield identical expected payoffs of $\$8.141794893727$.
A more careful analysis follows: Let $X_S(P)$ be the expected amount won by following the stop-at-$S$ strategy when $\$P$ is already in the pot. Then $X_S$ satisfies the following unusual recurrence:
$$X_S(P) = \begin{cases} P & \text{if $P\ge S$} \\ \frac16\sum_{i=2}^6 X_S(P+i) & \text{otherwise} \end{cases} $$
We are interested in the behavior of $X_S(0)$, which we can calculate exactly using this recurrence. This is tedious, so I had the computer do it. $X_S(0)$ is maximized, as already mentioned, for $S=20$ and $S=21$.
My questions are:
- Is there an easier method to calculate $X_S(0)$ without the help of the computer?
- Is there an argument that shows that $X_{20}(0) = X_{21}(0)$?