4

Monte Carlo Tree Search with UCT is praised for it's asymmetric tree growth, growing promising subtrees more than non-promising ones.

But in a 2-player adversarial game, when a win at one node is a loss at the node below it, wouldn't the tree growth be extremely favourable for the current player at each node, and result in a not so asymmetric tree growth?

David Richerby
  • 82,470
  • 26
  • 145
  • 239
NightRa
  • 506
  • 3
  • 10

2 Answers2

1

The most explored path will be the Nash Equilibrium, because at each step we choose the best move for each player.
The asymmetry happens at each level towards the best move for the current player, thus overall the tree will not grow asymmetrically.

NightRa
  • 506
  • 3
  • 10
0

With each new rollout, considering games with many traps like chess, the random playouts converge super slowly if at all. Usually based on very high rollouts and on nodes with greater exploration.. Good game play for me happens around +100000 rollouts but dependant on the game. For one rollout you are correct, but with each rollout things get incrementally better. Backpropagation over many rollouts solve this in the UCB.