Answer by Neil Slater for Q learning (DQN) strategy for a multiplayer...
This works, and is used as a standard approach for two player zero-sum games in reinforcement learning. As you stated, it is a combination of reinforcement learning with Minimax optimisation.A very...
View ArticleQ learning (DQN) strategy for a multiplayer zero-sum game
I have been looking for ways to train a Q-learning agent for a multiplayer zero-sum game (a variation of Tic-Tac-Toe in my case). I came up with a learning strategy I haven't found anywhere else, and I...
View Article
More Pages to Explore .....