Multi-agent DQN for Nim-21

Two agents that learn to play Nim-21 using DQN (deep Q-network), implemented in PyTorch. Both agents learn the game-theoretic optimal strategy.
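
For reference, the optimal strategy that the agents are meant to converge to can be computed exactly with a short backward induction, which gives a baseline to compare learned policies against. The sketch below is not part of the project and does not use PyTorch. It assumes one common form of the rules that the description above does not spell out: a single pile of 21 objects, each player removes 1 to 3 per turn, and whoever takes the last object wins. The function name `optimal_moves` and its parameters are hypothetical; set `last_taker_wins=False` if the variant in use makes the last taker lose.

```python
def optimal_moves(total=21, max_take=3, last_taker_wins=True):
    """Map each pile size to a winning move, or None if the position is lost.

    Assumed rules (not confirmed by the project): one pile, players
    alternate, each move removes 1..max_take objects.
    """
    # wins[n] is True if the player to move with n objects left can force a win.
    wins = [False] * (total + 1)
    # With 0 objects left, the previous player took the last object.
    wins[0] = not last_taker_wins
    best = {}
    for n in range(1, total + 1):
        best[n] = None
        for take in range(1, min(max_take, n) + 1):
            # A move wins if it leaves the opponent in a losing position.
            if not wins[n - take]:
                wins[n] = True
                best[n] = take
                break
    return best


if __name__ == "__main__":
    table = optimal_moves()
    for remaining in range(21, 0, -1):
        print(remaining, table[remaining])
```

Under these assumed rules the losing positions are the multiples of 4, so the first player wins from 21 by taking 1 and then always leaving a multiple of 4. With `last_taker_wins=False` the losing positions shift to pile sizes that leave a remainder of 1 when divided by 4, which makes 21 a lost position for the player who moves first.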