3 skills found
opendilab / LightZero - [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
DHDev0 / Stochastic Muzero - PyTorch implementation of Stochastic MuZero for Gym environments. The algorithm supports a wide range of action and observation spaces, both discrete and continuous.
chiamp / Sigmazero - Generalizing DeepMind's MuZero algorithm to stochastic environments