Combinatorial Optimization with Policy Adaptation using Latent Space Search Paper • 2311.13569 • Published Nov 13, 2023
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX Paper • 2306.09884 • Published Jun 16, 2023
Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning Paper • 2409.12001 • Published Sep 18, 2024