Skip to content

v0.2.0

Latest
Compare
Choose a tag to compare
@puyuan1996 puyuan1996 released this 09 Apr 09:06
· 2 commits to main since this release

Environment

  1. Add Metadrive environment and its configurations (#192)
  2. Add Sampled MuZero/UniZero and DMC environment with related configurations (#260)
  3. Polish Chess environment and its render method; add unit tests and configurations (#272)
  4. Add Jericho environment and its related configurations (#307)

Algorithm

  1. Add Harmony Dream loss balance in MuZero (#242)
  2. Adopt AlphaZero for non-zero-sum games (#245)
  3. Add AlphaZero CTree unittest (#306)
  4. Add recent MCTS-related papers (#324)
  5. Introduce rope to use the true timestep index as pos_index (#266)
  6. Add Jericho DDP configuration (#337)

Enhancement

  1. Add LightZero Sphinx documentation (#237)
  2. Add Wandb support (#294)
  3. Add Atari100k metric utilities (#295)
  4. Add eval_benchmark tests (#296)
  5. Include save_replay and collect_episode_data options in Jericho (#333)
  6. Add an MCTS TicTacToe demo in a single file (#315)

Polish

  1. Polish efficiency and performance on Atari and DMC (#292)
  2. Update requirements (#298)
  3. Optimize reward/value/policy_head_hidden_channels (#314)
  4. Update configuration and log instructions in tutorials (#330)

Fix

  1. Fix DownSample issues for different observation shapes (#254)
  2. Fix the wrong chance values in Stochastic MuZero (#275)
  3. Use display_frames_as_gif in CartPole (#288)
  4. Fix the chance encoder in stochastic_muzero_model_mlp.py (#284)
  5. Correct typo in model/utils.py (#290)
  6. Fix SMZ compile_args and num_simulations bug in world_model (#297)
  7. Fix reward type bug in 2048 and OS import issue in CartPole (#304)
  8. Switch to macos-13 in action (#319)
  9. Fix SMZ & SEZ config for pixel-based DMC (#322)
  10. Fix update_per_collect in DDP setting (#321)
  11. Fix bug with obs_shape tuple in initialize_zeros_batch (#327)
  12. Fix prepare_obs_stack_for_unizero issue (#328)
  13. Fix random_policy when len(ready_env_id) < collector_env_num (#335)
  14. Fix timestep compatibility issues (#339)

CI & Test

  1. Add self-hosted Linux (Ubuntu) CI runner (#259)
  2. Add self-hosted Linux runner for CI tests (#323)

Full Changelog: v0.1.0...v0.2.0

Contributors: @ruiheng123 @TuTuHuss @HarryXuancy @ShivamKumar2002 @Roland0511 @cmarlin @xiongjyu @PaParaZz1 @puyuan1996