[Feature] Updated TD-MPC2 evaluation and fixed some bugs #538

t-sekai · 2024-08-30T08:23:24Z

Separated eval_env from train env (Align evaluation setups for different online RL algorithms #486)
- eval_env is now used for evaluation. The only difference between env and eval_env is num_envs.
- num_eval_envs in configs now defines the num_envs of eval_env.
Fixed max_episode_steps (tdmpc2 baseline AttributeError: 'ManiSkillVectorEnv' object has no attribute 'max_episode_steps' #528)
- Newer gymnasium and ManiskillVectorEnv doesn't contrain default max_episode_steps as an attribute. Therefore, default max_episode_steps is now determined by gym_utils.find_max_episode_steps_value(env).
Fixed wandb logging to log exactly eval_episodes
- Realized that wandb logging in evaluation wasn't logging exactly eval_episodes (it was a multiple of num_eval_envs). Futhermore, only num_eval_envs videos were logged at each evaluation.
- It is now fixed to log exactly eval_episodes at each evaluation.

…ps (Issue #528), fixed wandb logging (now logging exactly eval_episodes)

Separated eval_env from train env (Issue #486), fixed max_episode_ste…

cba4497

…ps (Issue #528), fixed wandb logging (now logging exactly eval_episodes)

t-sekai changed the title ~~Updated TD-MPC2 evaluation and fixed some bugs~~ [Feature] Updated TD-MPC2 evaluation and fixed some bugs Aug 30, 2024

StoneT2000 approved these changes Sep 3, 2024

View reviewed changes

StoneT2000 merged commit abb2281 into haosulab:main Sep 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Updated TD-MPC2 evaluation and fixed some bugs #538

[Feature] Updated TD-MPC2 evaluation and fixed some bugs #538

t-sekai commented Aug 30, 2024 •

edited

Loading

[Feature] Updated TD-MPC2 evaluation and fixed some bugs #538

[Feature] Updated TD-MPC2 evaluation and fixed some bugs #538

Conversation

t-sekai commented Aug 30, 2024 • edited Loading

t-sekai commented Aug 30, 2024 •

edited

Loading