[question] In train.py, why is gamma in VecNormalize not updated per trial?

Hi, from [this issue](https://github.com/openai/baselines/issues/538#issuecomment-417522993), it says `VecNormalize`'s `gamma` should match the `gamma` of RL algorithm (e.g., `gamma`=0.99 should be consistent in both `PPO2` and `VecNormalize`) to ensure consistent sliding window size. However, it seems the normalization arguments used in `create_env` are always the default one read from `.yml` file (i.e., `gamma`=0.99 as default):
https://github.com/araffin/rl-baselines-zoo/blob/fd9d38862047d7fd4f67be8eb3f6736e093eac9f/train.py#L269

although `gamma` has different candidates in `hyperparams_opt.py`:
https://github.com/araffin/rl-baselines-zoo/blob/fd9d38862047d7fd4f67be8eb3f6736e093eac9f/utils/hyperparams_opt.py#L188

The same applies for rl-baselines3-zoo. Is this a bug? Should `create_env` consider `gamma` change in initiating `VecNormalize` per trial? Please give me some hint if I missed anything, thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[question] In train.py, why is gamma in VecNormalize not updated per trial? #91

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

[question] In train.py, why is gamma in VecNormalize not updated per trial? #91

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions