[algo] fix: seq mean and default scale factor loss_mask.shape[-1] a…
#5948
e2e_one_step_off_policy.yml
on: push
setup
11s
e2e_one_step_off_policy_fsdp2
3m 16s
e2e_one_step_off_policy_megatron
3m 13s
cleanup
5s