[fsdp,vllm,trainer,algo] feat: On-Policy Distillation #5969
e2e_one_step_off_policy.yml
on: pull_request
setup
9s
e2e_one_step_off_policy_fsdp2
3m 9s
e2e_one_step_off_policy_megatron
3m 22s
cleanup
5s