[trainer] feat: add support for the GDPO algorithm #5928
This workflow is awaiting approval from a maintainer in #5409
Triggered via pull request
February 26, 2026 07:38
Status
Action required
Total duration
–
Artifacts
–
This workflow is awaiting approval from a maintainer in #5409
e2e_one_step_off_policy.yml
on: pull_request
setup
e2e_one_step_off_policy_fsdp2
e2e_one_step_off_policy_megatron
cleanup