Skip to content

Actions: verl-project/verl

Actions

e2e_one_step_off_policy_ascend

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
528 workflow runs
528 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[WIP][algo] Migrate and implement the GDPO algorithm into the existing framework.
e2e_one_step_off_policy_ascend #562: Pull request #5422 synchronize by Rhetee
Action required Rhetee:GDPO
[WIP][algo] Migrate and implement the GDPO algorithm into the existing framework.
e2e_one_step_off_policy_ascend #561: Pull request #5422 synchronize by Rhetee
Action required Rhetee:GDPO
[WIP][algo] Migrate and implement the GDPO algorithm into the existing framework.
e2e_one_step_off_policy_ascend #560: Pull request #5422 synchronize by Rhetee
Action required Rhetee:GDPO
[ci] feat: add profiling tests to vLLM ci
e2e_one_step_off_policy_ascend #555: Pull request #5215 synchronize by Gary-cjy
Action required Gary-cjy:main
[WIP][algo] Migrate and implement the GDPO algorithm into the existing framework.
e2e_one_step_off_policy_ascend #546: Pull request #5422 synchronize by Rhetee
Action required Rhetee:GDPO
[trainer] feat: add support for the GDPO algorithm
e2e_one_step_off_policy_ascend #544: Pull request #5409 synchronize by yue-zeng-yue
Action required yue-zeng-yue:feat-gdpo
[rollout, data] fix: honor train_max_samples/val_max_samples in fully…
e2e_one_step_off_policy_ascend #543: Commit c3e3970 pushed by ArronHZG
1h 28m 48s main