Skip to content

Actions: verl-project/verl

Actions

e2e_one_step_off_policy_ascend

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
540 workflow runs
540 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[WIP][algo] Migrate and implement the GDPO algorithm into the existing framework.
e2e_one_step_off_policy_ascend #546: Pull request #5422 synchronize by Rhetee
Action required Rhetee:GDPO
[trainer] feat: add support for the GDPO algorithm
e2e_one_step_off_policy_ascend #544: Pull request #5409 synchronize by yue-zeng-yue
Action required yue-zeng-yue:feat-gdpo
[rollout, data] fix: honor train_max_samples/val_max_samples in fully…
e2e_one_step_off_policy_ascend #543: Commit c3e3970 pushed by ArronHZG
1h 28m 48s main
[tool] fix: handle empty image inputs in ToolAgentLoop (#5420)
e2e_one_step_off_policy_ascend #542: Commit 9dd447e pushed by wuxibin89
1h 0m 46s main
[trainer] feat: add support for the GDPO algorithm
e2e_one_step_off_policy_ascend #541: Pull request #5409 synchronize by yue-zeng-yue
Action required yue-zeng-yue:feat-gdpo
[WIP][algo] Migrate and implement the GDPO algorithm into the existing framework.
e2e_one_step_off_policy_ascend #531: Pull request #5422 synchronize by Rhetee
Action required Rhetee:GDPO
[tool] feature: scheduling analysis based on profiling data for torch profiler
e2e_one_step_off_policy_ascend #528: Pull request #5367 synchronize by Rhetee
Action required Rhetee:main