v0.4.1 patch release: checkpoint fixes for MoE EP & LoRA, OpenAI/MCP tool calling schema, and SGLang memory optimizations #2225
eric-haibin-lin announced in Announcements
Key changes
PPO fixes and enhancements
vf_loss factor #2016
Checkpoints related
*.checkpoint.async_save=True.
checkpoint_config as the keyword to replace checkpoint_contents [megatron] feat: Support of dist checkpoint #2125
python -m verl.model_merger merge .... Documentation
Experimental function calling & MCP interfaces
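For readers unfamiliar with the OpenAI tool calling schema named in the release title, the sketch below shows the general shape of a function tool definition and a minimal check of a model-emitted call against it. The tool (`get_weather`), its parameters, and the helper `check_tool_call` are hypothetical and exist only for illustration; how verl itself loads or validates tool schemas is not described in these notes.

```python
import json

# Hypothetical tool definition in the OpenAI function-calling format.
# The name and parameters are illustrative and do not come from verl.
get_weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
                "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
            },
            "required": ["city"],
        },
    },
}


def check_tool_call(tool, call):
    """Return True if `call` names this tool, supplies every required
    argument, and uses only declared argument names (hypothetical helper)."""
    fn = tool["function"]
    if call["name"] != fn["name"]:
        return False
    # In the OpenAI format, arguments arrive as a JSON-encoded string.
    args = json.loads(call["arguments"])
    declared = fn["parameters"]["properties"]
    required = fn["parameters"].get("required", [])
    return all(k in args for k in required) and all(k in declared for k in args)


good_call = {"name": "get_weather", "arguments": '{"city": "Paris"}'}
bad_call = {"name": "get_weather", "arguments": '{"unit": "celsius"}'}
print(check_tool_call(get_weather_tool, good_call))
print(check_tool_call(get_weather_tool, bad_call))
```

The check is deliberately shallow: it does not validate argument types or enum values, which a full JSON Schema validator would cover.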
New models and recipes
Clip-Cov and KL-Cov methods
SGLang optimizations
Other performance profiling & optimizations
[actor|ref].fsdp_config.forward_prefetch=True [FSDP] feat: Add FSDP forward pefetch and recompute chunking entropy #1927
[actor|ref].entropy_checkpointing=True and [actor|ref].entropy_from_logits_with_chunking=True [FSDP] feat: Add FSDP forward pefetch and recompute chunking entropy #1927
Other breaking changes and deprecations
See #1902
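As background for the entropy_from_logits_with_chunking option listed under the performance optimizations above, the following is a dependency-free sketch of the general idea: compute per-token entropy directly from logits, a few rows at a time, so that only one chunk of intermediate values is live at once. It is not verl's implementation, which operates on tensors. The function names and the chunk size are assumptions made for illustration, and the recomputation behind entropy_checkpointing is not shown.

```python
import math


def entropy_from_logits(row):
    """Entropy (in nats) of softmax(row), computed stably from raw logits."""
    m = max(row)  # subtract the max so exp() cannot overflow
    exps = [math.exp(x - m) for x in row]
    z = sum(exps)
    log_z = m + math.log(z)  # logsumexp(row)
    # H = logsumexp(logits) - sum_i p_i * logit_i
    expected_logit = sum((e / z) * x for e, x in zip(exps, row))
    return log_z - expected_logit


def entropy_with_chunking(logits, chunk_size=2):
    """Process rows chunk by chunk; intermediates for one chunk are
    discarded before the next chunk is handled (hypothetical helper)."""
    entropies = []
    for start in range(0, len(logits), chunk_size):
        chunk = logits[start:start + chunk_size]
        entropies.extend(entropy_from_logits(row) for row in chunk)
    return entropies


logits = [
    [0.0, 0.0, 0.0, 0.0],   # uniform over 4 tokens: entropy is log(4)
    [5.0, 0.0, 0.0, 0.0],
    [1.0, 2.0, 3.0, 4.0],
]
print(entropy_with_chunking(logits, chunk_size=2))
```

Chunking changes peak memory, not the result: any chunk size yields the same per-row entropies as processing all rows at once.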
What's Changed
vf_loss factor by @tongyx361 in [algo] fix: vf_loss factor #2016
None check in DataProto print_size() by @GHGmc2 in [Bug] fix None check in DataProto print_size() #2067
val_only by @hscspring in https://github.com/volcengine/verl/pull/20842083
False by @GHGmc2 in [rollout] refactor: Add option for rollout_log_probs, and default as False #2072
reward_extra_info optional in reward_result by @HollowMan6 in [trainer] fix: make reward_extra_info optional in reward_result #2109
New Contributors
Thank you all for joining this project!
@vickytsang @davidjsonn @rocke2020 @vwxyzjn @Yangruipis @SeungyounShin @donpromax @leopardracer @ZhiyuLi-Nvidia @LiyuanLucasLiu @Jianbing-D @wyz649296016 @htc070011 @syo093c @FightingZhen @zhihe-wang @KaiChen1998 @wizeng23 @RuixiangMa @davidmlw @feifeibear @hscspring @MaoChouHJM @AlecHenx @wang-zerui @kinza99 @nanjiangwill @zyfzjsc988 @Cccei000 @RanchiZhao @xuuHuang @ultmaster @xxnpark @jvmncs @xingyunjohn1
Full Changelog: v0.4.0...v0.4.1
This discussion was created from the release v0.4.1 patch release: checkpoint fixes for MoE EP & LoRA, OpenAI/MCP tool calling schema, and SGLang memory optimizations.