Actions: verl-project/verl
Actions
2,395 workflow runs
2,395 workflow runs
loss_mask.shape[-1] as in seq-mean-token-sum-norm
e2e_transferqueue
#3132:
Pull request #5417
opened
by
tongyx361
actor_rollout_ref.model.enable_gradient_checkpointing? Is this a deliberate design choice?
e2e_transferqueue
#3128:
Pull request #4263
synchronize
by
khazic
actor_rollout_ref.model.enable_gradient_checkpointing? Is this a deliberate design choice?
e2e_transferqueue
#3123:
Pull request #4263
synchronize
by
khazic
actor_rollout_ref.model.enable_gradient_checkpointing? Is this a deliberate design choice?
e2e_transferqueue
#3113:
Pull request #4263
synchronize
by
khazic
actor_rollout_ref.model.enable_gradient_checkpointing? Is this a deliberate design choice?
e2e_transferqueue
#3112:
Pull request #4263
synchronize
by
khazic