Skip to content

[training_utils] refactor: Extend response slicing to handle multi-dimensional model outputs #5747

[training_utils] refactor: Extend response slicing to handle multi-dimensional model outputs

[training_utils] refactor: Extend response slicing to handle multi-dimensional model outputs #5747

e2e_ppo_trainer_megatron-deepseek

succeeded Jan 18, 2026 in 19m 7s