[trainer] feat: Self-Normalized Importance Sampling (#3980) #43
e2e_transferqueue.yml
on: push
setup
8s
e2e_transferqueue_fsdp
4m 45s
e2e_transferqueue_megatron
6m 32s
cleanup
5s