[fsdp,vllm,trainer,algo] feat: On-Policy Distillation #4564
Annotations
1 error
Running GSM8K E2E training tests on 8 L20 GPUs with rmpad using function rm with validation and saving (DDP_SIZE=2, FSDP_SIZE=4)
Process completed with exit code 1.