
[fsdp, model]{fix} Allow flash attention 2 to be used for NemotronH model on FSDP #4566
