[algo] fix: seq mean and default scale factor loss_mask.shape[-1] as in seq-mean-token-sum-norm
#5944