This change adds support for diffusion models in which all linear layers use scaled fp8 weights while the remaining weights keep their original precision.
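For readers unfamiliar with the term, here is a minimal sketch of what a "scaled fp8" linear layer could look like. It assumes per-tensor scaling and the e4m3 format (largest finite value 448), and it emulates fp8 rounding in pure Python rather than using a real fp8 dtype. The names `round_to_e4m3`, `quantize_linear_weight`, and `scaled_linear` are hypothetical and are not taken from this change, whose actual implementation may differ.

```python
import math

FP8_E4M3_MAX = 448.0  # largest finite value in the e4m3 format (assumed format)


def round_to_e4m3(x: float) -> float:
    """Roughly emulate rounding to fp8 e4m3: clamp, then keep 4 significant bits.

    Simplification: subnormals and NaN handling are ignored.
    """
    if x == 0.0:
        return 0.0
    x = max(-FP8_E4M3_MAX, min(FP8_E4M3_MAX, x))
    m, e = math.frexp(x)          # x == m * 2**e with 0.5 <= |m| < 1
    m = round(m * 16) / 16        # 1 implicit + 3 stored mantissa bits
    return math.ldexp(m, e)


def quantize_linear_weight(weight):
    """Return (fp8-like weight, scale) for a 2-D weight given as nested lists."""
    max_abs = max(abs(v) for row in weight for v in row)
    # Per-tensor scale maps the largest magnitude onto the fp8 maximum.
    scale = max_abs / FP8_E4M3_MAX if max_abs > 0 else 1.0
    q_weight = [[round_to_e4m3(v / scale) for v in row] for row in weight]
    return q_weight, scale


def scaled_linear(x, q_weight, scale, bias=None):
    """Compute y = (q_weight * scale) @ x + bias for one input vector x."""
    out = []
    for i, row in enumerate(q_weight):
        acc = sum(xi * wi for xi, wi in zip(x, row)) * scale
        if bias is not None:
            acc += bias[i]        # bias stays in its original precision
        out.append(acc)
    return out


# Hypothetical usage: only the linear weight is quantized.
weight = [[0.5, -1.0], [2.0, 4.0]]
q_weight, scale = quantize_linear_weight(weight)
y = scaled_linear([1.0, 1.0], q_weight, scale)
```

In this sketch only the linear weight passes through the fp8 emulation; the bias and any other parameters are left untouched, which mirrors the split described above.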