* fix a bug in the attn_masked redux code when using weight=1.0
* oh shit wait there was another bug
89 KiB