Adding new tokens no longer makes a whole copy of the embeddings weight which can be massive on certain models.
12 KiB
12 KiB
Adding new tokens no longer makes a whole copy of the embeddings weight which can be massive on certain models.