Adding new tokens no longer makes a whole copy of the embeddings weight which can be massive on certain models.
25 KiB
25 KiB
Adding new tokens no longer makes a whole copy of the embeddings weight which can be massive on certain models.