Adding new tokens no longer makes a full copy of the embedding weights, which can be massive on certain models.