Adding new tokens no longer makes a whole copy of the embeddings weight which can be massive on certain models.
6.6 KiB
6.6 KiB
Adding new tokens no longer makes a whole copy of the embeddings weight which can be massive on certain models.