llama-swap/models at main - llama-swap - Gitea: Git with a cup of tea

andreas/llama-swap

Files

History

Benson Wong b63b81b121 first commit

2024-10-03 20:20:01 -07:00

..

.gitignore

first commit

2024-10-03 20:20:01 -07:00

README.md

first commit

2024-10-03 20:20:01 -07:00

README.md

TODO improve these docs

Download a llama-server suitable for your architecture
Fetch some small models for testing / swapping between
- huggingface-cli download bartowski/Qwen2.5-1.5B-Instruct-GGUF --include "Qwen2.5-1.5B-Instruct-Q4_K_M.gguf" --local-dir ./
- huggingface-cli download bartowski/Llama-3.2-1B-Instruct-GGUF --include "Llama-3.2-1B-Instruct-Q4_K_M.gguf" --local-dir ./
Create a new config.yaml file (see config.example.yaml) pointing to the models