first commit
3
models/.gitignore
vendored
Normal file
@@ -0,0 +1,3 @@
*
!.gitignore
!README.md
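The three rules above ignore everything in `models/` except the two listed files. A minimal Python sketch of that behaviour follows, assuming a simplified model of gitignore matching in which the last matching pattern wins and a leading `!` re-includes a file. The `is_ignored` helper is hypothetical and does not model directory or slash handling.

```python
from fnmatch import fnmatch

# Patterns copied from models/.gitignore, in file order.
PATTERNS = ["*", "!.gitignore", "!README.md"]


def is_ignored(filename: str, patterns=PATTERNS) -> bool:
    """Simplified gitignore check for a plain filename inside models/.

    The last matching pattern wins, and a leading "!" re-includes the file.
    Directory and slash handling from real gitignore rules is not modelled.
    """
    ignored = False
    for pattern in patterns:
        negated = pattern.startswith("!")
        glob = pattern[1:] if negated else pattern
        if fnmatch(filename, glob):
            ignored = not negated
    return ignored


if __name__ == "__main__":
    for name in ["Qwen2.5-1.5B-Instruct-Q4_K_M.gguf", "README.md", ".gitignore"]:
        print(name, "ignored" if is_ignored(name) else "tracked")
```

Under this model, downloaded `.gguf` files stay out of version control while the README and the ignore file itself remain tracked.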
7
models/README.md
Normal file
@@ -0,0 +1,7 @@
TODO improve these docs

1. Download a llama-server suitable for your architecture
1. Fetch some small models for testing / swapping between
- `huggingface-cli download bartowski/Qwen2.5-1.5B-Instruct-GGUF --include "Qwen2.5-1.5B-Instruct-Q4_K_M.gguf" --local-dir ./`
- `huggingface-cli download bartowski/Llama-3.2-1B-Instruct-GGUF --include "Llama-3.2-1B-Instruct-Q4_K_M.gguf" --local-dir ./`
1. Create a new config.yaml file (see `config.example.yaml`) pointing to the models
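The two download commands in the README can also be scripted. The sketch below is one possible approach, not part of the commit: it assumes the `huggingface_hub` Python package is installed and uses its `hf_hub_download` function in place of the CLI. The `MODELS` list, `missing_models`, and `fetch_models` are hypothetical names introduced for illustration.

```python
from pathlib import Path

# (repo_id, filename) pairs taken from the huggingface-cli commands in the README.
MODELS = [
    ("bartowski/Qwen2.5-1.5B-Instruct-GGUF", "Qwen2.5-1.5B-Instruct-Q4_K_M.gguf"),
    ("bartowski/Llama-3.2-1B-Instruct-GGUF", "Llama-3.2-1B-Instruct-Q4_K_M.gguf"),
]


def missing_models(models_dir: Path, models=MODELS):
    """Return the (repo_id, filename) pairs not yet present in models_dir."""
    return [(repo, name) for repo, name in models if not (models_dir / name).exists()]


def fetch_models(models_dir: Path = Path(".")) -> None:
    """Download any missing model files into models_dir."""
    # Imported lazily so the helpers above work without huggingface_hub installed.
    from huggingface_hub import hf_hub_download

    for repo, name in missing_models(models_dir):
        hf_hub_download(repo_id=repo, filename=name, local_dir=models_dir)


if __name__ == "__main__":
    # Run from inside models/ to mirror the README's `--local-dir ./`.
    fetch_models()
```

Skipping files that already exist keeps repeated runs cheap, which may be convenient when swapping between test models.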