first commit

2024-10-03 20:20:01 -07:00
commit b63b81b121
8 changed files with 280 additions and 0 deletions
--- a/models/.gitignore
+++ b/models/.gitignore
@@ -0,0 +1,3 @@
+*
+!.gitignore
+!README.md
--- a/models/README.md
+++ b/models/README.md
@@ -0,0 +1,7 @@
+TODO improve these docs
+
+1. Download a `llama-server` binary (part of llama.cpp) built for your OS and CPU architecture
+1. Fetch some small models for testing and for swapping between
+    - `huggingface-cli download bartowski/Qwen2.5-1.5B-Instruct-GGUF --include "Qwen2.5-1.5B-Instruct-Q4_K_M.gguf" --local-dir ./`
+    - `huggingface-cli download bartowski/Llama-3.2-1B-Instruct-GGUF --include "Llama-3.2-1B-Instruct-Q4_K_M.gguf" --local-dir ./`
+1. Create a new `config.yaml` file (see `config.example.yaml`) that points to the downloaded models