update readme

This commit is contained in:
Benson Wong
2024-12-09 19:14:49 -08:00
parent 5fbd53c616
commit e2443251ad

View File

@@ -16,6 +16,7 @@ Features:
- ✅ Run multiple models at once with `profiles`
- ✅ Remote log monitoring at `/log`
- ✅ Automatic unloading of models from GPUs after timeout
- ✅ Use any local server that provides an OpenAI compatible API (llama.cpp, vllm, tabblyAPI, etc)
## Releases