From 7f45493a3715c6a1463f82220ae7781cf5579c03 Mon Sep 17 00:00:00 2001
From: Benson Wong
Date: Tue, 17 Dec 2024 14:45:41 -0800
Subject: [PATCH] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index dfbfbba..2fc51e8 100644
--- a/README.md
+++ b/README.md
@@ -17,7 +17,7 @@ Features:
 - ✅ Remote log monitoring at `/log`
 - ✅ Automatic unloading of models from GPUs after timeout
 - ✅ Use any local OpenAI compatible server (llama.cpp, vllm, tabblyAPI, etc)
-- ✅ Direct access to proxied upstream HTTP server via `/upstream/:model_id`
+- ✅ Direct access to upstream HTTP server via `/upstream/:model_id` ([demo](https://github.com/mostlygeek/llama-swap/pull/31))
 
 ## Releases