Swapping models can take a long time, leaving the client with a long silence while the new model loads. Rather than loading the model silently in the background, this PR allows llama-swap to send status updates in the `reasoning_content` field of the streaming chat response.

Fixes: #366
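As a rough illustration (not the exact output of this PR), such a status update would ride in an ordinary OpenAI-style streaming chunk whose `delta` carries `reasoning_content`; the message text, id, and model name below are hypothetical:

```
data: {"id":"chatcmpl-status","object":"chat.completion.chunk","created":1700000000,"model":"my-model","choices":[{"index":0,"delta":{"reasoning_content":"llama-swap: loading model, please wait..."},"finish_reason":null}]}
```

Clients that already render `reasoning_content` (e.g. for reasoning models) should display these updates without any changes on their side.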