llama-swap/proxy/config/config_windows_test.go at a89b803d4acf29757ba4488e7babe6ff8aa62ca0

Benson Wong a89b803d4a Stream loading state when swapping models (#371)

Swapping models can take a long time, and the client receives nothing while the model is loading. Rather than silently loading the model in the background, this PR allows llama-swap to send status updates in the reasoning_content of a streaming chat response.

Fixes: #366

2025-10-29 00:09:39 -07:00
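
The idea in the commit message can be illustrated with a minimal sketch. This is not llama-swap's actual implementation: the type names and the `statusEvent` helper are hypothetical, and the chunk shape is assumed to follow the OpenAI-style streaming format, where each server-sent event carries a `delta` object. The sketch only shows how a status message could be wrapped in the `reasoning_content` field of one streamed chunk.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Hypothetical, minimal types for an OpenAI-style streaming chunk.
// A real chunk carries more fields (id, model, created, ...).
type delta struct {
	ReasoningContent string `json:"reasoning_content,omitempty"`
}

type choice struct {
	Index int   `json:"index"`
	Delta delta `json:"delta"`
}

type chunk struct {
	Object  string   `json:"object"`
	Choices []choice `json:"choices"`
}

// statusEvent (hypothetical helper) formats one server-sent event whose
// delta carries msg in reasoning_content instead of regular content.
func statusEvent(msg string) (string, error) {
	b, err := json.Marshal(chunk{
		Object:  "chat.completion.chunk",
		Choices: []choice{{Index: 0, Delta: delta{ReasoningContent: msg}}},
	})
	if err != nil {
		return "", err
	}
	// SSE framing: a "data: " line followed by a blank line.
	return fmt.Sprintf("data: %s\n\n", b), nil
}

func main() {
	// Illustrative status messages; the real wording is not taken from the PR.
	for _, msg := range []string{"loading model\n", "model ready\n"} {
		ev, err := statusEvent(msg)
		if err != nil {
			panic(err)
		}
		fmt.Print(ev)
	}
}
```

A client that renders reasoning output would then show these messages while waiting, and the regular `content` deltas would follow once the model is ready.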

5.9 KiB
