llama-swap/proxy/process.go at a89b803d4acf29757ba4488e7babe6ff8aa62ca0

Files

Benson Wong a89b803d4a Stream loading state when swapping models (#371 )

Swapping models can take a long time and leave a lot of silence while the model is loading. Rather than silently load the model in the background, this PR allows llama-swap to send status updates in the reasoning_content of a streaming chat response.

Fixes: #366

2025-10-29 00:09:39 -07:00

26 KiB

Raw Blame History

View Raw

26 KiB Raw Blame History

26 KiB

Raw Blame History