llama-swap/proxy/proxymanager_test.go at 2fceb78e8dac9dbe0b6f169e89bde693f7698f17

Benson Wong 73ad85ea69 Implement Multi-Process Handling (#7)

Refactor the code to support starting multiple backend llama.cpp servers. This functionality is exposed as `profiles` to keep the configuration format simple.

Changes: 

* refactor proxy tests to get ready for multi-process support
* update proxy/ProxyManager to support multiple processes (#7)
* add support for Groups in configuration
* improve handling of model alias configs
* implement multi-model swapping
* improve code clarity for swapModel
* improve docs, rename groups to profiles in config

2024-11-23 19:45:13 -08:00

2.1 KiB
