Benson Wong
baeb0c4e7f
Add cmd_stop configuration to better support docker ( #35 )
...
Add `cmd_stop` to model configuration to run a command instead of sending a SIGTERM to shut down a process before swapping.
2025-01-30 16:59:57 -08:00
Benson Wong
d6ca535939
tweak release tagging so it is not based on number of commits
2024-12-14 15:46:10 -08:00
Benson Wong
27302c0c02
change llama-swap to use goreleaser default ldflag values
2024-12-14 10:30:06 -08:00
Benson Wong
22d3f1a4f9
Change versioning to use git commit counts instead of semver
...
- less work for me
- more frequent releases
2024-12-14 09:53:13 -08:00
Benson Wong
c9233d2c9a
use gin instead of standard http lib in main
2024-11-18 15:58:28 -08:00
Benson Wong
c3b4bb1684
use gin for http server
2024-11-18 15:30:16 -08:00
Benson Wong
8eb5b7b6c4
Add custom check endpoint
...
Replace the previously hardcoded `/health` value used to check when the
server became ready to serve traffic. With this, llama-swap can support
any server that provides an OpenAI compatible inference endpoint.
2024-10-11 21:59:21 -07:00
Benson Wong
ef05c05f9c
renaming to llama-swap
2024-10-04 20:21:11 -07:00