Benson Wong
d6ca535939
tweak release tagging so it is not based on number of commits
2024-12-14 15:46:10 -08:00
Benson Wong
27302c0c02
change llama-swap to use goreleaser default ldflag values
2024-12-14 10:30:06 -08:00
Benson Wong
4c94927658
Move release to Makefile out of goreleaser
...
- less complexity
- easier
- goreleaser, github, pipelines: 1... mostlygeek: 0
2024-12-14 10:16:46 -08:00
Benson Wong
22d3f1a4f9
Change versioning to use git commits counts instead of semver
...
- less work for me
- more frequent releases
2024-12-14 09:53:13 -08:00
Benson Wong
533162ce6a
add support for automatically unloading a model ( #10 ) ( #14 )
...
* Make starting upstream process on-demand (#10 )
* Add automatic unload of model after TTL is reached
* add `ttl` configuration parameter to models in seconds, default is 0 (never unload)
2024-11-19 16:32:51 -08:00
Benson Wong
e5c909ddf7
add tests for proxy.Process
2024-11-17 20:49:14 -08:00
Benson Wong
8cf2a389d8
Refactor log implementation
...
- use []byte instead of unnecessary string conversions
- make LogManager.Broadcast private
- make LogManager.GetHistory public
- add tests
2024-10-31 12:16:54 -07:00
Benson Wong
ef05c05f9c
renaming to llama-swap
2024-10-04 20:21:11 -07:00
Benson Wong
e0103d1884
build simple-responder with make all
2024-10-04 12:14:10 -07:00
Benson Wong
d682589fb1
support environment variables
2024-10-04 11:55:27 -07:00
Benson Wong
aaca9d889b
add Makefile
2024-10-04 11:07:00 -07:00