Benson Wong
5dc6b3e6d9
Add barebones but working implementation of model preload (#209, #235)
* add config test for Preload hook
* improve TestProxyManager_StartupHooks
* docs for new hook configuration
* add .dev to .gitignore
2025-08-14 10:27:28 -07:00
g2mt
87dce5f8f6
Add metrics logging for chat completion requests (#195)
- Add token and performance metrics for v1/chat/completions
- Add Activity Page in UI
- Add /api/metrics endpoint
Contributed by @g2mt
2025-07-21 22:19:55 -07:00
Benson Wong
c867a6c9a2
Add name and description to v1/models list (#179)
* Add support for name and description in v1/models list
* add configuration example for name and description
2025-06-30 23:02:44 -07:00
Benson Wong
4236cec03a
Add Filters to Model Configuration (#174)
llama-swap can strip specific keys from JSON requests. This is useful for preventing clients from setting sampling parameters such as temperature, top_k, and top_p.
2025-06-23 10:52:29 -07:00
Benson Wong
4fa12a429c
Refactor all default config values into config.go (#162)
- Move all default values into one place.
- Update tests to be more cross-platform
2025-06-15 12:32:00 -07:00
Benson Wong
afc9aef058
Fix #133 SanitizeCommand removes comments (#134)
2025-05-15 15:28:50 -07:00
Benson Wong
d7b390df74
Add GH Action for Testing on Windows (#132)
* Add Windows-specific test changes
* Change the command line parsing library. Possible breaking changes for Windows users!
2025-05-14 21:51:53 -07:00