Benson Wong
|
f58c8c8ec5
|
Support llama.cpp's cache_n in timings info (#287)
Capture prompt cache metrics and surface them on Activities page in UI
|
2025-09-06 13:58:02 -07:00 |
|
Benson Wong
|
74c69f39ef
|
Add prompt processing metrics (#250)
- capture prompt processing metrics
- display prompt processing metrics on UI Activity page
|
2025-08-14 10:02:16 -07:00 |
|
g2mt
|
87dce5f8f6
|
Add metrics logging for chat completion requests (#195)
- Add token and performance metrics for v1/chat/completions
- Add Activity Page in UI
- Add /api/metrics endpoint
Contributed by @g2mt
|
2025-07-21 22:19:55 -07:00 |
|