Benson Wong
01d4838fb3
Fix token metrics parsing ( #199 )
...
Fix #198
- use llama-server's `timings` info if available in response body
- send "-1" for token/sec when not able to accurately calculate
performance
- optimize streaming body search for metrics information
2025-07-22 23:10:14 -07:00
..
2025-06-16 16:45:19 -07:00
2025-07-21 22:19:55 -07:00
2025-07-15 10:14:16 -07:00
2025-07-21 22:19:55 -07:00
2025-07-21 22:19:55 -07:00
2025-07-21 22:19:55 -07:00
2025-06-15 12:32:00 -07:00
2025-07-01 22:17:35 -07:00
2025-07-15 18:04:30 -07:00
2025-07-22 23:10:14 -07:00
2025-07-21 22:19:55 -07:00
2025-07-15 10:14:16 -07:00
2025-07-15 18:04:30 -07:00
2025-05-13 11:39:19 -07:00
2025-05-13 11:39:19 -07:00
2025-07-21 22:19:55 -07:00
2025-07-01 22:17:35 -07:00
2025-07-22 23:10:14 -07:00
2025-07-21 22:59:41 -07:00
2025-04-01 08:43:53 -07:00
2025-04-01 08:43:53 -07:00
2025-06-16 16:45:19 -07:00