| Author | Commit | Message | Date |
|---|---|---|---|
| g2mt | 87dce5f8f6 | Add metrics logging for chat completion requests (#195). Add token and performance metrics for v1/chat/completions; add Activity Page in UI; add /api/metrics endpoint. Contributed by @g2mt | 2025-07-21 22:19:55 -07:00 |
| Benson Wong | 29cd98878d | better container build logic when upstream containers do not exist | 2025-03-09 13:02:06 -07:00 |
| Benson Wong | 4ed58fb173 | update container build action | 2025-02-18 09:59:06 -08:00 |
| Benson Wong | f5a2be698d | revert package src until new ggml-org has them | 2025-02-15 18:23:58 -08:00 |
| Benson Wong | f5e6ec3b7a | fix package src in containerfile | 2025-02-15 18:20:35 -08:00 |
| Benson Wong | 3f462da146 | switch package source from ggerganov to ggml-org | 2025-02-15 18:18:49 -08:00 |
| Benson Wong | 92336f00bf | more container build fixes | 2025-02-14 15:34:38 -08:00 |
| Benson Wong | ed2a50d9a6 | fix bug in build-container.sh | 2025-02-14 15:27:56 -08:00 |
| Benson Wong | 96a8ea0241 | add cpu docker container build | 2025-02-14 15:25:45 -08:00 |
| Benson Wong | f20f2c9b7a | add docs and container build improvements #43 | 2025-02-14 12:20:07 -08:00 |
| Benson Wong | ddc1ce031e | fix container file name #46 | 2025-02-14 10:49:44 -08:00 |
| Benson Wong | 43e23c16dc | add check for GITHUB_TOKEN #46 | 2025-02-14 10:47:25 -08:00 |
| Benson Wong | f9c8e763ba | add execute bit on build-container.sh | 2025-02-14 10:44:53 -08:00 |
| Benson Wong | ab93460a8b | first container code (#52) | 2025-02-14 10:39:25 -08:00 |