Commit graph

4 commits

Author SHA1 Message Date
Parth Sareen
bab59072fb
launch: add plan-aware model gating (#16027) 2026-05-06 14:34:26 -07:00
Parth Sareen
d319227df0
server: cache show responses (#15967) 2026-05-05 14:40:18 -07:00
Parth Sareen
b6447caebc
launch: use vram bytes for model recommendations (#15885) 2026-04-29 18:40:14 -07:00
Parth Sareen
321cc8a2ba
server/launch: add model recommendations cache endpoint (#15868) 2026-04-28 17:09:04 -07:00