ollama/server
2026-05-10 14:08:57 -07:00
..
internal go: bump to 1.26 (#15904) 2026-05-03 23:24:35 -07:00
auth.go server: reject unexpected auth hosts (#13738) 2026-01-16 14:10:36 -05:00
auth_test.go server: reject unexpected auth hosts (#13738) 2026-01-16 14:10:36 -05:00
cloud_proxy.go cloud_proxy: for the web_search legacy path, flush on newlines (#14897) 2026-03-17 13:30:17 -07:00
cloud_proxy_test.go cloud_proxy: for the web_search legacy path, flush on newlines (#14897) 2026-03-17 13:30:17 -07:00
create.go refine implementation 2026-05-06 17:26:05 -07:00
create_test.go Clean up the manifest and modelpath (#13807) 2026-01-21 11:46:17 -08:00
download.go Clean up the manifest and modelpath (#13807) 2026-01-21 11:46:17 -08:00
fixblobs.go
fixblobs_test.go
gemma4_test.go refine implementation 2026-05-06 17:26:05 -07:00
images.go Merge remote-tracking branch 'upstream/main' into llama-runner-phase-0 2026-05-08 16:00:29 -07:00
images_test.go refine implementation 2026-05-06 17:26:05 -07:00
inference_request_log.go add ability to turn on debug request logging (#14106) 2026-03-19 17:08:17 -07:00
logprob.go logprob: add bytes to logprobs (#13068) 2025-11-13 13:49:25 -08:00
model.go create: Clean up experimental paths, fix create from existing safetensor model (#14679) 2026-04-07 08:12:57 -07:00
model_caches.go server: cache show responses (#15967) 2026-05-05 14:40:18 -07:00
model_recommendations.go launch: add plan-aware model gating (#16027) 2026-05-06 14:34:26 -07:00
model_recommendations_test.go launch: add plan-aware model gating (#16027) 2026-05-06 14:34:26 -07:00
model_resolver.go llama/compat: load Ollama-format GGUFs in llama-server 2026-05-06 17:26:05 -07:00
model_resolver_test.go Reapply "don't require pulling stubs for cloud models" again (#14608) 2026-03-06 14:27:47 -08:00
model_show_cache.go server: cache show responses (#15967) 2026-05-05 14:40:18 -07:00
model_show_cache_test.go server: cache show responses (#15967) 2026-05-05 14:40:18 -07:00
prompt.go gemma4: render differently based on model size 2026-04-15 14:37:16 -07:00
prompt_test.go gemma4: render differently based on model size 2026-04-15 14:37:16 -07:00
quantization.go runner: Remove CGO engines, use llama-server exclusively for GGML models 2026-05-06 17:26:05 -07:00
renderer_resolution.go gemma4: render differently based on model size 2026-04-15 14:37:16 -07:00
routes.go misc discovery fixes, and bos handling 2026-05-10 14:08:57 -07:00
routes_cloud_test.go revert context length warnings change (#15121) 2026-03-28 16:43:59 -07:00
routes_create_test.go New models (#15861) 2026-04-28 11:50:12 -07:00
routes_debug_test.go refine implementation 2026-05-06 17:26:05 -07:00
routes_delete_test.go Reapply "don't require pulling stubs for cloud models" again (#14608) 2026-03-06 14:27:47 -08:00
routes_generate_renderer_test.go sched: Model eviction for MLX 2026-03-16 17:40:29 -07:00
routes_generate_test.go refine implementation 2026-05-06 17:26:05 -07:00
routes_harmony_streaming_test.go sched: Model eviction for MLX 2026-03-16 17:40:29 -07:00
routes_list_test.go
routes_options_test.go refine implementation 2026-05-06 17:26:05 -07:00
routes_request_log_test.go add ability to turn on debug request logging (#14106) 2026-03-19 17:08:17 -07:00
routes_test.go modelfiles: fix /save command and add shortname for safetensors based models (#15413) 2026-04-08 21:05:39 -07:00
routes_web_experimental_test.go cloud_proxy: send ollama client version (#14769) 2026-03-10 15:53:25 -07:00
sched.go scheduler improvements 2026-05-08 12:16:35 -07:00
sched_test.go scheduler improvements 2026-05-08 12:16:35 -07:00
sparse_common.go
sparse_windows.go
test_home_test.go add ability to disable cloud (#14221) 2026-02-12 15:47:00 -08:00
upload.go Clean up the manifest and modelpath (#13807) 2026-01-21 11:46:17 -08:00