mirror of https://github.com/ollama/ollama.git synced 2026-07-11 10:14:28 +00:00

Daniel Hiltgen 87288ced4f New models (#15861)	2026-04-28 11:50:12 -07:00

* mlx: add laguna model support
* convert: support fp8 safetensors import

  Decode HF F8_E4M3 safetensors with block scale companions into GGUF-supported tensor types, and record which output tensors came from FP8 source weights.

  Use that source-precision metadata during create quantization: default FP8-sourced GGUFs to Q8_0, keep non-FP8 tensors at their original precision for Q8_0, and promote non-FP8 quantizable tensors to Q8_0 for Q4_K requests.

* ggml: add laguna model support
* server: preserve generate logprobs with builtin parsers

  Generate requests were dropping logprob-only chunks whenever a builtin parser buffered visible content. Chat already handled this case, but generate only forwarded chunks with visible response, thinking, or tool-call output.

  Keep generate chunks that carry logprobs even when the builtin parser has not flushed visible content yet, and add a regression test that exercises the behavior with a generic thinking parser.

* review comments - perf improvements
* ggml: implement nemotron 3 nano omni
* add poolside integration
* update poolside doc
* adapt to new cache setup
* fix test
* fix test

---------

Co-authored-by: Eva Ho <hoyyeva@gmail.com>

api	docs: document reasoning_effort support in OpenAI-compatible API (#14821)	2026-03-13 10:57:14 -07:00
capabilities	docs: update structured outputs doc for cloud (#15733)	2026-04-22 00:42:39 -07:00
images	docs: update hermes (#15655)	2026-04-17 14:20:59 -07:00
integrations	New models (#15861)	2026-04-28 11:50:12 -07:00
tools/extract-examples	docs: add docs for v1/responses and rework openai compat section (#13416)	2025-12-11 17:39:40 -08:00
api.md	Add experimental image generation fields to /api/generate (#13753)	2026-01-17 18:27:41 -08:00
cli.mdx	doc: update vscode doc (#15064)	2026-03-26 13:45:48 -07:00
cloud.mdx	add ability to disable cloud (#14221)	2026-02-12 15:47:00 -08:00
context-length.mdx	update context length format.	2026-02-10 17:06:05 -08:00
development.md	MLX: add header vendoring and remove go build tag (#14642)	2026-03-09 17:24:45 -07:00
docker.mdx	Enable Vulkan with a temporary opt-in setting (#12931)	2025-11-12 08:40:38 -08:00
docs.json	New models (#15861)	2026-04-28 11:50:12 -07:00
examples.md	docs: update readme and links (#12809)	2025-10-28 16:20:02 -07:00
faq.mdx	add ability to disable cloud (#14221)	2026-02-12 15:47:00 -08:00
favicon-dark.svg	docs: add docs for docs.ollama.com (#12805)	2025-10-28 13:18:48 -07:00
favicon.svg	docs: add docs for docs.ollama.com (#12805)	2025-10-28 13:18:48 -07:00
gpu.mdx	rocm: doc driver constraints (#14833)	2026-03-13 15:53:35 -07:00
import.mdx	docs: remove unsupported quantizations (#13982)	2026-01-31 12:46:20 -08:00
index.mdx	docs: ollama launch (#13852)	2026-01-23 23:18:50 -08:00
linux.mdx	rocm: update linux to v7.2 (#14391)	2026-03-09 08:26:55 -07:00
logo.svg	docs: add docs for docs.ollama.com (#12805)	2025-10-28 13:18:48 -07:00
macos.mdx	docs: update readme and links (#12809)	2025-10-28 16:20:02 -07:00
modelfile.mdx	Revert "runner: add token history sampling parameters to ollama runner (#14537)" (#14776)	2026-03-10 21:07:52 -07:00
ollama-logo.svg	docs: add docs for docs.ollama.com (#12805)	2025-10-28 13:18:48 -07:00
ollama.png	docs: add docs for docs.ollama.com (#12805)	2025-10-28 13:18:48 -07:00
openapi.yaml	docs: fix openapi schema for /api/ps and /api/tags endpoints (#14210)	2026-02-11 17:37:40 -08:00
quickstart.mdx	docs: make integrations more discoverable (#14301)	2026-02-17 13:27:25 -08:00
README.md	api: implement anthropic api (#13600)	2026-01-09 11:53:36 -08:00
styling.css	docs: add docs for docs.ollama.com (#12805)	2025-10-28 13:18:48 -07:00
template.mdx	docs: add docs for docs.ollama.com (#12805)	2025-10-28 13:18:48 -07:00
troubleshooting.mdx	rocm: doc driver constraints (#14833)	2026-03-13 15:53:35 -07:00
windows.mdx	ci: fix missing windows zip file (#14807)	2026-03-12 16:14:00 -07:00

README.md

Documentation

Getting Started

Reference

Resources