Mirror of https://github.com/ollama/ollama.git, synced 2026-05-13 14:27:00 +00:00
* mlx: add laguna model support
* convert: support fp8 safetensors import

  Decode HF F8_E4M3 safetensors with block scale companions into GGUF-supported tensor types, and record which output tensors came from FP8 source weights.

  Use that source-precision metadata during create quantization: default FP8-sourced GGUFs to Q8_0, keep non-FP8 tensors at their original precision for Q8_0, and promote non-FP8 quantizable tensors to Q8_0 for Q4_K requests.
* ggml: add laguna model support
* server: preserve generate logprobs with builtin parsers

  Generate requests were dropping logprob-only chunks whenever a builtin parser buffered visible content. Chat already handled this case, but generate only forwarded chunks with visible response, thinking, or tool-call output.

  Keep generate chunks that carry logprobs even when the builtin parser has not flushed visible content yet, and add a regression test that exercises the behavior with a generic thinking parser.
* review comments - perf improvements
* ggml: implement nemotron 3 nano omni
* add poolside integration
* update poolside doc
* adapt to new cache setup
* fix test
* fix test

---------

Co-authored-by: Eva Ho <hoyyeva@gmail.com>

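
The forwarding rule described in the commit message for generate requests can be sketched roughly as below. This is only an illustration of the stated behavior, not the actual server code: the `chunk` type, its fields, and `shouldForward` are hypothetical names invented for this example.

```go
package main

import "fmt"

// chunk is a hypothetical stand-in for one streamed piece of a generate
// response after it has passed through a builtin parser.
type chunk struct {
	Response  string    // visible response text flushed by the parser
	Thinking  string    // visible thinking text flushed by the parser
	ToolCalls []string  // parsed tool calls, if any
	Logprobs  []float64 // log probabilities attached to this chunk
}

// shouldForward reports whether a chunk is worth sending to the client.
// Before the fix described above, only visible output counted; the sketch
// also keeps chunks that carry nothing but logprobs.
func shouldForward(c chunk) bool {
	hasVisible := c.Response != "" || c.Thinking != "" || len(c.ToolCalls) > 0
	return hasVisible || len(c.Logprobs) > 0
}

func main() {
	// The parser is still buffering, so there is no visible text yet,
	// but the chunk carries a logprob and should not be dropped.
	buffered := chunk{Logprobs: []float64{-0.12}}
	fmt.Println(shouldForward(buffered))

	// A chunk with neither visible output nor logprobs can be skipped.
	fmt.Println(shouldForward(chunk{}))
}
```

Treating logprobs as one more reason to forward keeps the check a single boolean expression, so the buffering behavior of the parser itself does not need to change.
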
| Name | | |
|---|---|---|
| cogito.go | ||
| cogito_test.go | ||
| deepseek3.go | ||
| deepseek3_test.go | ||
| functiongemma.go | ||
| functiongemma_test.go | ||
| gemma4.go | ||
| gemma4_test.go | ||
| glm46.go | ||
| glm46_test.go | ||
| glm47.go | ||
| glm47_test.go | ||
| glmocr.go | ||
| laguna.go | ||
| laguna_test.go | ||
| lfm2.go | ||
| lfm2_test.go | ||
| ministral.go | ||
| ministral_test.go | ||
| nemotron3nano.go | ||
| nemotron3nano_test.go | ||
| olmo3.go | ||
| olmo3_test.go | ||
| olmo3_think.go | ||
| olmo3_think_test.go | ||
| parsers.go | ||
| parsers_test.go | ||
| qwen3.go | ||
| qwen3_test.go | ||
| qwen3coder.go | ||
| qwen3coder_test.go | ||
| qwen3vl.go | ||
| qwen3vl_nonthinking_test.go | ||
| qwen3vl_thinking_test.go | ||
| qwen35.go | ||
| qwen35_test.go | ||
| testhelpers_test.go | ||