ollama

mirror of https://github.com/ollama/ollama.git synced 2026-07-05 07:11:10 +00:00

History

Jeffrey Morgan acfb50d9af models: add cohere2_moe (Command A / North) to the MLX engine (#16670 ) Implements Cohere2MoeForCausalLM (e.g. CohereLabs/North-Mini-Code-1.0)		2026-06-16 23:15:21 -07:00
..
agent	x/cmd: enable web search and web fetch with flag (#13690 )	2026-01-12 13:59:40 -08:00
cmd	Reapply "don't require pulling stubs for cloud models" again (#14608 )	2026-03-06 14:27:47 -08:00
create	models: add cohere2_moe (Command A / North) to the MLX engine (#16670 )	2026-06-16 23:15:21 -07:00
imagegen	llama-server followups (#16353 )	2026-06-01 10:44:21 -07:00
internal/mlxthread	mlxthread: preserve the original stack when worker work panics	2026-06-09 00:39:19 -07:00
mlxrunner	models: add cohere2_moe (Command A / North) to the MLX engine (#16670 )	2026-06-16 23:15:21 -07:00
models	models: add cohere2_moe (Command A / North) to the MLX engine (#16670 )	2026-06-16 23:15:21 -07:00
safetensors	mlx: Support NVIDIA TensorRT Model Optimizer import (#15566 )	2026-04-27 18:28:10 -07:00
server	mlx: fix reported information in `ollama show` (#16289 )	2026-05-24 14:08:06 -07:00
tokenizer	runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031 )	2026-05-29 13:35:47 -07:00
tools	add ability to disable cloud (#14221 )	2026-02-12 15:47:00 -08:00
transfer	mlx: refined model push behavior (#15431 )	2026-05-08 14:25:30 -07:00