ollama

mirror of https://github.com/ollama/ollama.git synced 2026-05-13 14:27:00 +00:00

Jeffrey Morgan 03bf241c33 x/imagegen: add FP4 quantization support for image generation models (#13773)	2026-01-19 00:54:54 -08:00

Add --quantize fp4 support to ollama create for image generation models (flux2, z-image-turbo), using MLX's affine 4-bit quantization.

Changes:
- Add fp4 to validation in CreateImageGenModel
- Add FP4 case to quantizeTensor (group_size=32, bits=4, affine mode)
- Add GetQuantization() to WeightSource interface for dynamic params
- Update LoadLinearLayer to use quantization params from model metadata
client	x/imagegen: add FP4 quantization support for image generation models (#13773)	2026-01-19 00:54:54 -08:00
create.go	Fix `create` and `show` commands for experimental models (#13741)	2026-01-16 14:31:55 -08:00
create_test.go	Fix `create` and `show` commands for experimental models (#13741)	2026-01-16 14:31:55 -08:00
imagegen.go	x/imagegen: add FP4 quantization support for image generation models (#13773)	2026-01-19 00:54:54 -08:00
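
The latest commit message mentions affine 4-bit quantization with group_size=32 and bits=4. As a rough illustration of what group-wise affine quantization generally means, here is a minimal Go sketch. It is not the code from this repository and not MLX's implementation: the names quantGroup, quantizeAffine, and dequantize are hypothetical, codes are stored unpacked (one byte each) for readability, and the min/max scheme shown is the textbook variant, which may differ in detail from what MLX does.

```go
package main

import (
	"fmt"
	"math"
)

// Values taken from the commit message: group_size=32, bits=4.
const (
	groupSize = 32
	bits      = 4
)

// quantGroup is a hypothetical container for one quantized group.
type quantGroup struct {
	Codes []uint8 // one code per weight, stored unpacked, each in [0, 15]
	Scale float32 // step between adjacent quantization levels
	Bias  float32 // value represented by code 0 (the group minimum)
}

// quantizeAffine maps each group of weights onto 2^bits evenly spaced
// levels between that group's minimum and maximum.
func quantizeAffine(w []float32) ([]quantGroup, error) {
	if len(w) == 0 || len(w)%groupSize != 0 {
		return nil, fmt.Errorf("length %d is not a positive multiple of %d", len(w), groupSize)
	}
	maxCode := float32(int(1)<<bits - 1) // 15 for 4 bits
	groups := make([]quantGroup, 0, len(w)/groupSize)
	for start := 0; start < len(w); start += groupSize {
		chunk := w[start : start+groupSize]
		lo, hi := chunk[0], chunk[0]
		for _, v := range chunk {
			if v < lo {
				lo = v
			}
			if v > hi {
				hi = v
			}
		}
		scale := (hi - lo) / maxCode
		if scale == 0 {
			scale = 1 // constant group: every code is 0 and the bias carries the value
		}
		g := quantGroup{Codes: make([]uint8, groupSize), Scale: scale, Bias: lo}
		for i, v := range chunk {
			q := math.Round(float64((v - lo) / scale))
			// Clamp to guard against floating-point overshoot at the edges.
			g.Codes[i] = uint8(math.Min(math.Max(q, 0), float64(maxCode)))
		}
		groups = append(groups, g)
	}
	return groups, nil
}

// dequantize reconstructs approximate weights as code*scale + bias.
func dequantize(groups []quantGroup) []float32 {
	out := make([]float32, 0, len(groups)*groupSize)
	for _, g := range groups {
		for _, c := range g.Codes {
			out = append(out, float32(c)*g.Scale+g.Bias)
		}
	}
	return out
}

func main() {
	// Synthetic weights for illustration only.
	w := make([]float32, 2*groupSize)
	for i := range w {
		w[i] = float32(i)*0.01 - 0.3
	}
	groups, err := quantizeAffine(w)
	if err != nil {
		panic(err)
	}
	restored := dequantize(groups)
	fmt.Printf("groups=%d original=%.4f restored=%.4f\n", len(groups), w[5], restored[5])
}
```

Each group keeps its own scale and bias, so the reconstruction error for any weight is bounded by about half that group's scale. A smaller group size tracks local weight ranges more closely, at the cost of storing more per-group parameters.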