ollama/x/create
Jeffrey Morgan 03bf241c33
x/imagegen: add FP4 quantization support for image generation models (#13773)
Add --quantize fp4 support to ollama create for image generation models
(flux2, z-image-turbo), using MLX's affine 4-bit quantization.

Changes:
- Add fp4 to validation in CreateImageGenModel
- Add FP4 case to quantizeTensor (group_size=32, bits=4, affine mode)
- Add GetQuantization() to WeightSource interface for dynamic params
- Update LoadLinearLayer to use quantization params from model metadata
2026-01-19 00:54:54 -08:00
..
client x/imagegen: add FP4 quantization support for image generation models (#13773) 2026-01-19 00:54:54 -08:00
create.go Fix create and show commands for experimental models (#13741) 2026-01-16 14:31:55 -08:00
create_test.go Fix create and show commands for experimental models (#13741) 2026-01-16 14:31:55 -08:00
imagegen.go x/imagegen: add FP4 quantization support for image generation models (#13773) 2026-01-19 00:54:54 -08:00