mirror of https://github.com/ollama/ollama.git synced 2026-07-04 06:41:39 +00:00

History

Daniel Hiltgen dba1e27fa8 llama: enable FA on CUDA CC 6.x GPUs (#16994 ) Recent upstream Pascal kernel fixes let us compile native SM60/SM61 kernels again instead of relying on PTX JIT, so allow Flash Attention auto at runtime for CC 6.x devices. Fixes #16591 Fixes #16754		2026-07-02 17:11:39 -07:00
..
api	docs: redesign docs landing and integrations overview (#16807 )	2026-06-24 16:28:28 -04:00
capabilities	docs: document max think level (#16877 )	2026-06-23 15:29:15 -07:00
images	docs: redesign docs landing and integrations overview (#16807 )	2026-06-24 16:28:28 -04:00
integrations	launch: update hermes install urls to official (#16913 )	2026-06-25 16:22:19 -07:00
tools/extract-examples	docs: add docs for v1/responses and rework openai compat section (#13416 )	2025-12-11 17:39:40 -08:00
api.md	docs: document max think level (#16877 )	2026-06-23 15:29:15 -07:00
cli.mdx	docs: update docs examples to use Gemma 4 instead of Gemma 3 (#16607 )	2026-06-07 12:43:13 -07:00
cloud.mdx	docs(cloud): update retirement list (#17000 )	2026-07-01 19:43:14 -07:00
context-length.mdx	docs: update docs examples to use Gemma 4 instead of Gemma 3 (#16607 )	2026-06-07 12:43:13 -07:00
development.md	llama: enable FA on CUDA CC 6.x GPUs (#16994 )	2026-07-02 17:11:39 -07:00
docker.mdx	runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031 )	2026-05-29 13:35:47 -07:00
docs.json	docs: redesign docs landing and integrations overview (#16807 )	2026-06-24 16:28:28 -04:00
examples.md	docs: update readme and links (#12809 )	2025-10-28 16:20:02 -07:00
faq.mdx	llama: enable FA on CUDA CC 6.x GPUs (#16994 )	2026-07-02 17:11:39 -07:00
favicon-dark.svg	docs: add docs for docs.ollama.com (#12805 )	2025-10-28 13:18:48 -07:00
favicon.svg	docs: add docs for docs.ollama.com (#12805 )	2025-10-28 13:18:48 -07:00
gpu.mdx	rocm: remove no longer supported devices (#17010 )	2026-07-02 16:59:01 -07:00
import.mdx	docs: remove unsupported quantizations (#13982 )	2026-01-31 12:46:20 -08:00
index.mdx	docs: redesign docs landing and integrations overview (#16807 )	2026-06-24 16:28:28 -04:00
linux.mdx	rocm: update linux to v7.2 (#14391 )	2026-03-09 08:26:55 -07:00
logo.svg	docs: add docs for docs.ollama.com (#12805 )	2025-10-28 13:18:48 -07:00
macos.mdx	docs: update readme and links (#12809 )	2025-10-28 16:20:02 -07:00
modelfile.mdx	runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031 )	2026-05-29 13:35:47 -07:00
ollama-logo.svg	docs: add docs for docs.ollama.com (#12805 )	2025-10-28 13:18:48 -07:00
ollama.png	docs: add docs for docs.ollama.com (#12805 )	2025-10-28 13:18:48 -07:00
openapi.yaml	docs: document max think level (#16877 )	2026-06-23 15:29:15 -07:00
quickstart.mdx	docs: redesign docs landing and integrations overview (#16807 )	2026-06-24 16:28:28 -04:00
README.md	api: implement anthropic api (#13600 )	2026-01-09 11:53:36 -08:00
styling.css	fix capability grid dark mode style (#16907 )	2026-06-25 13:55:39 -04:00
template.mdx	docs: add docs for docs.ollama.com (#12805 )	2025-10-28 13:18:48 -07:00
troubleshooting.mdx	rocm: doc driver constraints (#14833 )	2026-03-13 15:53:35 -07:00
windows.mdx	CUDA: require driver 550 or newer for v12 (#16895 )	2026-06-25 08:46:00 -07:00

README.md

Documentation

Getting Started

Reference

Resources