ollama/docs
Daniel Hiltgen dba1e27fa8
llama: enable FA on CUDA CC 6.x GPUs (#16994)
Recent upstream Pascal kernel fixes let us compile native SM60/SM61 kernels again instead of relying on PTX JIT, so allow Flash Attention auto at runtime for CC 6.x devices.

Fixes #16591

Fixes #16754
2026-07-02 17:11:39 -07:00
| Name | Last commit | Date |
| --- | --- | --- |
| api | docs: redesign docs landing and integrations overview (#16807) | 2026-06-24 16:28:28 -04:00 |
| capabilities | docs: document max think level (#16877) | 2026-06-23 15:29:15 -07:00 |
| images | docs: redesign docs landing and integrations overview (#16807) | 2026-06-24 16:28:28 -04:00 |
| integrations | launch: update hermes install urls to official (#16913) | 2026-06-25 16:22:19 -07:00 |
| tools/extract-examples | docs: add docs for v1/responses and rework openai compat section (#13416) | 2025-12-11 17:39:40 -08:00 |
| api.md | docs: document max think level (#16877) | 2026-06-23 15:29:15 -07:00 |
| cli.mdx | docs: update docs examples to use Gemma 4 instead of Gemma 3 (#16607) | 2026-06-07 12:43:13 -07:00 |
| cloud.mdx | docs(cloud): update retirement list (#17000) | 2026-07-01 19:43:14 -07:00 |
| context-length.mdx | docs: update docs examples to use Gemma 4 instead of Gemma 3 (#16607) | 2026-06-07 12:43:13 -07:00 |
| development.md | llama: enable FA on CUDA CC 6.x GPUs (#16994) | 2026-07-02 17:11:39 -07:00 |
| docker.mdx | runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031) | 2026-05-29 13:35:47 -07:00 |
| docs.json | docs: redesign docs landing and integrations overview (#16807) | 2026-06-24 16:28:28 -04:00 |
| examples.md | docs: update readme and links (#12809) | 2025-10-28 16:20:02 -07:00 |
| faq.mdx | llama: enable FA on CUDA CC 6.x GPUs (#16994) | 2026-07-02 17:11:39 -07:00 |
| favicon-dark.svg | docs: add docs for docs.ollama.com (#12805) | 2025-10-28 13:18:48 -07:00 |
| favicon.svg | docs: add docs for docs.ollama.com (#12805) | 2025-10-28 13:18:48 -07:00 |
| gpu.mdx | rocm: remove no longer supported devices (#17010) | 2026-07-02 16:59:01 -07:00 |
| import.mdx | docs: remove unsupported quantizations (#13982) | 2026-01-31 12:46:20 -08:00 |
| index.mdx | docs: redesign docs landing and integrations overview (#16807) | 2026-06-24 16:28:28 -04:00 |
| linux.mdx | rocm: update linux to v7.2 (#14391) | 2026-03-09 08:26:55 -07:00 |
| logo.svg | docs: add docs for docs.ollama.com (#12805) | 2025-10-28 13:18:48 -07:00 |
| macos.mdx | docs: update readme and links (#12809) | 2025-10-28 16:20:02 -07:00 |
| modelfile.mdx | runner: Remove CGO engines, use llama-server exclusively for GGML models (#16031) | 2026-05-29 13:35:47 -07:00 |
| ollama-logo.svg | docs: add docs for docs.ollama.com (#12805) | 2025-10-28 13:18:48 -07:00 |
| ollama.png | docs: add docs for docs.ollama.com (#12805) | 2025-10-28 13:18:48 -07:00 |
| openapi.yaml | docs: document max think level (#16877) | 2026-06-23 15:29:15 -07:00 |
| quickstart.mdx | docs: redesign docs landing and integrations overview (#16807) | 2026-06-24 16:28:28 -04:00 |
| README.md | api: implement anthropic api (#13600) | 2026-01-09 11:53:36 -08:00 |
| styling.css | fix capability grid dark mode style (#16907) | 2026-06-25 13:55:39 -04:00 |
| template.mdx | docs: add docs for docs.ollama.com (#12805) | 2025-10-28 13:18:48 -07:00 |
| troubleshooting.mdx | rocm: doc driver constraints (#14833) | 2026-03-13 15:53:35 -07:00 |
| windows.mdx | CUDA: require driver 550 or newer for v12 (#16895) | 2026-06-25 08:46:00 -07:00 |