mirror of
https://github.com/ollama/ollama.git
synced 2026-07-05 07:11:10 +00:00
Sampler.Distribution built row i as if draftTokens[:i] were appended, leaving a single-row proposal call with no draft history, so a drafter skipped the repeat/presence penalties the target's validation applies and re-proposed penalized tokens. Align rows with the end of the draft chain instead: the final row sees every draft token, each earlier row one fewer. |
||
|---|---|---|
| .. | ||
| logprob_test.go | ||
| sample.go | ||
| sample_test.go | ||