llama-swap/examples/speculative-decoding/README.md

# Qwen 2.5 Coder with a Draft Model

Using a small draft model like qwen-2.5-coder-0.5B can have a big impact on the performance of the larger 32 billion parameter model.
	`# Qwen 2.5 Coder with a Draft Model`

	`Using a small draft model like qwen-2.5-coder-0.5B can have a big impact on the performance of the larger 32 billion parameter model.`