Add examples
This commit is contained in:
3
examples/speculative-decoding/README.md
Normal file
3
examples/speculative-decoding/README.md
Normal file
@@ -0,0 +1,3 @@
|
||||
# Qwen 2.5 Coder with a Draft Model
|
||||
|
||||
Using a small draft model like qwen-2.5-coder-0.5B can have a big impact on the performance of the larger 32 billion parameter model.
|
||||
Reference in New Issue
Block a user