Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Back to article
habr.com
4w
4 weeks ago
Qwen3.6 MTP весит на 0.3 Гб больше, а даёт ускорение в ~2 раза. С 60 t/s до 130 t/s для Qwen3.6 27B без искажений
(opens in new tab)
Covers
4 stories
See all stories this covers
including
Hugging Face – Fun chat with your own Artificial Intelligence
Love
Like
Not for me
Save
|
|
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Covers 4 related stories
huggingface.co
·
72w
72 weeks ago
Hugging Face – Fun chat with your own Artificial Intelligence
Discussed on
r/webdev
and
DEV
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Hugging Face – Fun chat with your own Artificial Intelligence
GitHub
·
6w
6 weeks ago
llama + spec: MTP Support by am17an · Pull Request #22673
Discussed on
Hacker News
and
r/LocalLLaMA
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for llama + spec: MTP Support by am17an · Pull Request #22673
huggingface.co
·
5w
5 weeks ago
Used over a million tokens in three separate sessions to test Qwen 3.6 35b (new Multi-token Prediction version)
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Used over a million tokens in three separate sessions to test Qwen 3.6 35b (new Multi-token Prediction version)
GitHub
·
26w
26 weeks ago
[Speculative decoding] feat: add EAGLE3 speculative decoding support by ichbinhandsome · Pull Request #18039 · ggml-org/llama.cpp
Discussed on
r/LocalLLaMA
and
r/LocalLLaMA
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for [Speculative decoding] feat: add EAGLE3 speculative decoding support by ichbinhandsome · Pull Request #18039 · ggml-org/llama.cpp
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report