Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Back to article
Xiangpeng's blog
1w
1 week ago
A system programmer’s guide to LLM inference
(opens in new tab)
Covers
3 stories
See all stories this covers
including
Alibaba open-sources Qwen3.6-35B-A3B, a 35B MoE model with 3B active parameters
Discussed on
Hacker News
Love
Like
Not for me
Save
|
|
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Feeds
Xiangpeng's blog
blog.xiangpeng.systems
A system programmer’s guide to LLM inference
1w
1 week ago
Four mistakes in my PhD
8w
8 weeks ago
parquet-linter: A better Parquet is Parquet itself
17w
17 weeks ago
Hacker News - Newest: "LLM"
hnrss.org
Flat per-call LLM API gateway (20x cheaper than Claude Max)
1h
1 hour ago
Googles specification (and tooling) for the LLM wiki
1h
1 hour ago
ZhuLinsen/daily_stock_analysis: LLM驱动的 A/H/美股智能分析器,多数据源行情 + 实时新闻 + Gemini 决策仪表盘 + 多渠道推送,零成本,纯白嫖,定时运行
3h
3 hours ago
+5 more in the past day
Hacker News: Newest
hnrss.org
Jürgen Habermas Defended Reason in a Darkening Age
17m
17 minutes ago
ZSTD –auto it picks the optimal compression level for you
21m
21 minutes ago
Anthropic uses Persona for identity verification
24m
24 minutes ago
+355 more in the past day
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report