Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI
馃 AI
ai research, ai tools, LLM advancement, ai development
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
116
posts in
7.9
ms
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
聽
馃К
Genetics
聽
Content type:
Code
github.com
路
4d
4 days ago
路
Hacker News
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
Why LLMs (still) lack taste
聽
馃К
Genetics
beyondtheprior.com
路
1d
1 day ago
路
Hacker News
Actions for Why LLMs (still) lack taste
RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step
LLM
Inference
聽
馃К
Genetics
聽
Content type:
Academic
arxiv.org
路
17h
17 hours ago
Actions for RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference
Inferoa
AI
harness claimed 90% cache savings. We ran it and measured 97.8%
聽
馃К
Genetics
zozo123.github.io
路
11h
11 hours ago
路
Hacker News
Actions for Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
Siri
AI
at WWDC 2026
聽
馃К
Bioinformatics
simonwillison.net
路
1d
1 day ago
路
Hacker News
Actions for Siri AI at WWDC 2026
How we fight GPU scarcity without compromise
聽
馃К
Genetics
聽
Content type:
Blog
equixly.com
路
5d
5 days ago
路
Hacker News
Actions for How we fight GPU scarcity without compromise
How to Build an Agentic
RAG
with RubyLLM and Rails
聽
馃К
Genetics
聽
Content type:
Blog
panasiti.me
路
14h
14 hours ago
路
Hacker News
Actions for How to Build an Agentic RAG with RubyLLM and Rails
OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.
聽
馃敩
translational medicine
聽
Content type:
Blog
huggingface.co
路
2d
2 days ago
路
Hacker News
,
r/LocalLLaMA
Actions for OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.
The Missing Link Between Agents and Applications
聽
馃К
Bioinformatics
聽
Content type:
Blog
langchain.com
路
3h
3 hours ago
路
Hacker News
Actions for The Missing Link Between Agents and Applications
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
聽
馃К
Genetics
聽
Content type:
News
newsletter.semianalysis.com
路
1d
1 day ago
路
Hacker News
Actions for DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
Machinic
Psychopharmacology: Do LLMs Self-Medicate?
聽
馃敩
translational medicine
lesswrong.com
路
7h
7 hours ago
路
Hacker News
Actions for Machinic Psychopharmacology: Do LLMs Self-Medicate?
Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
聽
馃敩
translational medicine
deemwar-products.github.io
路
5d
5 days ago
路
Hacker News
Actions for Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
DiffusionGemma: 4x Faster Text Generation
聽
馃К
Genetics
聽
Content type:
News
聽
Content type:
Blog
blog.google
路
5h
5 hours ago
路
Hacker News
,
r/LocalLLaMA
,
r/singularity
Actions for DiffusionGemma: 4x Faster Text Generation
DiffusionGemma: The
Developer
Guide- Google Developers Blog
聽
馃К
Genetics
聽
Content type:
Blog
developers.googleblog.com
路
21h
21 hours ago
路
r/LocalLLaMA
Actions for DiffusionGemma: The Developer Guide- Google Developers Blog
Apple rebuilt its on-device
AI
stack at WWDC 2026
聽
馃К
Genetics
聽
Content type:
Blog
ziraph.com
路
1d
1 day ago
路
Hacker News
Actions for Apple rebuilt its on-device AI stack at WWDC 2026
Nvidia Nemotron 3 Ultra
聽
馃К
Genetics
research.nvidia.com
路
6d
6 days ago
路
Hacker News
Actions for Nvidia Nemotron 3 Ultra
1-bit and 1.58 bit
LLM
Benchmarking on Jetson Orin Nano Super | Bonsai LM
聽
馃К
Genetics
smolhub.com
路
2d
2 days ago
路
r/LocalLLaMA
Actions for 1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation
聽
馃К
Bioinformatics
聽
Content type:
Academic
arxiv.org
路
17h
17 hours ago
Actions for Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation
Mini Shai-Hulud, Miasma, and Hades Worms Target Bioinformatics and MCP
Developers
via Malicious PyPI Wheels
聽
馃К
Bioinformatics
聽
Content type:
Blog
socket.dev
路
2d
2 days ago
路
Hacker News
Actions for Mini Shai-Hulud, Miasma, and Hades Worms Target Bioinformatics and MCP Developers via Malicious PyPI Wheels
Introducing Granite Libraries and Project Granite Switch
聽
馃К
Bioinformatics
聽
Content type:
Blog
research.ibm.com
路
6d
6 days ago
路
Hacker News
Actions for Introducing Granite Libraries and Project Granite Switch
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help