AI

ai research, ai tools, LLM advancement, ai development

Feeds to Scour
SubscribedAll
Scoured 116 posts in 7.9 ms

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

馃КGeneticsContent type: Code
github.comHacker News

Why LLMs (still) lack taste

馃КGenetics

RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference

馃КGeneticsContent type: Academic
arxiv.org

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

馃КGenetics
zozo123.github.ioHacker News

Siri AI at WWDC 2026

馃КBioinformatics

How we fight GPU scarcity without compromise

馃КGeneticsContent type: Blog
equixly.comHacker News

How to Build an Agentic RAG with RubyLLM and Rails

馃КGeneticsContent type: Blog
panasiti.meHacker News

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

馃敩translational medicineContent type: Blog

The Missing Link Between Agents and Applications

馃КBioinformaticsContent type: Blog
langchain.comHacker News

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

馃КGeneticsContent type: News

Machinic Psychopharmacology: Do LLMs Self-Medicate?

馃敩translational medicine
lesswrong.comHacker News

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

馃敩translational medicine

DiffusionGemma: 4x Faster Text Generation

馃КGeneticsContent type: NewsContent type: Blog

DiffusionGemma: The Developer Guide- Google Developers Blog

馃КGeneticsContent type: Blog

Apple rebuilt its on-device AI stack at WWDC 2026

馃КGeneticsContent type: Blog
ziraph.comHacker News

Nvidia Nemotron 3 Ultra

馃КGenetics

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

馃КGenetics
smolhub.comr/LocalLLaMA

Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation

馃КBioinformaticsContent type: Academic
arxiv.org

Mini Shai-Hulud, Miasma, and Hades Worms Target Bioinformatics and MCP Developers via Malicious PyPI Wheels

馃КBioinformaticsContent type: Blog
socket.devHacker News

Introducing Granite Libraries and Project Granite Switch

馃КBioinformaticsContent type: Blog
research.ibm.comHacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help