LLMs

Feeds to Scour
SubscribedAll
Scoured 309 posts in 5.9 ms

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖AI  Content type: Code
github.com··Hacker News, r/LLM

Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering

 🤖AI  Content type: Academic
biorxiv.org·

Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training

 🤖AI  Content type: Academic
arxiv.org·

6. Air-Gapped Claude Code - The Claude Code SRE Handbook

 ⚙️DevOps

Intelligent inference scheduling with llm-d on Red Hat AI

 🤖AI
developers.redhat.com·

Why LLMs (still) lack taste

 ⚙️DevOps

Claude vs GPT-4: Which AI API Is Better for Developers? (2026)

 🐍Python
kalyna.pro··DEV

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

 🔌APIs
everylocalai.com··DEV

Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent

 🏃Running  Content type: News

Why Your LLM Gets Dumber With More Context

 🤖AI
siliconopera.com·

What Ollama Reveals About Local AI, Agents, and Open Models

 🤖AI  Content type: Blog
odsc.medium.com·

The smartest ChatGPT users are putting local AI in front of it — here's why

 🤖AI
tomsguide.com
·

Fixing a stuck Ollama runner and building a GPU watchdog

 System programming

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?

 🤖AI

Built and launched a research-reading and highlighting tool with Claude over a few months. Here are the things AI was surprisingly good (and bad) at.

 🤖AI
highlyt.app··r/ClaudeAI

Improved performance and model support with GGUF

 🤖AI  Content type: Blog
ollama.com·

MCP Architecture Explained for Beginners: Why AI Needs a Structured Communication System

 🤖AI  Content type: Blog
medium.com
·

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

 🇬🇧London Tech  Content type: Blog
bric.pe.kr··DEV

Large companies can add a local LLM filter layer to considerably reducing their AI costs

 🤖AI

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

 🤖AI

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help