LLMs


harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖AI  Content type: Code
github.com··Hacker News, r/LLM

Intelligent inference scheduling with llm-d on Red Hat AI

 ⚙️Systems Programming
developers.redhat.com·

Cross-LLM Consistency in Inference: Evidence from Shared Interactions

 🔧Compilers  Content type: Academic
arxiv.org·

AI chatbots mimic fear, sadness and stress, then calm down after mindfulness exercise

 🤖AI
medicalxpress.com·

Fine-tuning Large Language Models (LLMs) using PEFT

 🤖AI  Content type: Blog
medium.com·

Report: GKE Inference Gateway delivers up to 92% faster AI responses

 🤖AI  Content type: Blog

How LLMs work | Practical Leaders

 🤖AI

How to Build Financial Services AI Agents with Claude

 🤖AI  Content type: Blog
odsc.medium.com·

Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering

 🔧Compilers  Content type: Academic
biorxiv.org·

Orchestrate your LLM pipeline. Locally

 🤖AI
llmforge.app··Hacker News

How Effective Are LLM Trading Agents?

 🤖AI  Content type: News  Content type: Blog

Why LLMs (still) lack taste

 🤖AI

Law Professors Prefer AI over Peer Answers

 🤖AI  Content type: Academic

How Large Language Models Are Creating New Security Challenges

 🔧Compilers  Content type: Blog
medium.com·

LLM Routing: From Strategy Selection to Production Architecture

 🤖AI  Content type: Blog
blog.n8n.io·

AI Evaluation: How to Test LLM Applications Properly

 🤖AI  Content type: Blog
medium.com·

High Bandwidth Flash | A New Memory for AI Data Centers and Edge Computing | Sandisk

 🤖AI
ncnonline.net·

RAG Pipeline Explained: From Query to Answer, Step by Step

 🗄️Databases  Content type: Blog
medium.com·

Stop Wasting GPU Budget: Autoscaling AI Inference on Kubernetes with KEDA

 ⚙️Systems Programming
cloudnativenow.com·

The Inference Alpha: Maximizing Frontier Models on AMD

 🤖AI  Content type: Blog
digitalocean.com·
