AI


SLUUG Talk: Demystifying Large Language Models on Linux

 🤖LLM  Content type: Code

LLM KV Cache Optimization, Open Model Evaluation, & Agent Engineering Skills for Local Deployment

 🤖LLM  Content type: Blog
dev.to··DEV

DiffusionGemma: 4x Faster Text Generation

 🤖LLM  Content type: News  Content type: Blog  21 articles covering this post

PyTorch from Scratch — Part 1: Tensors, Gradients & Activations

 Rust
x.com··DEV

Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026

 Vite  Content type: Blog
runaihome.com··DEV

Teaching a Reranker the Language of Security Tickets (+41% MRR@10)

 🤖LLM

Three sleep intervals for three APIs: Steam 250ms, GitHub 100ms, HuggingFace none

 🔄TanStack Query  Content type: Reference

FlashAttention Explained: The Optimization That Made Modern LLMs Practical

 🤖LLM  Content type: Blog
dev.to··DEV

Stop Downloading 8GB Models on Every Pod Restart - Use OCI Object Storage as a Model Cache

 🚀DevOps  Content type: Blog
dev.to··DEV

Flowork: Self-Hosted AI Stack with Sovereign Agent OS and LLM Gateway

 🚀DevOps  Content type: Blog
dev.to··DEV

Why JAX Is a Much Better Backend for Quantum Circuit Simulation Than PyTorch

 🔶Svelte  Content type: Code
github.com··DEV

8GB to 70B: A Real Hardware Guide for Local LLMs

 🤖LLM  Content type: Blog
dev.to··DEV

Run Codex CLI with Local LLM - Gemma4 with llama.cpp on WSL2

 🤖LLM  Content type: Blog
dev.to··DEV

Token Cost Optimization: How to Cut LLM Inference Spend Without Cutting Quality

 🤖LLM  Content type: Blog
dev.to··DEV

I Made Two AI Models Fight Each Other. They Agreed Way Too Much.

 🤖LLM  Content type: Blog
dev.to··DEV

Local Ai Deployment Cost Analysis 2024

 🤖LLM  Content type: Blog
dev.to··DEV

RFC: pluggable publisher verification as a trust tier for community skills · Issue #40555 · NousResearch/hermes-agent

 Vite  Content type: Discussion  Content type: Code
github.com··DEV

Quantization formats compared: GGUF vs GPTQ vs AWQ vs NF4

 🤖LLM  Content type: Blog
dev.to··DEV

Mixture of Experts (MoE): what it actually does under the hood, and when it pays off

 🤖LLM  Content type: Blog
dev.to··DEV

I Built a Python Agent That Uses a Vector DB as Memory, Not Retrieval

 🤖LLM  Content type: Blog
dev.to··DEV
