AI new technology

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

馃AIContent type: Code
github.comHacker News, r/LLM

Orchestrate your LLM pipeline. Locally

馃AI
llmforge.appHacker News

UniSVQ: 2-bit Unified Scalar-Vector Quantization

馃AIContent type: Academic
arxiv.org

Google's new open-weights model brings image-generation tricks to AI text generation

馃AIContent type: News
theregister.com

Qwen 3.6 27B AutoRound GGUF, need your feedback

馃AI native
huggingface.cor/LocalLLaMA

Intelligent inference scheduling with llm-d on Red Hat AI

馃AI
developers.redhat.com

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

馃AIContent type: NewsContent type: Blog
blog.googleHacker News

6. Air-Gapped Claude Code - The Claude Code SRE Handbook

馃AI native
har-ki.github.ioHacker News

Why LLMs (still) lack taste

馃AI

Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon

馃AI native
xda-developers.com

What's in the Box? A Field Guide to AI Models

馃AIContent type: Blog
iankduncan.com

local llm on laptop 780M GPU using llama + gemma 4 qat

馃AI native Content type: Blog
alper.bearblog.dev

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

馃AI native Content type: Discussion
news.ycombinator.comHacker News

Anthropic Reverses Course on Hidden AI Restrictions Following Developer Backlash

馃AI native
devops.com

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

馃AI

If LLMs are all persona, whose persona are they?

馃AI native

Introducing the Third Generation of Apple's Foundation Models

馃AI

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

馃AI native Content type: Blog
bric.pe.krDEV

#070 - Anthropic walks back Fable 5's throttle, Claude Desktop hides a 1.8GB VM, HTML doubles signups

馃AI
indiehacker.news

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

馃AIContent type: Blog
adambien.blog
