AI new techology

Feeds to Scour
SubscribedAll
Scoured 581 posts in 7.6 ms

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

馃AIContent type: Code
github.comHacker News, r/LLM

Orchestrate your LLM pipeline. Locally

馃AI
llmforge.appHacker News

UniSVQ: 2-bit Unified Scalar-Vector Quantization

馃AIContent type: Academic
arxiv.org

Intelligent inference scheduling with llm-d on Red Hat AI

馃AI
developers.redhat.com

Qwen 3.6 27B AutoRound GGUF, need your feedback

馃AI native
huggingface.cor/LocalLLaMA

6. Air-Gapped Claude Code - The Claude Code SRE Handbook

馃AI native
har-ki.github.ioHacker News

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

馃AIContent type: NewsContent type: Blog
blog.googleHacker News

Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon

馃AI native
xda-developers.com

Why LLMs (still) lack taste

馃AI

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

馃AI native Content type: Discussion
news.ycombinator.comHacker News

If LLMs are all persona, whose persona are they?

馃AI native

Google's new open-weights model brings image-generation tricks to AI text generation

馃AIContent type: News
theregister.com

Foundation Models: Apple Isn鈥檛 Building an AI Model. It鈥檚 Building an AI Platform.

馃AIContent type: Blog
medium.com

local llm on laptop 780M GPU using llama + gemma 4 qat

馃AI native Content type: Blog
alper.bearblog.dev

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?

馃AI native

What's in the Box? A Field Guide to AI Models

馃AIContent type: Blog
iankduncan.com

Making a Vintage LLM from Scratch

馃AI
crlf.linkHacker News

Introducing the Third Generation of Apple鈥檚 Foundation Models

馃AI

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

馃AI

Google open-sources speedy DiffusionGemma text diffusion model

馃AI
siliconangle.com

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help