LLMs

Feeds to Scour
SubscribedAll
Scoured 526 posts in 6.3 ms

How we fight GPU scarcity without compromise

 🚀Inference  Content type: Blog
equixly.com··Hacker News

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

 🤖AI Engineering
aermia.com··Hacker News

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

 🤖AI Engineering  Content type: Blog
adambien.blog·

Using local LLMs for agentic coding

 🤖AI Engineering  Content type: Blog
blog.alexewerlof.com·

Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.

 🤖AI Engineering  Content type: Code
github.com··DEV

How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an LLM?

 🤖AI Engineering  Content type: Blog
medium.com·

Large companies can add a local LLM filter layer to considerably reducing their AI costs

 🤖AI Engineering

The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models

 🤖AI Engineering  Content type: Academic
arxiv.org·

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

 🤖AI Engineering

Why Shrinking an AI Model Often Makes It More Useful

 🤖AI Engineering
siliconopera.com·

Running Ollama on a 15W CPU sounded ridiculous until I got it working with decent results

 🤖AI Engineering
xda-developers.com·

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

 🚀Inference  Content type: Blog
dnhkng.github.io·

MLPerf and the rise of latency-aware LLM benchmarking

 🤖AI Engineering
edn.com·

LLM AI Chatbots are letting me down every single day

 🤖AI Engineering

Deep Learning Weekly: Issue 458

 🤖AI Engineering

Alignment Defends LLMs from Property Inference Attacks

 🤖AI Engineering  Content type: Academic
arxiv.org·

I built an open-source persistent memory layer for AI coding agents

 🤖AI Engineering  Content type: Code

LLM Research Papers: The 2026 List (January to May)

 🤖AI Engineering  Content type: News

MechLens: Late Crystallization of Factual Knowledge Explains Intervention Effectiveness in Language Models

 🤖AI Engineering  Content type: Academic
arxiv.org·

techjarves/Portable-AI-USB: A 100% offline, fully portable, zero-trace AI (Ollama + Llama 3 + AnythingLLM) that runs natively from a USB drive on Windows and Mac.

 🤖AI Engineering  Content type: Code
github.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help