LLMs

large language models, GPT, inference, transformers

Feeds to Scour
SubscribedAll
Scoured 566 posts in 9.5 ms

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

 🖥️Systems Programming
aermia.com··Hacker News

Your AI agent reads the fine print: building a RAG pipeline over EU regulations with Elasticsearch and OGX

 🤖Coding Agents  Content type: Blog
elastic.co·

AI Glossary

 🤖Coding Agents  Content type: Blog
0xdf.gitlab.io·

UniSVQ: 2-bit Unified Scalar-Vector Quantization

 ⚙️Compilers  Content type: Academic
arxiv.org·

It’s safe to close your laptop now: Hosting coding agents on Amazon Bedrock AgentCore

 🤖Coding Agents  Content type: Blog
aws.amazon.com·

Running LLM Inference on Kubernetes: What It Actually Takes

 λType Systems  Content type: Blog
fairwinds.com·

I built a tool to figure out what an AI agent actually costs per run, and the numbers surprised me

 🤖Coding Agents

local llm on laptop 780M GPU using llama + gemma 4 qat

 ⚙️Compilers  Content type: Blog
alper.bearblog.dev·

How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an LLM?

 📐PL Design  Content type: Blog
medium.com·

What Are Tokens in LLMs?

 λType Systems  Content type: Blog

LLM Research Papers: The 2026 List (January to May)

 ⚙️Compilers  Content type: News

Alignment Defends LLMs from Property Inference Attacks

 ⚙️Compilers  Content type: Academic
arxiv.org·

The hidden bottleneck in LLM inference and the impact on MLPerf benchmarking

 ⚙️Compilers
edn.com·

I built an open-source persistent memory layer for AI coding agents

 🛠️Developer Tools  Content type: Code

The Edge LLM Offload Story

 ⚙️Compilers
semiengineering.com·

Show HN: Ext-Infer

 🦀Rust

Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…

 🏗️Software Architecture  Content type: Blog
medium.com·

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

 🤖Coding Agents
techradar.com
·

The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring

 ⚙️Compilers  Content type: Academic
arxiv.org·

StereoTales: Multilingual Open-Ended Stereotype Discovery in LLMs

 λType Systems  Content type: Blog
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help