AI


I built an open-source persistent memory layer for AI coding agents

 ⚙️Zig  Content type: Code

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

 🎯Escape Analysis

Energy-Efficient On-Device RAG on a Mobile NPU: System Design and Benchmark on Snapdragon X Elite

 🔍RAG  Content type: Academic
arxiv.org·

Alignment Defends LLMs from Property Inference Attacks

 🎯Escape Analysis  Content type: Academic
arxiv.org·

Benchmarking Large Language Models for Safety Data Extraction

 🎯Escape Analysis  Content type: Academic
arxiv.org·

The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring

 🔍RAG  Content type: Academic
arxiv.org·

vla.cpp: A Unified Inference Runtime for Vision-Language-Action Models

 🤖Machine Learning  Content type: Academic
arxiv.org·

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖Machine Learning  Content type: Code
github.com · Hacker News, r/LLM

LLM-Based Code Documentation Generation and Multi-Judge Evaluation

 🔨Compiler Design  Content type: Academic
arxiv.org·

fix(gateway): fail closed for unknown model auth · openclaw/openclaw@85343ea

 low-level programming  Content type: Code
github.com·

BUDDY: BUdget-Driven DYnamic Depth Routing for Adaptive Large Language Model Inference

 👁️Attention Mechanisms  Content type: Academic
arxiv.org·

LLM-as-a-Discriminator: When Synthetic Tables Still Look Real

 🔍RAG  Content type: Academic
arxiv.org·

mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]

 🤖Machine Learning  Content type: Code
github.com · Hacker News

A retrieval conditioned rebinding circuit for dynamic entity tracking in large language models

 🔍RAG  Content type: Academic
arxiv.org·

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

 🤖Reinforcement Learning  Content type: Academic
arxiv.org·

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

 🌟Ray Tracing  Content type: Code
github.com · Hacker News

Automatic Extraction of Structured Information from Brain MRI Reports Using an Open-Weight Large Language Model

 🤖Transformers  Content type: Academic
arxiv.org·

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 👁️Attention Mechanisms  Content type: Code
github.com · r/LocalLLaMA

ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models

 👁️Attention Mechanisms  Content type: Academic
arxiv.org·

From Rigid to Dynamic: Entropy-Guided Adaptive Inference for Long-Context LLMs

 👁️Attention Mechanisms  Content type: Academic
arxiv.org·
