OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
arxiv.orgยท18h
๐Ÿ Self-hosted AI
2025-10-10 # LLMs Are Transpilers
alloc.devยท22hยท
Discuss: Hacker News
๐Ÿ Self-hosted AI
A small number of samples can poison LLMs of any size
dev.toยท19hยท
Discuss: DEV
๐Ÿ Self-hosted AI
LLM-Based AI Agent That Automates The Transistor Sizing Process (Univ. of Edinburgh)
semiengineering.comยท1h
๐Ÿง Neuromorphic Chips
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.aiยท1dยท
Discuss: Hacker News
๐Ÿ—๏ธAI Infrastructure
Three Solutions to Nondeterminism in AI
blog.hellas.aiยท2dยท
Discuss: Hacker News
๐Ÿ—๏ธAI Infrastructure
LoRA Explained: Faster, More Efficient Fine-Tuning with Docker
docker.comยท1d
๐Ÿ Self-hosted AI
VLLM Predicted Outputs
cascadetech.aiยท1hยท
Discuss: Hacker News
๐ŸŽ™๏ธWhisper
Hardware Vulnerability Allows Attackers to Hack AI Training Data โ€“ NC State News
news.ncsu.eduยท1hยท
Discuss: Hacker News
โšกHardware Acceleration
Analyzing Dialectical Biases in LLMs for Knowledge and Reasoning Benchmarks
machinelearning.apple.comยท1d
๐ŸŽ™๏ธWhisper
SPAD: Specialized Prefill and Decode Hardware for Disaggregated LLM Inference
arxiv.orgยท18hยท
Discuss: r/LLM
๐Ÿ—๏ธAI Infrastructure
LLM Optimization Notes: Memory, Compute and Inference Techniques
gaurigupta19.github.ioยท4dยท
Discuss: Hacker News
๐Ÿ—๏ธAI Infrastructure
The Hidden Oracle Inside Your AI: Unveiling Data Density with Latent Space Magic by Arvind Sundararajan
dev.toยท1dยท
Discuss: DEV
๐Ÿ“ฑEdge AI
Towards a Typology of Strange LLM Chains-of-Thought
lesswrong.comยท1d
๐Ÿ—๏ธAI Infrastructure
Neuro-Symbolic AI
en.wikipedia.orgยท7hยท
Discuss: Hacker News
๐Ÿง Neuromorphic Chips
Evaluating Gemini 2.5 Deep Think's math capabilities
epoch.aiยท8hยท
Discuss: Hacker News
๐Ÿ—๏ธAI Infrastructure
Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
arxiv.orgยท1d
๐Ÿ Self-hosted AI
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning
arxiviq.substack.comยท1dยท
Discuss: Substack
๐Ÿ—๏ธAI Infrastructure
Operationalizing Data Minimization for Privacy-Preserving LLM Prompting
arxiv.orgยท3d
๐Ÿ Self-hosted AI
Show HN: Nanowakeword โ€“ Automates custom wake word model training
github.comยท10hยท
Discuss: Hacker News
๐ŸŽ™๏ธWhisper