OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
arxiv.org·19h
🧠LLM Inference
Open Vision Agents by Stream. Build Vision Agents with any model/ video provider.
github.com·13h·
Discuss: r/programming
🤖AI
Custom AI models in hours not months with auto Data Synth and LLM-as-a-Judge
blog.oumi.ai·23h·
Discuss: Hacker News
🆕New AI
NExF: Learning Neural Exposure Fields for View Synthesis
m-niemeyer.github.io·16h·
Discuss: Hacker News
🏗️LLM Infrastructure
IREN: A 10x Growth Story Following A Textbook Pivot From Bitcoin Mining To AI Cloud
seekingalpha.com·13h
🚀Startups
Thousands of AI Authors on the Future of AI
arxiv.org·19h
🛡️AI Safety
OpenAI's inflated valuation, as I understand it
taloranderson.com·7h·
Discuss: Hacker News
🏆LLM Benchmarking
Nvidia stock gets a price target hike from one analyst as another says AI applications are just getting started
qz.com·8h
🖥GPUs
Neural Networks from Scratch in Python: Simpler Than You Think
hamza.se·3h·
Discuss: Hacker News
📊Vector Databases
AI Guardrails, Gateways, Governance Nightmares
go.mcptotal.io·16h·
Discuss: Hacker News
🛡️AI Security
Harness CEO Jyoti Bansal on Why AI Coding Doesn’t Help You Ship Faster
thenewstack.io·3h
🆕New AI
Physics-informed AI excels at large-scale discovery of new materials
phys.org·8h
🏆LLM Benchmarking
HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation
arxiv.org·19h
🏗️LLM Infrastructure
How different AI engines generate and cite answers
searchengineland.com·11h
📊Feed Optimization
(Forward) automatic implicit differentiation in Rust with num-dual 0.12.0
reddit.com·8h·
Discuss: r/rust
🎭Rust Macros
Vibe-Coding vs. AI-Assisted Development
adaptivealchemist.com·11h·
Discuss: Hacker News
🆕New AI
The DINOv3 Playbook for Computer Vision Data Science
pub.towardsai.net·10h
📊Vector Databases
GPT-5 for AI-assisted discovery
johndcook.com·8h·
Discuss: Hacker News
🏗️LLM Infrastructure
2025 State of AI Report and Predictions
thezvi.substack.com·6h·
Discuss: Substack
🛡️AI Safety