Apple's LLM Breakthrough
xray.greyb.com · 4h
Discuss: Hacker News
🖥️ Hardware Architecture
Why is MiniMax M2 a Full Attention model?
reddit.com · 4h
Discuss: r/LocalLLaMA
🏗️ LLM Infrastructure
Nested Learning: How Your Neural Network Already Learns at Multiple Timescales
rewire.it · 23h
📊 Embeddings
Andrej Karpathy on LLM cognitive deficits
lesswrong.com · 20h
🏗️ LLM Infrastructure
IMDMR: An Intelligent Multi-Dimensional Memory Retrieval System for Enhanced Conversational AI
arxiv.org · 12h
✨ Gemini
Bandits in Your LLM Gateway
tensorzero.com · 1h
Discuss: Hacker News
🏆 LLM Benchmarking
Why Language Models Are “Lost in the Middle”
pub.towardsai.net · 19h
🪄 Prompt Engineering
Fast and Affordable LLMs serving on Intel Arc Pro B-Series GPUs with vLLM
blog.vllm.ai · 17h
🏗️ LLM Infrastructure
Study finds AI models store memories and logic in different neural regions
arstechnica.com · 18h
🛡️ AI Safety
baidu/ERNIE-4.5-VL-28B-A3B-Thinking released. Curious case..
huggingface.co · 11h
Discuss: r/LocalLLaMA
🏗️ LLM Infrastructure
Synth: The New Data Frontier
pleias.fr · 11h
Discuss: Hacker News
🏗️ LLM Infrastructure
AI Memory: Enabling The Next Era Of High-Performance Computing
semiengineering.com · 9h
💻 Chips
A software platform for real-time and adaptive neuroscience experiments
nature.com · 3h
AI Interpretability
GKE: From containers to agents, the unified platform for every modern workload
cloud.google.com · 5h
🏗️ LLM Infrastructure
MoM – Mixture of Model Service
github.com · 1h
Discuss: Hacker News
🦙 Ollama
LLM-Driven Robots Risk Enacting Discrimination, Violence, and Unlawful Actions
link.springer.com · 1h
Discuss: Hacker News
🛡️ AI Safety
Exploring RTEB, a New Benchmark To Evaluate Embedding Models
thenewstack.io · 23h
BGE Embeddings
AGRAG: Advanced Graph-based Retrieval-Augmented Generation for LLMs
arxiv.org · 12h
Information Retrieval
The Underwear Fixed Point
notes.hella.cheap · 23h
🎨 Chroma
I Read Sam Bhagwat's AI Agents Bible So You Don't Have to (But Probably Should)
kuber.studio · 1h
Discuss: Hacker News
🪄 Prompt Engineering