Sparse Attention MoE - a test repo for a novel swappable attention mechanism
🧠Deep Learning
Flag this post
GPTF-8: A tokenizer-based character encoding
lesswrong.com·13h
💬Natural Language Processing
Flag this post
Continuous Autoregressive Language Models : Alternate for traditional LLMs, paper by Tencent
💬Natural Language Processing
Flag this post
ANNEXE: Unified Analyzing, Answering, and Pixel Grounding for Egocentric Interaction - A Blog
habib.bearblog.dev·9h
💬Natural Language Processing
Flag this post
Leaving PyTorch and Meta
📓Jupyter Notebooks
Flag this post
Minimizing Loss ≠ Maximizing Intelligence
lesswrong.com·17h
🤖Machine Learning
Flag this post
13 Arguments About a Transition to Neuralese AIs
lesswrong.com·5h
📓Jupyter Notebooks
Flag this post
Feature Stores 2.0: The Next Frontier of Scalable Data Engineering for AI
hackernoon.com·2d
🎯Recommender Systems
Flag this post
Show HN: Linguistic RL – A 7B model discovers Occam's Razor through reflection
🤖Machine Learning
Flag this post
Agents Work. Sort Of
blog.boringworkflows.ai·12h
🎯Recommender Systems
Flag this post
fran the man (film, 2025)
mighil.com·1d
🧮Vector Databases
Flag this post
New Haven Robotics 001
antoneking.bearblog.dev·1d
💬Natural Language Processing
Flag this post
Just give me the prompt
tacitexposure.bearblog.dev·20h
💬Natural Language Processing
Flag this post
An ARENA 6.0 Capstone: Model Organism of Encoded Reasoning
lesswrong.com·1d
🧮Vector Databases
Flag this post
You Should Write An Agent
🐍Programming
Flag this post
AI Safety at the Frontier: Paper Highlights of October 2025
lesswrong.com·2d
🤖Machine Learning
Flag this post
Loading...Loading more...