Sparse Attention MoE - a test repo for a novel swappable attention mechanism
github.com·8h·
Discuss: r/LocalLLaMA
🧠Deep Learning
Flag this post
GPTF-8: A tokenizer-based character encoding
lesswrong.com·13h
💬Natural Language Processing
Flag this post
Continuous Autoregressive Language Models : Alternate for traditional LLMs, paper by Tencent
reddit.com·1d·
Discuss: r/LocalLLaMA
💬Natural Language Processing
Flag this post
ANNEXE: Unified Analyzing, Answering, and Pixel Grounding for Egocentric Interaction - A Blog
habib.bearblog.dev·9h
💬Natural Language Processing
Flag this post
Leaving PyTorch and Meta
soumith.ch·1d·
📓Jupyter Notebooks
Flag this post
Minimizing Loss ≠ Maximizing Intelligence
lesswrong.com·17h
🤖Machine Learning
Flag this post
13 Arguments About a Transition to Neuralese AIs
lesswrong.com·5h
📓Jupyter Notebooks
Flag this post
Feature Stores 2.0: The Next Frontier of Scalable Data Engineering for AI
hackernoon.com·2d
🎯Recommender Systems
Flag this post
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.com·4d·
🧠Deep Learning
Flag this post
Show HN: Linguistic RL – A 7B model discovers Occam's Razor through reflection
github.com·6h·
🤖Machine Learning
Flag this post
Agents Work. Sort Of
blog.boringworkflows.ai·12h
🎯Recommender Systems
Flag this post
Benchmarking the Most Reliable Document Parsing API
tensorlake.ai·1d·
Discuss: Hacker News
💬Natural Language Processing
Flag this post
Sweep (YC S23) is hiring to build autocomplete for JetBrains
ycombinator.com·9h·
Discuss: Hacker News
📓Jupyter Notebooks
Flag this post
fran the man (film, 2025)
mighil.com·1d
🧮Vector Databases
Flag this post
New Haven Robotics 001
antoneking.bearblog.dev·1d
💬Natural Language Processing
Flag this post
Just give me the prompt
tacitexposure.bearblog.dev·20h
💬Natural Language Processing
Flag this post
An ARENA 6.0 Capstone: Model Organism of Encoded Reasoning
lesswrong.com·1d
🧮Vector Databases
Flag this post
You Should Write An Agent
fly.io·1d·
🐍Programming
Flag this post
AI Safety at the Frontier: Paper Highlights of October 2025
lesswrong.com·2d
🤖Machine Learning
Flag this post
Show HN: TabPFN-2.5 – SOTA foundation model for tabular data
priorlabs.ai·1d·
Discuss: Hacker News
🧮Vector Databases
Flag this post