Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
Deep Learning
Just give me the prompt
tacitexposure.bearblog.dev · 20h
Transformers
Show HN: Linguistic RL - A 7B model discovers Occam's Razor through reflection
Transformers
The Infrastructure of Modern Ranking Systems, Part 3: The MLOps Backbone - From Training to Deployment
shaped.ai · 4d
Machine Learning
Vibing Negative
theblackwall.uk · 1d
Programming
New Haven Robotics 001
antoneking.bearblog.dev · 1d
Transformers
are LLMs automatically good at poetry yet?
jenn.site · 21h
AI
Week #4: Read Beyond Words
rootmap.bearblog.dev · 13h
Data Science
r/mathematics
Data Science
How Transformer Models Detect Anomalies in System Logs
hackernoon.com · 4d
Retrieval Systems
Fake media seems to be a fact of life now
lesswrong.com · 1d
Machine Learning
A country of alien idiots in a datacenter: AI progress and public alarm
lesswrong.com · 5h
Machine Learning
fran the man (film, 2025)
mighil.com · 1d
Transformers
My resume!
cant.bearblog.dev · 1d
Vector Databases
Science, Power and the Myth of Neutrality
autismanswersback.bearblog.dev · 5h
Data Science
A memo on Takeoff
lesswrong.com · 1d
Jupyter Notebooks