Sparse Attention MoE - a test repo for a novel swappable attention mechanism
🤖Transformers
Flag this post
Minimizing Loss ≠ Maximizing Intelligence
lesswrong.com·10h
🤖Machine Learning
Flag this post
3 RTX 3090 graphics cards in a computer for inference and neural network training
🤖Machine Learning
Flag this post
ANNEXE: Unified Analyzing, Answering, and Pixel Grounding for Egocentric Interaction - A Blog
habib.bearblog.dev·2h
💬Natural Language Processing
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·4d
🧮Vector Databases
Flag this post
My Hands-On Review of Kimi K2 Thinking: The Open-Source AI That's Changing the Game
🧮Vector Databases
Flag this post
Supercharging the ML and AI Development Experience at Netflix
netflixtechblog.com·2d
🔄Concurrency
Flag this post
Leaving PyTorch and Meta
📓Jupyter Notebooks
Flag this post
Continuous Autoregressive Language Models : Alternate for traditional LLMs, paper by Tencent
💬Natural Language Processing
Flag this post
The Infrastructure of Modern Ranking Systems, Part 3: The MLOps Backbone - From Training to Deployment
shaped.ai·4d
🤖Machine Learning
Flag this post
I made a complete tutorial on fine-tuning Qwen2.5 (1.5B) on a free Colab T4 GPU. Accuracy boosted from 91% to 98% in ~20 mins!
📓Jupyter Notebooks
Flag this post
fran the man (film, 2025)
mighil.com·21h
🤖Transformers
Flag this post
The state of SIMD in Rust in 2025
🧮Vector Databases
Flag this post
Agents Work. Sort Of
blog.boringworkflows.ai·5h
🎯Recommender Systems
Flag this post
My resume!
cant.bearblog.dev·19h
🧮Vector Databases
Flag this post
Loading...Loading more...