Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.com·16h·
Discuss: Substack
🧩Attention Kernels
Flag this post
Machine Learning Fundamentals: Everything I Wish I Knew When I Started
dev.to·8h·
Discuss: DEV
🎓Model Distillation
Flag this post
Yes, you should understand backprop (2016)
karpathy.medium.com·10h·
Discuss: Hacker News
📉Model Quantization
Flag this post
Learning to program "recycles" preexisting F-P pop codes of logical algorithms
jneurosci.org·50m·
Discuss: Hacker News
Flash Attention
Flag this post
Gradient GPS: Turbocharge Your Diffusion Models with Targeted Tuning
dev.to·1d·
Discuss: DEV
🏎️TensorRT
Flag this post
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for LargeVision-and-Language Models
paperium.net·13h·
Discuss: DEV
🏎️TensorRT
Flag this post
The middle brother in classifier development: What is RandAugment?
openaccess.thecvf.com·3h·
Discuss: DEV
👁️Attention Optimization
Flag this post
Bayesian continual learning and forgetting in neural networks
nature.com·3d
👁️Attention Optimization
Flag this post
Adaptive Stemming via Graph-Augmented Recurrent Variational Autoencoders
dev.to·9h·
Discuss: DEV
🏎️TensorRT
Flag this post
[D] Best (free) courses on neural networks
reddit.com·23h·
👁️Attention Optimization
Flag this post
When Five Dumb AIs Beat One Smart AI: The Case for Multi-Agent Systems
ksramalakshmi.medium.com·4h·
Discuss: r/LocalLLaMA
🔄ONNX
Flag this post
From hours to seconds: AI tools to detect animal calls
seangoedecke.com·8h·
Discuss: Hacker News
📉Model Quantization
Flag this post
Counteracting Matthew Effect in Self-Improvement of LVLMs through Head-Tail Re-balancing
arxiv.org·2d
👁️Attention Optimization
Flag this post
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning (Paper Review)
pub.towardsai.net·8h
🎓Model Distillation
Flag this post
Hybrid Neuro-Symbolic Reasoning for Adaptive Robotics Control in Dynamic Environments
dev.to·7h·
Discuss: DEV
ONNX Runtime
Flag this post
Your Transformer is Secretly an EOT Solver
elonlit.com·2d·
Discuss: Hacker News
👁️Attention Optimization
Flag this post
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm EnablesFine-Grained Policy Optimization
paperium.net·4h·
Discuss: DEV
🧩Attention Kernels
Flag this post
MobileNetV3 Paper Walkthrough: The Tiny Giant Getting Even Smarter
towardsdatascience.com·2h
🎯Tensor Cores
Flag this post
Surrey Uni show AI systems based on the human brain's save energy
epsomandewelltimes.com·2h·
Discuss: Hacker News
🎯Tensor Cores
Flag this post