Everything About Transformers
krupadave.com·3d
✂️Tokenization
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization
paperium.net·2h
🔢Kolmogorov Complexity
An underqualified reading list about the transformer architecture
fvictorio.github.io·2d
🔢Kolmogorov Complexity
Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.com·14h
🔢Kolmogorov Complexity
A Minimal Route to Transformer Attention
neelsomaniblog.com·3d
🔢Kolmogorov Complexity
Algorithmic Olfactory Receptor Mimicry for Accelerated Anosmia Rehabilitation
dev.to·18h
🔁Spaced Repetition
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization
dev.to·2h
🔁Spaced Repetition
Specialized structure of neural population codes in parietal cortex outputs
nature.com·2d
🔗Mutual Information
The Kinetics of Reasoning: How Chain-of-Thought Shapes Learning in Transformers?
arxiv.org·2d
🔢Kolmogorov Complexity
Emergent introspective awareness in large language models
transformer-circuits.pub·2d
✂️Tokenization
Large reasoning models almost certainly can think
venturebeat.com·1d
🔢Kolmogorov Complexity
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
dev.to·11h
🔢Kolmogorov Complexity
“Existential Risk” – AI Is Evolving Faster than Our Understanding of Consciousness
scitechdaily.com·17h
🤖AI Ethics
🧠 Soft Architecture (Part B): Emotional Timers and the Code of Care (Part 5 of the SaijinOS series)
dev.to·1d
🤖AI Ethics
[D] Best (free) courses on neural networks
reddit.com·21h
🤖Machine Learning
Your Transformer is Secretly an EOT Solver
elonlit.com·2d
🔢Kolmogorov Complexity
Digesting AI Research: Day 3 — Transformer’s
pub.towardsai.net·6d
🔢Kolmogorov Complexity
The Cargo Cult in the Machine: Why LLMs Are the Ultimate Imitators
steviee.medium.com·35m
🔢Kolmogorov Complexity
Machine Learning Fundamentals: Everything I Wish I Knew When I Started
dev.to·6h
🤖Machine Learning