👁️ Attention Mechanisms - tomasz · Scour

Everything About Transformers

krupadave.com·3d

✂️Tokenization

Flag this post

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm EnablesFine-Grained Policy Optimization

paperium.net·2h·

Discuss: DEV

🔢Kolmogorov Complexity

Flag this post

An underqualified reading list about the transformer architecture

fvictorio.github.io·2d·

Discuss: Hacker News

🔢Kolmogorov Complexity

Flag this post

Kimi Linear: An Expressive, Efficient Attention Architecture

arxiviq.substack.com·14h·

Discuss: Substack

🔢Kolmogorov Complexity

Flag this post

Everything About Transformers

krupadave.com·3d·

Discuss: Hacker News, Hacker News

✂️Tokenization

Flag this post

A Minimal Route to Transformer Attention

neelsomaniblog.com·3d·

Discuss: Hacker News

🔢Kolmogorov Complexity

Flag this post

Algorithmic Olfactory Receptor Mimicry for Accelerated Anosmia Rehabilitation

dev.to·18h·

Discuss: DEV

🔁Spaced Repetition

Flag this post

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm EnablesFine-Grained Policy Optimization

dev.to·2h·

Discuss: DEV

🔁Spaced Repetition

Flag this post

Specialized structure of neural population codes in parietal cortex outputs

nature.com·2d

🔗Mutual Information

Flag this post

The Kinetics of Reasoning: How Chain-of-Thought Shapes Learning in Transformers?

arxiv.org·2d

🔢Kolmogorov Complexity

Flag this post

Emergent introspective awareness in large language models

transformer-circuits.pub·2d·

Discuss: Hacker News

✂️Tokenization

Flag this post

Large reasoning models almost certainly can think

venturebeat.com·1d

🔢Kolmogorov Complexity

Flag this post

ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for LargeVision-and-Language Models

dev.to·11h·

Discuss: DEV

🔢Kolmogorov Complexity

Flag this post

“Existential Risk” – AI Is Evolving Faster than Our Understanding of Consciousness

scitechdaily.com·17h

Flag this post

🧠 Soft Architecture (Part B): Emotional Timers and the Code of Care (Part 5 of the SaijinOS series)

dev.to·1d·

Discuss: DEV

Flag this post

[D] Best (free) courses on neural networks

reddit.com·21h·

Discuss: r/MachineLearning

🤖Machine Learning

Flag this post

Your Transformer is Secretly an EOT Solver

elonlit.com·2d·

Discuss: Hacker News

🔢Kolmogorov Complexity

Flag this post

Digesting AI Research: Day 3 — Transformer’s

pub.towardsai.net·6d

🔢Kolmogorov Complexity

Flag this post

The Cargo Cult in the Machine: Why LLMs Are the Ultimate Imitators

steviee.medium.com·35m·

Discuss: Hacker News

🔢Kolmogorov Complexity

Flag this post

Machine Learning Fundamentals: Everything I Wish I Knew When I Started

dev.to·6h·

Discuss: DEV

🤖Machine Learning

Flag this post

Loading more...