Everything About Transformers
krupadave.com·4d
📡Information Theory
Flag this post
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm EnablesFine-Grained Policy Optimization
🎲Bayesian Cognition
Flag this post
An underqualified reading list about the transformer architecture
🎨Computational Creativity
Flag this post
Un-Attributability: Computing Novelty From Retrieval & Semantic Similarity
arxiv.org·5h
📝NLP
Flag this post
Spiking Neural Networks: The Future of Brain-Inspired Computing
arxiv.org·5h
🎨Computational Creativity
Flag this post
Unlock Autonomy: Next-Gen LLMs Learn to Decode Themselves by Arvind Sundararajan
💬Philosophy of Language
Flag this post
Don’t Just Normalize, Batch Normalize! A Guide to Stable Neural Networks
pub.towardsai.net·3h
🤖AI
Flag this post
Everything About Transformers
📡Information Theory
Flag this post
[D] Best (free) courses on neural networks
📝NLP
Flag this post
Identifying the Periodicity of Information in Natural Language
arxiv.org·5h
📝NLP
Flag this post
The Kinetics of Reasoning: How Chain-of-Thought Shapes Learning in Transformers?
arxiv.org·3d
🧮Information theory
Flag this post
The Cargo Cult in the Machine: Why LLMs Are the Ultimate Imitators
💬Philosophy of Language
Flag this post
Loading...Loading more...