🤗 Transformers
Attention Mechanism, Self-Attention, BERT, Architecture
Scoured 6125 posts in 13.5 ms
Transformers
💬 LLMs · chizkidd.github.io · 2d · Hacker News
ml-intern
💬 LLMs · producthunt.com · 3d
The Recurrent Transformer: Greater Effective Depth and Efficient Decoding
LLM · arxiv.org · 1d
kyegomez/OpenMythos: A theoretical reconstruction of the Claude Mythos architecture, built from first principles using the available research literature.
LLM · github.com · 5d · Hacker News
Watch language models think.
💬 LLMs · openinterp.org · 1d · Hacker News
Transformer vs CNN-LSTM: CWRU Bearing 96% vs 92% Accuracy
LLM · tildalice.io · 4d
Neural Networks Explained In Plain English
Machine Learning · blog.algomaster.io · 4d
Working Memory Constraints Scaffold Learning in Transformers under Data Scarcity
LLM · arxiv.org · 2d
Variance Is Not Importance: Structural Analysis of Transformer Compressibility Across Model Scales
Optimization Theory · arxiv.org · 2d
Absorber LLM: Harnessing Causal Synchronization for Test-Time Training
💬 LLMs · arxiv.org · 1d
Nexusformer: Nonlinear Attention Expansion for Stable and Inheritable Transformer Scaling
LLM · arxiv.org · 3d
Hyperloop Transformers
💬 LLMs · arxiv.org · 1d
An explicit operator explains end-to-end computation in the modern neural networks used for sequence and language modeling
LLM · arxiv.org · 2d
The Topological Trouble With Transformers
💬 LLMs · arxiv.org · 4d
OThink-SRR1: Search, Refine and Reasoning with Reinforced Learning for Large Language Models
💬 LLMs · arxiv.org · 2d
Tracing Relational Knowledge Recall in Large Language Models
💬 LLMs · arxiv.org · 2d
SigGate-GT: Taming Over-Smoothing in Graph Transformers via Sigmoid-Gated Attention
LLM · arxiv.org · 4d
Stream-CQSA: Avoiding Out-of-Memory in Attention Computation via Flexible Workload Scheduling
LLM · arxiv.org · 2d
The Spectral Geometry of Thought: Phase Transitions, Instruction Reversal, Token-Level Dynamics, and Perfect Correctness Prediction in How Transformers Reason
💡 AI Reasoning · arxiv.org · 5d
MIRROR: A Hierarchical Benchmark for Metacognitive Calibration in Large Language Models
💬 LLMs · arxiv.org · 2d