🧠 Transformers - saeedesmaili · Scour

markusheimerl/gpt: A generative pretrained transformer implementation

🧠LLMs Code

github.com··Hacker News

Reachability and asymptotics of Gaussian Transformer dynamics

💬Natural Language Processing Academic

know the mother tongue of your LLMs

mothertoken.inigoimaz.com··Hacker News

Less-relevant results

Apple WWDC On-Device AI Deep Dive - Google Docs

gist.is··Hacker News

How LLMs Actually Work: A Friendly Map for Humans • oreoro

💬Natural Language Processing

oreoro.github.io··Hacker News

GPT-2: Too Dangerous To Release (2019)

🎯Fine-tuning Blog

naokishibuya.github.io··Hacker News

How LLMs work | Practical Leaders

💬Natural Language Processing

practical-leaders.com··Hacker News

SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

🤖Machine Learning Academic

Introducing North Mini Code: Cohere’s First Model For Developers

🎯Fine-tuning Blog

huggingface.co··Hacker News

How we fight GPU scarcity without compromise

🧠LLM Inference Blog

equixly.com··Hacker News

SPADE: Split-and-Delay Embeddings for Autoregressive High-Granularity Calorimeter Simulation

💬Natural Language Processing Academic

mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]

🧠Neural Networks Code

github.com··Hacker News

Tokenminning: Because Tokenmaxxing Is a Bad Idea

🪟Context Windows

tokenminning.com··Hacker News

Markov Chains: The Grandparents of LLMs

💬Natural Language Processing

dmanco.dev··Hacker News

Kuramoto Attention: Synchronizing Self-Attention on the Torus

🔢Embeddings Academic

Introducing the Third Generation of Apple’s Foundation Models

machinelearning.apple.com··Hacker News, r/apple

RePAIR: Predictive Self-Supervised Representation Learning in Chess

🎮Reinforcement Learning Academic

princezuda/-RequiemGPT-: Fully open source and open weights built and trained by fable five with one prompt. An experience in how AI actually works

🧠Neural Networks Code

github.com··Hacker News

CRUMB: Efficient Prior Fitted Network Inference via Distributionally Matched Context Batching

💬Natural Language Processing Academic

Best-Known Sorting Networks

🕸️Graph Theory

bertdobbelaere.github.io··Hacker News

Log in to enable infinite scrolling