Transformers

Feeds to Scour
SubscribedAll
Scoured 237 posts in 7.5 ms

SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

 🔬ML Research  Content type: Academic
arxiv.org·

Deeper Dive: Untangling Tasks in a Toy Transformer

 🤗HuggingFace  Content type: Blog
chirilcalin.medium.com·

ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks

 🧠LLMs  Content type: Blog  Content type: Tutorial

Why Transformer Models Get Costlier as Context Grows

 🧠LLMs
siliconopera.com·

New comment by okl1m3k in "Ask HN: Who wants to be hired? (June 2026)"

 ⚙️DevOps  Content type: Reference
docs.google.com··Hacker News

The Sequence Knowledge #874: Transformers or Not?

 🤖AI Coding
substackcdn.com··Substack

Roommate Therapy with 'Sesame Street's Bert and Ernie

 🤗HuggingFace  Content type: Video
mashable.com·

Geometric Foundations of AI Interpretability

 🧠LLMs
psychologyinaction.org·

The Transformer Architecture: A Step-by-Step Guide

 🧠LLMs  Content type: Blog
Less-relevant results

Exploration of a DNA Sequencing Basecaller using Activation Patching

 🧠LLMs
lesswrong.com·

Machine learning from scratch, what to build before using scikit-learn

 🔬ML Research  Content type: Tutorial
iwtlp.com··DEV

Dr. Ashish Bamania (@drashishbamania)

 🔬ML Research
substack.com··Substack

Transformer-based coreference resolution modeling for Amharic text

 🔍RAG  Content type: Academic
nature.com·

Visual Artist and Percussionist Bob Bert (Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)

 🎵Vibe Coding
glidemagazine.com·

OCOO-T : A SIMPLE AND SCALABLE VIRTUAL CELL MODEL FOR TRANSCRIPTIONAL PERTURBATION RESPONSE PREDICTION

 🎵Vibe Coding  Content type: Academic
biorxiv.org·

microsoft/LLMLingua: [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

 🔗LangChain  Content type: Code
github.com··DEV

DiffusionGemma 26B A4B results on my 5090

 🤗Open Source AI

know the mother tongue of your LLMs

 🧠LLMs

How LLMs are Actually Trained

 🧠LLMs  Content type: News  Content type: Blog
blog.algomaster.io·

How Does Attention Work in LLMs? 2026 Deep Dive

 🧠LLMs  Content type: Blog
medium.com
·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help