Transformers

Feeds to Scour
SubscribedAll
Scoured 206 posts in 10.1 ms

markusheimerl/gpt: A generative pretrained transformer implementation

 💬LLMs  Content type: Code
github.com··Hacker News

ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks

 🧠AI Research  Content type: Blog  Content type: Tutorial

SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

 🧠AI Research  Content type: Academic
arxiv.org·

The Transformer Architecture: A Step-by-Step Guide

 🧠AI Research  Content type: Blog
m7mdelyoussef.medium.com·

The Sequence Knowledge #874: Transformers or Not?

 🔍Interpretability
substackcdn.com··Substack

Attention Based Interpretability With Concept Transformer

 🧠AI Research  Content type: Blog
medium.com
·

Dr. Ashish Bamania (@drashishbamania)

 🧠AI Research
substack.com··Substack

Machine learning from scratch, what to build before using scikit-learn

 📉Deep Learning  Content type: Tutorial
iwtlp.com··DEV

Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens

 💬LLMs  Content type: Blog
medium.com
·

Why LLMs hallucinate?

 💬LLMs  Content type: Blog
medium.com
·

Visual Artist and Percussionist Bob Bert (Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)

 🤖AI Agents
glidemagazine.com·

know the mother tongue of your LLMs

 💬LLMs

The Transformer, Demystified — Let's Actually Build One

 📉Deep Learning  Content type: News
mlwhiz.com
·
Less-relevant results

Don't let the LLM speak, just probe it (8 minute read)

 💬LLMs  Content type: Blog
blog.j11y.io·

The 400-Millisecond History: How Seventy Years of AI Built Modern Advertising

 🧠AI Research
iabtechlab.com·

Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model

 🧠AI Research  Content type: Academic
nature.com·

I Built a Collection of 100+ Free Developer Tools That Run Entirely in the Browser

 📄arXiv
solutiontoolkit.com··DEV

Pathetic pretense

 📄arXiv  Content type: Blog
freethoughtblogs.com·

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

 💬LLMs
techradar.com
·

PT-WNO: Point Transformer with Wavelet Neural Operator for 3D Point Cloud Semantic Segmentation

 📐Scaling Laws  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help