Transformers

markusheimerl/gpt: A generative pretrained transformer implementation

 📝Natural Language Processing  Content type: Code
github.com · Hacker News

know the mother tongue of your LLMs

 🤖LLM

SPADE: Split-and-Delay Embeddings for Autoregressive High-Granularity Calorimeter Simulation

 📝Natural Language Processing  Content type: Academic
arxiv.org

Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens

 🤖LLM  Content type: Blog
medium.com

The Sequence Knowledge #874: Transformers or Not?

 ⛓️LangChain
substackcdn.com · Substack

Transformer-based coreference resolution modeling for Amharic text

 📝Natural Language Processing  Content type: Academic
nature.com

How we fight GPU scarcity without compromise

 📝Natural Language Processing  Content type: Blog
equixly.com · Hacker News

OCOO-T : A SIMPLE AND SCALABLE VIRTUAL CELL MODEL FOR TRANSCRIPTIONAL PERTURBATION RESPONSE PREDICTION

 🗄️Vector Databases  Content type: Academic
biorxiv.org

ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks

 🤖AI  Content type: Blog, Tutorial

The Transformer, Demystified — Let's Actually Build One

 🤖AI  Content type: News
mlwhiz.com

Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon

 📝Natural Language Processing
xda-developers.com

Markov Chains: The Grandparents of LLMs

 📝Natural Language Processing
dmanco.dev · Hacker News

Visual Artist and Percussionist Bob Bert (Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)

 🚀MLOps
glidemagazine.com

Less-relevant results

DiffusionGemma: Discrete diffusion in a large language model

 🤖LLM

Guardian Angels: LLM Personalization for Productivity and Security

 ⛓️LangChain
gwern.net · Hacker News

Machine learning from scratch, what to build before using scikit-learn

 🧠Machine Learning  Content type: Tutorial
iwtlp.com · DEV

Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit

 🤖AI

Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit

 ⚙️Model Fine-tuning
venturebeat.com

The Memory Problem is Solved: How Google’s Memory Caching Makes RNNs Smart Again

 📈Model Evaluation  Content type: Blog
medium.com

Google open-sources speedy DiffusionGemma text diffusion model

 🤖AI
siliconangle.com
