Transformers

markusheimerl/gpt: A generative pretrained transformer implementation

 👁️Attention Mechanisms  Content type: Code
github.com··Hacker News

ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks

 👁️Attention Mechanisms  Content type: Blog  Content type: Tutorial

SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

 🤖Machine Learning  Content type: Academic
arxiv.org·

The Transformer Architecture: A Step-by-Step Guide

 👁️Attention Mechanisms  Content type: Blog
m7mdelyoussef.medium.com·

LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem

 🤖Machine Learning  Content type: News
hackster.io·

Analyzing the geometric dependence of thermoelastic Q-factor in micro hemispherical resonators via a data-augmented CNN-transformer model

 🤖Machine Learning  Content type: Academic
nature.com·

Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens

 👁️Attention Mechanisms  Content type: Blog
medium.com·

The Sequence Knowledge #874: Transformers or Not?

 🤖Machine Learning
substackcdn.com··Substack

Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon

 🤖AI
xda-developers.com·

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

 👁️Attention Mechanisms
techradar.com·

Dr. Ashish Bamania (@drashishbamania)

 📐Linear Algebra
substack.com··Substack

know the mother tongue of your LLMs

 🤖AI

Google open-sources speedy DiffusionGemma text diffusion model

 🤖Machine Learning
siliconangle.com·

Attention Based Interpretability With Concept Transformer

 👁️Attention Mechanisms  Content type: Blog
medium.com·

Machine learning from scratch, what to build before using scikit-learn

 🤖Machine Learning  Content type: Tutorial
iwtlp.com··DEV

Less-relevant results

Google's new open-weights model brings image-generation tricks to AI text generation

 🤖AI  Content type: News
theregister.com·

How LLMs work | Practical Leaders

 🤖Machine Learning

Introducing North Mini Code: Cohere’s First Model For Developers

 🤖Machine Learning  Content type: Blog

Generative AI in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst

 🤖Machine Learning  Content type: Audio
oreilly.com·

Markov Chains: The Grandparents of LLMs

 👁️Attention Mechanisms
dmanco.dev··Hacker News
