Transformer Architecture

markusheimerl/gpt: A generative pretrained transformer implementation

 🧠Neural Network Architectures  Content type: Code
github.com · Hacker News

Reachability and asymptotics of Gaussian Transformer dynamics

 🧠Deep Learning  Content type: Academic
arxiv.org

Machine learning from scratch, what to build before using scikit-learn

 🧠Neural Network Architectures  Content type: Tutorial
iwtlp.com · DEV

Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens

 👁️Attention Mechanisms  Content type: Blog
medium.com

ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks

 👁️Attention Mechanisms  Content type: Blog, Tutorial

Analyzing the geometric dependence of thermoelastic Q-factor in micro hemispherical resonators via a data-augmented CNN-transformer model

 🧠Deep Learning  Content type: Academic
nature.com

The Sequence Knowledge #874: Transformers or Not?

 🧠Deep Learning

Less-relevant results

Multimodal Browser AI with Transformers.js for Images and Speech

 🧠Deep Learning

know the mother tongue of your LLMs

 🔮ML

How LLMs Actually Work: A Friendly Map for Humans • oreoro

 👁️Attention Mechanisms

Visual Artist and Percussionist Bob Bert (Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)

 🚀Model Deployment
glidemagazine.com

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

 🧠Neural Network Architectures
techradar.com

The Memory Problem is Solved: How Google’s Memory Caching Makes RNNs Smart Again

 🧠Neural Network Architectures  Content type: Blog
medium.com

Adventurer becomes first British woman to cross Atlantic by hydrogen balloon

 🔮ML  Content type: News
the-independent.com

Pathetic pretense

 🎲Synthetic Data Generation  Content type: Blog

Researchers say they trained a foundation model from scratch for about $1,500

 📈Time Series Forecasting
venturebeat.com

The Edge LLM Offload Story

 🧠Neural Network Architectures
semiengineering.com

What shapes your power bill? Explainable AI outlines forecasts behind grid and price decisions

 📈Time Series Forecasting
techxplore.com

Context windows in AI: why every token is a budget decision

 🧠Deep Learning  Content type: Blog
redis.io

We Taught a Model to Speak Legalese. Here’s What Changed.

 🔮ML  Content type: Blog
medium.com
