Transformers

markusheimerl/gpt: A generative pretrained transformer implementation

 👁️Attention Mechanisms  Content type: Code
github.com··Hacker News

ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks

 👁️Attention Mechanisms  Content type: Blog  Content type: Tutorial

SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

 🤖Machine Learning  Content type: Academic
arxiv.org·

The Transformer Architecture: A Step-by-Step Guide

 👁️Attention Mechanisms  Content type: Blog
m7mdelyoussef.medium.com·

LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem

 🤖Machine Learning  Content type: News
hackster.io·

Analyzing the geometric dependence of thermoelastic Q-factor in micro hemispherical resonators via a data-augmented CNN-transformer model

 🤖Machine Learning  Content type: Academic
nature.com·

Dr. Ashish Bamania (@drashishbamania)

 📐Linear Algebra
substack.com··Substack

Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens

 👁️Attention Mechanisms  Content type: Blog
medium.com·

The Sequence Knowledge #874: Transformers or Not?

 🤖Machine Learning
substackcdn.com··Substack

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

 👁️Attention Mechanisms
techradar.com·

Google open-sources speedy DiffusionGemma text diffusion model

 🤖Machine Learning
siliconangle.com·

know the mother tongue of your LLMs

 🤖AI

Attention Based Interpretability With Concept Transformer

 👁️Attention Mechanisms  Content type: Blog
medium.com·

Machine learning from scratch, what to build before using scikit-learn

 🤖Machine Learning  Content type: Tutorial
iwtlp.com··DEV

Less-relevant results

Don't let the LLM speak, just probe it (8 minute read)

 🤖AI  Content type: Blog
blog.j11y.io·

How LLMs work | Practical Leaders

 🤖Machine Learning

Introducing North Mini Code: Cohere’s First Model For Developers

 🤖Machine Learning  Content type: Blog

Deep Learning Meets GPR: Exploring Transformer Models for Precision Soil Depth Prediction

 🤖Machine Learning  Content type: Academic
sciencedirect.com·

Markov Chains: The Grandparents of LLMs

 👁️Attention Mechanisms
dmanco.dev··Hacker News

Malicious Hugging Face Models Could Trigger Remote Code Execution

 🤖Machine Learning
techrepublic.com·
