Transformer Architecture

Feeds to Scour
SubscribedAll
Scoured 163 posts in 8.4 ms

markusheimerl/gpt: A generative pretrained transformer implementation

 🔗RAG  Content type: Code
github.com··Hacker News

know the mother tongue of your LLMs

 🤖Local LLMs

PENet+: A Lightweight Residual Transformer Framework for Efficient Image Steganalysis

 🔗RAG  Content type: Academic
arxiv.org·

How Confident Are AI Classifiers About Their Own Confidence?

 💬Natural Language Processing  Content type: Blog

How LLMs Actually Work: A Friendly Map for Humans • oreoro

 💬Natural Language Processing

Visual Artist and Percussionist Bob Bert (Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)

 🎺Jazz
glidemagazine.com·

The Memory Problem is Solved: How Google’s Memory Caching Makes RNNs Smart Again

 🤖Machine Learning  Content type: Blog
medium.com·

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

 💬Natural Language Processing
techradar.com
·

Attention Based Interpretability With Concept Transformer

 🔍Vector Search  Content type: Blog
medium.com
·

Why LLMs hallucinate?

 🧠LLM Reasoning  Content type: Blog
medium.com
·

The Transformer, Demystified — Let's Actually Build One

 📝TextRank  Content type: News
mlwhiz.com
·

Post-training is (Massive) Supervised Learning

 🤖Machine Learning  Content type: Academic
arxiv.org·

Issue #390 - The ML Engineer 🤖

 💬Natural Language Processing  Content type: News  Content type: Blog

Guardian Angels: LLM Personalization for Productivity and Security

 🎭Anthropic Claude
gwern.net··Hacker News

MLPerf and the rise of latency-aware LLM benchmarking

 💬Natural Language Processing
edn.com·

What an LLM Actually Does With Your Prompt First

 💬Natural Language Processing
siliconopera.com·

Towards Tight Bounds for Streaming Attention

 🧮Algorithms  Content type: Academic
arxiv.org·

My research agenda and work

 🧩Cognitive Science
lesswrong.com·

Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model

 🧠Deep Learning  Content type: Academic
nature.com·

Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning

 🧠LLM Reasoning  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help