Transformers

Feeds to Scour
SubscribedAll
Scoured 199 posts in 7.8 ms

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

 🤖LLMs
techradar.com
·
Less-relevant results

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

 🤖LLMs  Content type: Code
github.com··Hacker News

Guardian Angels: LLM Personalization for Productivity and Security

 🔧Developer Tools
gwern.net··Hacker News

The Edge LLM Offload Story

 🤖AI
semiengineering.com·

Towards Tight Bounds for Streaming Attention

 🧠Deep Learning  Content type: Academic
arxiv.org·

Hugging Face Transformers RCE flaw enables stealthy compromise via AI model configs

 🔄DevOps
csoonline.com·

What an LLM Actually Does With Your Prompt First

 🤖LLMs
siliconopera.com·

Introducing Granite Libraries and Project Granite Switch

 🤖LLMs  Content type: Blog

NVIDIA releases Nemotron 3 Ultra, claiming five times the speed and 30 percent lower costs than prior modelsThe model delivers 300 tokens per second on benchmar...

 📝NLP
digg.com·

Contribution Weights: A Geometrical Analysis of Self-Attention Transformers

 🤖LLMs  Content type: Academic
arxiv.org·

You’ve Been Using AI for Years. You Just Didn’t Call It That.

 🤖LLMs  Content type: Blog
medium.com·

Issue #390 - The ML Engineer 🤖

 🤖Machine Learning  Content type: News  Content type: Blog

RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.

 🤖LLMs  Content type: Code
github.com··Hacker News

DeepSeek V4, LeCun's Bet Against LLMs, and Lovable's Self-Improving Agent - The Tokenizer Edition #30

 🤖Machine Learning

Building Semantic Search with Transformers.js and Sentence Embeddings

 🤖AI

Chiaroscuro Attention: Spending Compute in the Dark

 📈Optimization  Content type: Academic
arxiv.org·

What Does Abliteration Actually Cost?

 📝NLP
lesswrong.com·

My research agenda and work

 🤖LLMs
lesswrong.com·

nex-agi/Nex-N2-mini • Huggingface

 📝NLP

BioMedGraphica: An All-in-One Platform for Joint Textual Biomedical Prior Knowledge and Numeric Graph Generation

 🗂️Data Structures
academic.oup.com
·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help