🤖 Transformers - jyunzhang · Scour

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

·

Less-relevant results

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

🤖LLMs Code

github.com··Hacker News

Guardian Angels: LLM Personalization for Productivity and Security

🔧Developer Tools

gwern.net··Hacker News

The Edge LLM Offload Story

semiengineering.com·

Towards Tight Bounds for Streaming Attention

🧠Deep Learning Academic

Hugging Face Transformers RCE flaw enables stealthy compromise via AI model configs

csoonline.com·

What an LLM Actually Does With Your Prompt First

siliconopera.com·

Introducing Granite Libraries and Project Granite Switch

🤖LLMs Blog

research.ibm.com··Hacker News

NVIDIA releases Nemotron 3 Ultra, claiming five times the speed and 30 percent lower costs than prior modelsThe model delivers 300 tokens per second on benchmar...

Contribution Weights: A Geometrical Analysis of Self-Attention Transformers

🤖LLMs Academic

You’ve Been Using AI for Years. You Just Didn’t Call It That.

🤖LLMs Blog

Issue #390 - The ML Engineer 🤖

🤖Machine Learning News Blog

machinelearning.substack.com··Substack

RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.

🤖LLMs Code

github.com··Hacker News

DeepSeek V4, LeCun's Bet Against LLMs, and Lovable's Self-Improving Agent - The Tokenizer Edition #30

🤖Machine Learning

newsletter.artofsaience.com·

Building Semantic Search with Transformers.js and Sentence Embeddings

machinelearningmastery.com·

Chiaroscuro Attention: Spending Compute in the Dark

📈Optimization Academic

What Does Abliteration Actually Cost?

lesswrong.com·

My research agenda and work

lesswrong.com·

nex-agi/Nex-N2-mini • Huggingface

huggingface.co··r/LocalLLaMA

BioMedGraphica: An All-in-One Platform for Joint Textual Biomedical Prior Knowledge and Numeric Graph Generation

🗂️Data Structures

academic.oup.com

·

Log in to enable infinite scrolling