🤖 Transformers - liqihui02 · Scour

Researchers say they trained a foundation model from scratch for about $1,500

🤖reinforcement learning, deep learning, machine learning

venturebeat.com · Hacker News

Automated doubt 🤔, open code review 📝, how LLMs really work 🔨

✍️Prompt Engineering

Apple WWDC On-Device AI Deep Dive - Google Docs

🤖reinforcement learning, deep learning, machine learning

gist.is · Hacker News

Weekly Bookmarks

The Next Industrial Revolution

🔤NLP Blog

hexmhell.writeas.com

Machine learning from scratch, what to build before using scikit-learn

🤖reinforcement learning, deep learning, machine learning Tutorial

iwtlp.com · DEV

Introducing North Mini Code: Cohere’s First Model For Developers

✍️Prompt Engineering Blog

huggingface.co · Hacker News

Boltzmann Attention: Learnable Ising Couplings for Cooperative Attention

📊Embeddings Academic

A deep learning framework for emotion recognition in music using multimodal data fusion

🤖reinforcement learning, deep learning, machine learning Academic

markusheimerl/gpt: A generative pretrained transformer implementation

🔤NLP Code

github.com · Hacker News

VelocityFM: Short-Horizon Protein Trajectory Prediction via Flow Matching in Velocity Space

🎮Q-Learning Academic

Mixture-of-Experts (MoE), Explained: Why “Active Parameters” Decide What Runs on Your Machine

🤖reinforcement learning, deep learning, machine learning

vettedconsumer.com · Hacker News

The Sequence Knowledge #874: Transformers or Not?

🤖reinforcement learning, deep learning, machine learning

substackcdn.com · Substack

Human-Like Neural Nets by Catapulting

🤖reinforcement learning, deep learning, machine learning

gwern.net · Hacker News

Tokenminning: Because Tokenmaxxing Is a Bad Idea

✍️Prompt Engineering

tokenminning.com · Hacker News

What the ocean taught me about AI.

🔤NLP Blog

Operator Fusion for LLM Inference on the Tensix Architecture

🔤NLP Academic

NVIDIA at Computex 2026: RTX Spark Gaming Hands-On, DLSS 4.5, and More

techpowerup.com

The Memory Problem is Solved: How Google’s Memory Caching Makes RNNs Smart Again

🤖reinforcement learning, deep learning, machine learning Blog

Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning

✍️Prompt Engineering Academic
