🗄️ Web Datasets - emschwartz

🤖AI Academic

arxiv.org··Hacker News

NVIDIA releases Nemotron 3 Ultra, claiming five times the speed and 30 percent lower costs than prior modelsThe model delivers 300 tokens per second on benchmar...

⚡Fast AI Inference

digg.com·

Tejas-TA/predikit: The missing bridge between your ML models and your AI agents.

🔧Agent Tooling Code

github.com··Hacker News

LangChain Series #2: Models Explained — LLMs, Chat Models, and Embeddings with Practical…

📊Embeddings

pub.towardsai.net

nex-agi/Nex-N2-mini • Huggingface

🏗️LLM Infrastructure

huggingface.co··r/LocalLLaMA

Google’s DiffusionGemma is 4x faster than its other Gemma models

🤖AI

thenewstack.io·

My life as a human pincushion continues (Day 17, post-surgery)

🎆Year End

creolened.com·

Stack Overflow didn't just help AI learn to code

🤖AI

zozo123.github.io··Hacker News

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

🔗Interoperability Blog

huggingface.co··Hacker News, r/LocalLLaMA

Less-relevant results

Testing MiniMax M3 on real tasks: repo refactor, screenshot debugging, and Spotify recommendations

🆕New AI Blog

andlukyane.com··Hacker News

Enshittification Merch That Actually Fights Enshittification

🎨Graphic Design

eff.org·

How I stay connected (Bear Blog Carnival)

🧘Digital Minimalism Blog

hung.bearblog.dev·

Purpose-built local AI agents

🤖AI Blog

samihonkonen.com··Hacker News

Notice from SASAC and MIIT on jointly launching the 2026 Special Action Plan for Real-Scene Training of Humanoid Robots and Embodied Intelligence

🇨🇳China Tech Policy

threadreaderapp.com·

Job Searcher

🤖AI Blog

huggingface.co·

SafeRun: Enabling Determinism in LLM Planning for Running

🏆LLM Benchmarking Academic

arxiv.org·

Publishers push Common Crawl to stop collecting content for AI training

US publishers tell Common Crawl to stop scraping and delete archive

Pythia 1.4B reproduces 3.6% of training samples verbatim given 950-token prompts

Common Crawl Foundation at IIPC-WAC 2026

AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis

NVIDIA releases Nemotron 3 Ultra, claiming five times the speed and 30 percent lower costs than prior modelsThe model delivers 300 tokens per second on benchmar...

Tejas-TA/predikit: The missing bridge between your ML models and your AI agents.

LangChain Series #2: Models Explained — LLMs, Chat Models, and Embeddings with Practical…

nex-agi/Nex-N2-mini • Huggingface

Google’s DiffusionGemma is 4x faster than its other Gemma models

My life as a human pincushion continues (Day 17, post-surgery)

Stack Overflow didn't just help AI learn to code

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

Testing MiniMax M3 on real tasks: repo refactor, screenshot debugging, and Spotify recommendations

Enshittification Merch That Actually Fights Enshittification

How I stay connected (Bear Blog Carnival)

Purpose-built local AI agents

Notice from SASAC and MIIT on jointly launching the 2026 Special Action Plan for Real-Scene Training of Humanoid Robots and Embodied Intelligence

Job Searcher

SafeRun: Enabling Determinism in LLM Planning for Running