🟢 NVIDIA - kudolink · Scour

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

🤗Open Source AI Blog

blogs.nvidia.com·

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

🧠LLMs Code

github.com··Hacker News

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

smolhub.com··r/LocalLLaMA

DiffusionGemma: 4x Faster Text Generation

🤗Open Source AI News Blog

blog.google··Hacker News, r/LocalLLaMA, r/singularity

AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis

🤗Hugging Face Academic

arxiv.org··Hacker News

Luce Spark: a 35B MoE on a 16 GB GPU, without the offload tax

🏠Local LLMs Blog

lucebox.com··Hacker News

Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent

🔬ML Research News

spectrum.ieee.org··Hacker News

Less-relevant results

Show HN: Monitoring Confidential Inference Providers

🔶Cloudflare Discussion

confidentialinference.net··Hacker News

🫧 AI Companies' Shared Destiny Recalls Dot-Com Bubble Memories

📈AI Industry Discussion

bullbear.ninja··Hacker News

Apple WWDC On-Device AI Deep Dive - Google Docs

gist.is··Hacker News

Upstart chipmakers keep challenging Nvidia. This time it's Microsoft-backed D-Matrix

💻Tech Industry News

cnbc.com··Hacker News

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

local-llm.utop.workers.dev··Hacker News

DiffusionGemma: The Developer Guide - Google Developers Blog

🧠LLMs Blog

developers.googleblog.com··r/LocalLLaMA

Google Will Pay SpaceX $920 Million Per Month For Compute - Slashdot

💻Tech Industry

hardware.slashdot.org·

Remove padding and multiple D2D copies for MTP by gaugarg-nv · Pull Request #24086 · ggml-org/llama.cpp

⚙️DevOps Code

github.com··r/LocalLLaMA

NVIDIA Confidential Computing to Help Expand Apple’s Private Cloud Compute

🎵Vibe Coding Blog

blogs.nvidia.com·

The Death of the App: Why Jensen Huang Just Blew Up the 40-Year-Old PC Bargain

🏗️Software Architecture Blog

·

Apple's New AI Models Contain 'None' of Google's Gemini Assistant

🤗Open Source AI News

macrumors.com··Hacker News

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

vettedconsumer.com··Hacker News

South Korean Online Communities Will Need to Scan Every Image with AI Censorship Tools

💻Tech Industry

discuss.privacyguides.net··Hacker News
