🖥 GPUs
GPU Pricing, Serverless GPU Hosting, Cloud AI Model Deployment
Scoured 24873 posts in 107.4 ms
Taming GPU Underutilization via Static Partitioning and Fine-grained CPU Offloading
⚙️ Mechanical Sympathy · arxiv.org · 15h
AI Training Goes Off the Grid With Solar Homes and Spare GPUs
🌐 Distributed systems · spectrum.ieee.org · 3d · Hacker News
Integrate Physical AI Capabilities into Existing Apps with NVIDIA Omniverse Libraries
✨ Gemini · developer.nvidia.com · 2d
A developer’s guide to architecting reliable GPU infrastructure at scale
🏗️ LLM Infrastructure · cloud.google.com · 21h
Nvidia can't have every data center workload
🤖 AI · runtime.news · 20h
Nvidia Pascal GPUs debuted 10 years ago today, best known for the GTX 1060 and GTX 1080 Ti — architecture kicked off with the Tesla P100
🤖 AI · tomshardware.com · 5d
CUDA Programming for NVIDIA H100s
⚡ Hardware Acceleration · freecodecamp.org · 20h
How Nvidia learned to embrace the light in its quest for scale
📊 Model Serving Economics · theregister.com · 5d · Hacker News
janit/viiwork: LLM inference load balancer optimized for AMD Radeon VII GPUs
🏗️ LLM Infrastructure · github.com · 5d · Hacker News
Fine-Grained Power and Energy Attribution on AMD GPU/APU-Based Exascale Nodes
⚡ Hardware Acceleration · arxiv.org · 2d
What is an AI Native Cloud?
🆕 New AI · together.ai · 3d
pmady/keda-gpu-scaler: KEDA External gRPC Scaler for GPU workloads — native NVML metrics via DaemonSet, no Prometheus required
🏗️ LLM Infrastructure · github.com · 3d
Foundry: Template-Based CUDA Graph Context Materialization for Fast LLM Serving Cold Start
🏗️ LLM Infrastructure · arxiv.org · 1d
GTaP: A GPU-Resident Fork-Join Task-Parallel Runtime with a Pragma-Based Interface
⚡ Hardware Acceleration · arxiv.org · 2d
ai-infos/vllm-gfx906-mobydick: A high-throughput and memory-efficient inference and serving engine for LLMs - Optimized for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
🏗️ LLM Infrastructure · github.com · 4d · r/LocalLLaMA
Wattlytics: A Web Platform for Co-Optimizing Performance, Energy, and TCO in HPC Clusters
⚡ Systems Performance · arxiv.org · 15h
Running AI Workloads on Rack-Scale Supercomputers: From Hardware to Topology-Aware Scheduling
🏗️ LLM Infrastructure · developer.nvidia.com · 3d
Blink: CPU-Free LLM Inference by Delegating the Serving Stack to GPU and SmartNIC
🏗️ LLM Infrastructure · arxiv.org · 15h
Minos: Systematically Classifying Performance and Power Characteristics of GPU Workloads on HPC Clusters
⚡ Systems Performance · arxiv.org · 3d
Characterizing WebGPU Dispatch Overhead for LLM Inference Across Four GPU Vendors, Three Backends, and Three Browsers
🏗️ LLM Infrastructure · arxiv.org · 4d · Hacker News