🧮 Tensor Cores - dane8036 · Scour

CoreML vs TFLite: iPhone 15 Pro GPU 2.3x Faster

📱Edge AI Blog Discussion

OpenCV 5 release - New DNN engine with enhanced ONNX and LLM/VLM support, Intel, Arm, and RISC-V hardware optimizations - CNX Software

🖼图像处理 News

cnx-software.com·

From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure

🔲AI,GPU IC, SOC IC Blog

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

🔲System on chip

vettedconsumer.com··Hacker News

Minimax M3 sm_120

🔓RISC-V Code

github.com··r/LocalLLaMA

SPEAR: A System for Post-Quantization Error-Adaptive Recovery Enabling Efficient Low-Bit LLM Serving

🔓RISC-V Academic

MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better

🔲AI,GPU IC, SOC IC News Blog

kaitchup.substack.com··r/LocalLLaMA

Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon

🔲AI,GPU IC, SOC IC

xda-developers.com·

SpenseGPT: Practical One-shot Pruning Enabling Sparse and Dense GEMMs for LLM Inference

📱Edge AI Academic

DiffusionGemma 26B A4B results on my 5090

🔲AI,GPU IC, SOC IC

huggingface.co··r/LocalLLaMA

What's in the Box? A Field Guide to AI Models

🔲AI,GPU IC, SOC IC Blog

iankduncan.com·

AMD Radeon RX 9070 GRE vs. Nvidia GeForce RTX 5070

🔲AI,GPU IC, SOC IC

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🔲System on chip Code

github.com··Hacker News, r/LLM

Unreleased RTX 3050 Ti engineering sample appears in photos and benchmarks — the RTX 3060 alternative that never happened

🔲AI,GPU IC, SOC IC News

tomshardware.com

·

Toward a Small ML Runtime Stack for Raspberry Pi 5 QPUs

🔲AI,GPU IC, SOC IC Academic

Build a local voice agent with Red Hat OpenShift AI

developers.redhat.com·

Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss

🧠NPU News

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

🔲System on chip

smolhub.com··r/LocalLLaMA

Edge AI deployment made easy for system integrators

Apple rebuilt its on-device AI stack at WWDC 2026

🔲System on chip Blog

ziraph.com··Hacker News

Sign up or log in to see more results

Log in to enable infinite scrolling