Tensor Cores

CoreML vs TFLite: iPhone 15 Pro GPU 2.3x Faster

 📱Edge AI  Content type: Blog  Content type: Discussion
tildalice.io·

OpenCV 5 release - New DNN engine with enhanced ONNX and LLM/VLM support, Intel, Arm, and RISC-V hardware optimizations - CNX Software

 🖼Image processing  Content type: News
cnx-software.com·

From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure

 🔲AI,GPU IC, SOC IC  Content type: Blog
jimmysong.io·

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

 🔲System on chip

Minimax M3 sm_120

 🔓RISC-V  Content type: Code
github.com··r/LocalLLaMA

SPEAR: A System for Post-Quantization Error-Adaptive Recovery Enabling Efficient Low-Bit LLM Serving

 🔓RISC-V  Content type: Academic
arxiv.org·

MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better

 🔲AI,GPU IC, SOC IC  Content type: News  Content type: Blog

Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon

 🔲AI,GPU IC, SOC IC
xda-developers.com·

SpenseGPT: Practical One-shot Pruning Enabling Sparse and Dense GEMMs for LLM Inference

 📱Edge AI  Content type: Academic
arxiv.org·

DiffusionGemma 26B A4B results on my 5090

 🔲AI,GPU IC, SOC IC

What's in the Box? A Field Guide to AI Models

 🔲AI,GPU IC, SOC IC  Content type: Blog
iankduncan.com·

AMD Radeon RX 9070 GRE vs. Nvidia GeForce RTX 5070

 🔲AI,GPU IC, SOC IC
club386.com·

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🔲System on chip  Content type: Code
github.com··Hacker News, r/LLM

Unreleased RTX 3050 Ti engineering sample appears in photos and benchmarks — the RTX 3060 alternative that never happened

 🔲AI,GPU IC, SOC IC  Content type: News
tomshardware.com·

Toward a Small ML Runtime Stack for Raspberry Pi 5 QPUs

 🔲AI,GPU IC, SOC IC  Content type: Academic
arxiv.org·

Build a local voice agent with Red Hat OpenShift AI

 📱Edge AI
developers.redhat.com·

Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss

 🧠NPU  Content type: News
digg.com·

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 🔲System on chip
smolhub.com··r/LocalLLaMA

Edge AI deployment made easy for system integrators

 📱Edge AI
edn.com·

Apple rebuilt its on-device AI stack at WWDC 2026

 🔲System on chip  Content type: Blog
ziraph.com··Hacker News