Opportunistically Parallel Lambda Calculus
💡LSP
AI efficiency advances with spintronic memory chip that combines storage and processing
techxplore.com·3d
⚡Flash Attention
Our newest model: Chandra (OCR)
🏎️TensorRT
How fast can an LLM go?
🏎️TensorRT
Accelerating AI inferencing with external KV Cache on Managed Lustre
cloud.google.com·1d
⚡ONNX Runtime
Machine Learning Fundamentals: Everything I Wish I Knew When I Started
🎓Model Distillation
M5 iPad Pro (Late 2025)
lowendmac.com·14h
🔍Nsight
Custom Intelligence: Building AI that matches your business DNA
aws.amazon.com·1d
🤖AI Coding Tools
VerfCNN, Optimal Complexity zkSNARK for Convolutional Neural Networks
eprint.iacr.org·2d
🧮cuDNN
Polish emerges as top language in multilingual AI benchmark testing
ppc.land·1h
🛠ML Engineering
Resource-Efficient and Robust Inference of Deep and Bayesian Neural Networks on Embedded and Analog Computing Platforms
arxiv.org·3d
🧮cuDNN
Fortytwo's decentralized AI has the answer to life, the universe, and everything
theregister.com·1h
⚡ONNX Runtime
TIL: For long-lived LLM sessions, swapping KV Cache to RAM is ~10x faster than recalculating it. Why isn't this a standard feature?
🔲Loop Tiling
ParallelMind Engine: First AI System with Parallel Logical Reasoning (202+ problems/sec)
🤖AI Coding Tools
Weak-To-Strong Generalization
lesswrong.com·7h
📉Model Quantization