🏎️ TensorRT - miterion · Scour

Inference-Time Rethinking with Latent Thought Vectors for Math Reasoning

arxiv.org·18h

🎓Model Distillation

DAVE: Distribution-aware Attribution via ViT Gradient Decomposition

arxiv.org·18h

📊Gradient Accumulation

Introducing Composer 1.5

cursor.com·2h·

Discuss: Hacker News

🤖AI Coding Tools

OrthoRay – A native, lightweight DICOM viewer written in Rust/wgpu by a surgeon

news.ycombinator.com·4h·

Discuss: Hacker News

From print to digital: Making weekly flyers shoppable at Instacart through computer vision and LLMs

tech.instacart.com·2h

🧩Attention Kernels

LocalGPT: A local AI assistant with persistent memory in a single binary

localgpt.app·4h·

Discuss: Hacker News

⚡ONNX Runtime

Main Content || Math ∩ Programming

jeremykun.com·1d

📉Model Quantization

Increasing the Speed of Offline Raspberry Pi AI Chatbot #raspberrypi

blog.adafruit.com·2h

📊Gradient Accumulation

RoomKit, Pipecat, TEN Framework, LiveKit Agents: Choosing the Right Conversational AI Framework

dev.to·8h·

Discuss: DEV

🤖AI Coding Tools

Hypernetworks: Neural Networks for Hierarchical Data

blog.sturdystatistics.com·4d·

Discuss: Hacker News

📊Gradient Accumulation

A Chinese Traditional Opera Video Super-Resolution Dataset Based on the “Real-world+” Degradation Fusion

nature.com·2d

qrafty-ai/teleop_xr: Transforms your VR/AR headset into a powerful, precise robot controller

github.com·1d·

Discuss: Hacker News

🤖AI Coding Tools

Prompt injection in Google Translate reveals base model behaviors behind task-specific fine-tuning

lesswrong.com·2d·

Discuss: Hacker News

🤖AI Coding Tools

Stanford AI Breakthrough: Unlock ChatGPT Creativity

medium.com

·17h

🤖AI Coding Tools

ggml: backend-agnostic tensor parallelism by JohannesGaessler · Pull Request #19378

github.com·4d·

Discuss: r/LocalLLaMA

🎯Tensor Cores

Turn Claude From a Chatbot Into a Thinking Partner 🧠

linas.substack.com

·13h·

Discuss: Substack

🤖AI Coding Tools

How Netflix, Uber, and Google Build AI Systems: Architecture Deep Dive

dev.to·18h·

Discuss: DEV

🤖AI Coding Tools

AI-augmented data quality engineering

infoworld.com·13h

🤖AI Coding Tools

VSORA Board Chair Sandra Rivera on Solutions for AI Inference and LLM Processing

semiwiki.com·9h

⚡ONNX Runtime

Inside Mesa 26.0's RADV RT improvements

pixelcluster.github.io·45m·

Discuss: r/linux_gaming

Loading more...