🏎️ TensorRT - miterion · Scour

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy

developer.nvidia.com·23h

⚡ONNX Runtime

TreeTensor: Boost AI System on Nested Data with Constrained Tree-Like Tensor

arxiv.org·13h

🎯Tensor Cores

Tutorial – What is a variational autoencoder?

jaan.io·1d·

Discuss: Hacker News

📉Model Quantization

Quantized Tensor Train Compression For Turbulent Flow Simulation: O(log N) Scaling with Reynolds-Independent Bond Dimension

zenodo.org·1d·

Discuss: Hacker News

📉Model Quantization

Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization

machinelearning.apple.com·18h

⏱️CUDA Events

Writing a ONNX Neural Network Inference Engine from Scratch in C to run image classification with MobileNetV2

flexw.github.io·1d·

Discuss: r/C_Programming

⚡ONNX Runtime

Turning Any Model into an XAI-Ready Model: Formats and Gradient Flow

dev.to·6h·

Discuss: DEV

📜TorchScript

Quantization-Aware Distillation

ternarysearch.blogspot.com·2d·

Discuss: Hacker News

📉Model Quantization

Antirez Strikes Again: The Creator of Redis Builds a Bare-Metal Vision AI in Pure C — And It Actually Works

webpronews.com·4h

🎯Tensor Cores

LQA: A Lightweight Quantized-Adaptive Framework for Vision-Language Models on the Edge

arxiv.org·13h

📉Model Quantization

Faster AI Training Unlocked With New System For Massive Language Models

quantumzeitgeist.com·1d

🎯Tensor Cores

Deep transfer learning based on cross-domain subsequence alignment and feature contribution interpretation for remaining useful life prediction

sciencedirect.com·2h

A Time-Synchronized Multi-Sensor drone dataset acquired from multiple radars and RF receiver

nature.com·5h

🔗Kernel Fusion

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

paperium.net·4d·

Discuss: DEV

📊Gradient Accumulation

marketplace.visualstudio.com·4h

How2Everything: Mining the web to evaluate and improve LLMs on real-world procedures

allenai.org·1h·

Discuss: Hacker News

⏱️Benchmarking

Build Voice AI in Python: Complete Speech-to-Text Developer Guide (2026)

dev.to·3h·

Discuss: DEV

🤖AI Coding Tools

Trainy-ai/pluto: Next Generation Experimental Tracking for Machine Learning Operations

github.com·22h·

Discuss: Hacker News

How Anam Achieved 250% Faster Inference Using Zymtrace Continuous GPU Profiling

zymtrace.com·1d

MiRAGE: Open-source framework for multimodal RAG evaluation

news.ycombinator.com·2h·

Discuss: Hacker News

Loading more...