📉 Model Quantization - miterion · Scour

Regularized Calibration with Successive Rounding for Post-Training Quantization

arxiv.org·4d

🏎️TensorRT

LQA: A Lightweight Quantized-Adaptive Framework for Vision-Language Models on the Edge

arxiv.org·12h

🏎️TensorRT

Cleaning Up Complexity: Preprocessing Attribution Maps for Better Evaluation

dev.to·5h·

Discuss: DEV

👁️Attention Optimization

MiRAGE: Open-source framework for multimodal RAG evaluation

news.ycombinator.com·1h·

Discuss: Hacker News

Manufacturing QMS Software

samrian.com·1d·

Discuss: Hacker News

⏱️Benchmarking

the mathematics of compression in database systems

bitsxpages.com·21h

📈Occupancy Optimization

From Pixels to Precision

dev.to·5h·

Discuss: DEV

⚡Flash Attention

Gated Attention & DeltaNets: The Missing Link for Long-Context AI

pub.towardsai.net

·11h

👁️Attention Optimization

A Note on Flat Abstract Syntax Trees

gist.github.com·22h·

Discuss: Hacker News

🔬Static Analysis

Geometrically Allocated Ads in AI Conversations

june.kim·14h·

Discuss: Hacker News

🧩Attention Kernels

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy

developer.nvidia.com·22h

🏎️TensorRT

Scale LLM fine-tuning with Hugging Face and Amazon SageMaker AI

aws.amazon.com·1d

🎓Model Distillation

marketplace.visualstudio.com·3h

Sense8 WorldToolKit Demo v1.01 : Sense8 : Free Download, Borrow, and Streaming

archive.org·18h

🏎️TensorRT

A Time-Synchronized Multi-Sensor drone dataset acquired from multiple radars and RF receiver

nature.com·4h

🔗Kernel Fusion

Drifting models

breno.bearblog.dev·1d

🎓Model Distillation

Show HN: Model Training Memory Simulator

czheo.github.io·2d·

Discuss: Hacker News

📊Gradient Accumulation

Handwriting vs AI: Real Performance of AI on Handwritten Documents

hackernoon.com·21m

📊Gradient Accumulation

Your VCL App: 4x to 11x Faster Math Performance with Elements

blogs.remobjects.com·1d·

Discuss: Hacker News

AI-augmented data quality engineering

infoworld.com·1d

🤖AI Coding Tools

Loading more...