Disciplined Biconvex Programming
arxiv.org·3h
📉Model Quantization
Attention Is All You Need for KV Cache in Diffusion LLMs
paperium.net·4h·
Discuss: DEV
🎯Tensor Cores
Fast, Scalable LDA in C++ with Stochastic Variational Inference
github.com·17h·
Discuss: r/cpp
๐ŸŽ๏ธTensorRT
Enhanced spatial clustering of single-molecule localizations with graph neural networks
nature.com·1d
🔀Operator Fusion
Connectivity Structure and Dynamics of Nonlinear Recurrent Neural Networks
journals.aps.org·8h
📉Model Quantization
Enhanced Richardson Extrapolation via Adaptive Kernel Regression and Uncertainty Quantification
dev.to·18h·
Discuss: DEV
🔄ONNX
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·1d·
Discuss: Hacker News
🎯GPU Kernels
T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
arxiv.org·1d
🏎️TensorRT
H-FA: A Hybrid Floating-Point and Logarithmic Approach to Hardware Accelerated FlashAttention
arxiv.org·3h
⚡Flash Attention
Matrix Phylogeny: Compact Spectral Fingerprints for Trap-Robust Preconditioner Selection
arxiv.org·3h
🔀Operator Fusion
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.com·1d·
Discuss: Substack
📉Model Quantization
Relation-Aware Bayesian Optimization of DBMS Configurations Guided by Affinity Scores
arxiv.org·1d
⚡ONNX Runtime
Dynamic Model Selection for Trajectory Prediction via Pairwise Ranking and Meta-Features
arxiv.org·3h
🔀Operator Fusion
Explore More, Learn Better: Parallel MLLM Embeddings under Mutual Information Minimization
arxiv.org·3h
🛠️Ml-eng
How Transformer Models Detect Anomalies in System Logs
hackernoon.com·14h
📊Gradient Accumulation
News for October 2025
ptreview.sublinear.info·9h
🔄ONNX
Hybrid-Attention models are the future for SLMs
inference.net·6h·
Discuss: Hacker News
⚡Flash Attention
Transformer-Based Decoding in Concatenated Coding Schemes Under Synchronization Errors
arxiv.org·3h
⚡Flash Attention
Predicting Encoding Energy from Low-Pass Anchors for Green Video Streaming
arxiv.org·3h
🏎️TensorRT