⚡ Model Efficiency - jimman · Scour

Masked Softmax Layers in PyTorch

mcognetta.github.io·15h·

Discuss: Hacker News

⚡LLM Optimization

Flag this post

The Evolution of GPUs: How Floating-Point Changed Computing

dell.com·1d·

Discuss: Hacker News

Flag this post

On the Structure of Floating-Point Noise in Batch-Invariant GPU Matrix Multiplication

arxiv.org·2h

⚡LLM Optimization

Flag this post

Accelerated Dielectric Barrier Coating Optimization via Multi-Modal Data Fusion & Bayesian Hyperparameter Tuning

dev.to·11h·

Discuss: DEV

⚡LLM Optimization

Flag this post

Tackling the Kidnapped Robot Problem via Sparse Feasible Hypothesis Sampling and Reliable Batched Multi-Stage Inference

arxiv.org·2h

⚡LLM Optimization

Flag this post

Neural Transparency: Mechanistic Interpretability Interfaces for Anticipating Model Behaviors for Personalized AI

arxiv.org·2h

🔍AI Interpretability

Flag this post

X-TRACK: Physics-Aware xLSTM for Realistic Vehicle Trajectory Prediction

arxiv.org·2h

⚡LLM Optimization

Flag this post

LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation

arxiv.org·2h

⚡LLM Optimization

Flag this post

From Uniform to Adaptive: General Skip-Block Mechanisms for Efficient PDE Neural Operators

arxiv.org·2h

✍️Prompt Engineering

Flag this post

Optimizing Native Sparse Attention with Latent Attention and Local Global Alternating Strategies

arxiv.org·2h

⚡LLM Optimization

Flag this post

Region-Aware Reconstruction Strategy for Pre-training fMRI Foundation Model

arxiv.org·2h

⚡LLM Optimization

Flag this post

Geonum – geometric number library for unlimited dimensions with O(1) complexity

github.com·17h·

Discuss: Hacker News

⚡LLM Optimization

Flag this post

Scalable In-Memory Associative Processing for Graph Neural Network Inference

dev.to·1d·

Discuss: DEV

⚡LLM Optimization

Flag this post

AI Inference: The Silent Budget Killer (and How to Stop It)

dev.to·2d·

Discuss: DEV

⚡LLM Optimization

Flag this post

A Soft‑Fork Proposal for Blockchain‑Based Distributed AI Computation

hackernoon.com·20h

⚡LLM Optimization

Flag this post

Disciplined Biconvex Programming

arxiv.org·2h

⚡LLM Optimization

Flag this post

GPU Pro – Master Your AI Workflow

github.com·1d·

Discuss: Hacker News, Hacker News, Hacker News

🛠️Developer Tools

Flag this post

ClipTagger-12B VLM: Frame Captioning Tutorial

dev.to·1d·

Discuss: DEV

⚡LLM Optimization

Flag this post

CoT-Saliency: Unified Chain-of-Thought Reasoning for Heterogeneous Saliency Tasks

arxiv.org·2h

✍️Prompt Engineering

Flag this post

Advanced 3D IC Heterogeneous Integration Analysis via Bayesian Optimization and AI-Driven Defect Mapping

dev.to·3d·

Discuss: DEV

🔍AI Interpretability

Flag this post

Loading more...