Solving a problem with mindware
lesswrong.comΒ·21h
⚑Flash Attention
Flag this post
Low-Level Hacks
blog.raycursive.comΒ·10hΒ·
Discuss: Hacker News
πŸ“ŠProfiling Tools
Flag this post
Don’t Just Normalize, Batch Normalize! A Guide to Stable Neural Networks
pub.towardsai.netΒ·1d
πŸ“ŠGradient Accumulation
Flag this post
Predicting & Mitigating Data Corruption in Pure Storage Flash Arrays via Adaptive Bit Error Rate Modeling
dev.toΒ·2hΒ·
Discuss: DEV
⏱️Benchmarking
Flag this post
From Uniform to Adaptive: General Skip-Block Mechanisms for Efficient PDE Neural Operators
arxiv.orgΒ·8h
🏎️TensorRT
Flag this post
Coverage Analysis and Optimization of FIRES-Assisted NOMA and OMA Systems
arxiv.orgΒ·8h
⚑Flash Attention
Flag this post
Predicting Encoding Energy from Low-Pass Anchors for Green Video Streaming
arxiv.orgΒ·8h
πŸ”—Kernel Fusion
Flag this post
Towards Automated Petrography
arxiv.orgΒ·8h
πŸ“‰Model Quantization
Flag this post
Real-Time Vibrational Spectroscopy with AI-Driven Spectral Deconvolution for On-Site Material Identification
dev.toΒ·17hΒ·
Discuss: DEV
⏱️Benchmarking
Flag this post
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.comΒ·1dΒ·
Discuss: r/LLM
πŸ‘οΈAttention Optimization
Flag this post
Efficiency vs. Alignment: Investigating Safety and Fairness Risks in Parameter-Efficient Fine-Tuning of LLMs
arxiv.orgΒ·8h
πŸ”„ONNX
Flag this post
Transforming Quality Control with Vision-Guided Inspection Systems
dev.toΒ·7hΒ·
Discuss: DEV
πŸ”Nsight
Flag this post
Bayesian Natural Gradient Fine-Tuning of CLIP Models via Kalman Filtering
arxiv.orgΒ·8h
πŸ“ŠGradient Accumulation
Flag this post
GeneFlow: Translation of Single-cell Gene Expression to Histopathological Images via Rectified Flow
arxiv.orgΒ·8h
πŸ”„ONNX
Flag this post
Few-Shot Multimodal Medical Imaging: A Theoretical Framework
arxiv.orgΒ·8h
🏎️TensorRT
Flag this post