🧠 BF16 - miterion · Scour

Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)

sebastianraschka.com·16h·

Discuss: r/LLM

👁️Attention Optimization

Flag this post

Enhanced Richardson Extrapolation via Adaptive Kernel Regression and Uncertainty Quantification

dev.to·6h·

Discuss: DEV

Flag this post

Heart rate response and recovery during exercise and dementia risk: a prospective UK biobank study

nature.com·20h

📊Gradient Accumulation

Flag this post

Don’t Just Normalize, Batch Normalize! A Guide to Stable Neural Networks

pub.towardsai.net·13h

📊Gradient Accumulation

Flag this post

Masked Softmax Layers in PyTorch

mcognetta.github.io·4h·

Discuss: Hacker News

Flag this post

The Evolution of GPUs: How Floating-Point Changed Computing

dell.com·1d·

Discuss: Hacker News

🎯Tensor Cores

Flag this post

Uncertainty-weighted with gradient-based to re-weight domain generalization for remaining useful life prediction of rotating machinery under unseen conditions

sciencedirect.com·1d

⏱️Benchmarking

Flag this post

Spiking Neural Networks: The Future of Brain-Inspired Computing

arxiv.org·15h

⚡Flash Attention

Flag this post

Weak-To-Strong Generalization

lesswrong.com·1d

📉Model Quantization

Flag this post

Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It

hackernoon.com·14h

Flag this post

Yes, you should understand backprop (2016)

karpathy.medium.com·1d·

Discuss: Hacker News

📊Gradient Accumulation

Flag this post

A groundbreaking brain map could revolutionize Parkinson’s treatment

sciencedaily.com·4h

👁️Attention Optimization

Flag this post

A Practitioner's Guide to Kolmogorov-Arnold Networks

arxiviq.substack.com·1d·

Discuss: Substack

📉Model Quantization

Flag this post

Geonum – geometric number library for unlimited dimensions with O(1) complexity

github.com·5h·

Discuss: Hacker News

Flag this post

FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model

paperium.net·19h·

Discuss: DEV

🧩Attention Kernels

Flag this post

AI-Powered Development and GitHub Copilot Integration

edstellar.com·11h·

Discuss: DEV

🤖AI Coding Tools

Flag this post

<p>**Abstract:** Accurate characterization of geothermal fluids and subsurface reservoirs is critical for efficient and sustainable energy extraction. Tradition...

freederia.com·1d

Flag this post

Thought-to-Text: AI Bridges the Silent Divide

dev.to·6h·

Discuss: DEV

⚡Flash Attention

Flag this post

NLD: Skillhunt Mix-7 Gen 2 Plus. So much fun!

reddit.com·16h·

Discuss: r/flashlight

⚡Flash Attention

Flag this post

Can-t stop till you get enough

cant.bearblog.dev·1d·

Discuss: Hacker News

📜TorchScript

Flag this post

Loading more...