T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
arxiv.orgยท20h
๐Ÿ”Grad-CAM
Flag this post
Show HN: I built an edge ML system to detect and classify trick-or-treaters
basecase.vcยท7hยท
Discuss: Hacker News
๐Ÿ‘Computer vision
Flag this post
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.comยท21hยท
Discuss: r/LLM
๐Ÿง OpenAI
Flag this post
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning
paperium.netยท1dยท
Discuss: DEV
๐Ÿ”Grad-CAM
Flag this post
Computers Are Getting Much Better at Image Recognition
smithsonianmag.comยท9h
๐Ÿ‘Computer vision
Flag this post
Donโ€™t Just Normalize, Batch Normalize! A Guide to Stable Neural Networks
pub.towardsai.netยท18h
๐Ÿ”Grad-CAM
Flag this post
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for LargeVision-and-Language Models
dev.toยท1dยท
Discuss: DEV
๐Ÿง OpenAI
Flag this post
Connectivity Structure and Dynamics of Nonlinear Recurrent Neural Networks
journals.aps.orgยท1h
๐Ÿ”Grad-CAM
Flag this post
Fast, Scalable LDA in C++ with Stochastic Variational Inference
github.comยท10hยท
Discuss: r/cpp
๐Ÿค–Machine learning
Flag this post
How Transformer Models Detect Anomalies in System Logs
hackernoon.comยท7h
๐Ÿ”ฌscikit-learn
Flag this post
Hybrid channel attention network for auditory attention detection
nature.comยท1d
๐Ÿ”Grad-CAM
Flag this post
LinEAS: End-to-end Learning of Activation Steering with a Distributional Loss
machinelearning.apple.comยท1d
๐Ÿ”Grad-CAM
Flag this post
Masked Softmax Layers in PyTorch
mcognetta.github.ioยท9hยท
Discuss: Hacker News
๐Ÿค–Machine learning
Flag this post
[R] We were wrong about SNNs. The bo.ttleneck isn't binary/sparsity, it's frequency.
reddit.comยท15hยท
๐Ÿ”ฅPyTorch
Flag this post
What Are Auto-regressive Models? A Deep Dive and Typical Use Cases
blog.pangeanic.comยท12h
๐Ÿง OpenAI
Flag this post
Multi-Representation Attention Framework for Underwater Bioacoustic Denoising and Recognition
arxiv.orgยท20h
๐Ÿ”Grad-CAM
Flag this post
FOCUS: Efficient Keyframe Selection for Long Video Understanding
arxiv.orgยท20h
๐Ÿ”Grad-CAM
Flag this post
Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.comยท2dยท
Discuss: Substack
๐Ÿง OpenAI
Flag this post
How We Built a Custom Vision LLM to Improve Document Processing at Grab
engineering.grab.comยท1h
๐Ÿง OpenAI
Flag this post
VISAT: Benchmarking Adversarial and Distribution Shift Robustness in Traffic Sign Recognition with Visual Attributes
arxiv.orgยท20h
๐Ÿ”Grad-CAM
Flag this post