👁️ Vision Transformers - upchuck5372 · Scour

Understanding and Optimizing Attention-Based Sparse Matching for Diverse Local Features

arxiv.org·2d

👁Computer vision

VersaViT: Enhancing MLLM Vision Backbones via Task-Guided Optimization

arxiv.org·1d

HoWDe: A validated algorithm for Home and Work location Detection

sciencedirect.com·3h

🔬scikit-learn

Visual microphone based on computational imaging

opg.optica.org·1d

👁Computer vision

Link-checking with generative AI

natemeyvis.com·19h

CoWTracker: Tracking by Warping instead of Correlation

cowtracker.github.io·2d

A Large-Scale In-the-wild Dataset for Plant Disease Segmentation

nature.com·3d

👁Computer vision

Finding cancer cells in a cocktail of complex tissues

sciworthy.com·48m

🔬scikit-learn

blog.engora.com·17h·Discuss: Hacker News

🤖Machine learning

Deterministic Inference with EigenAI

deterministicinference.com·18h

The feature space for drifting models

breno.bearblog.dev·1d

Meet Youtu-VL-4B: Tencent’s Tiny Model That Does Segmentation, Depth, and VQA

hackernoon.com·12h

EyesOff: Why Some Models Quantize Better Than Others

ym2132.github.io·13h·Discuss: Hacker News

🔬scikit-learn

GRAIL Text Recognizer

jackschaedler.github.io·11h

Deepseek v4lite: just how much GPT5 corpus has it ingested?

linux.do·7h

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

paperium.net·2d·Discuss: DEV

Using Generative AI tooling with Clojure

dev.solita.fi·1d

Building a Robust Classifier with Stacked Generalization

dev.to·1d·Discuss: DEV

🤖Machine learning

AnkitNayak-eth/EpsteinFiles-RAG: A RAG pipeline implementation built on the 'Epstein Files 20K' dataset from Hugging Face (Teyler).

github.com·1d·Discuss: r/LocalLLaMA

Tutorial – What is a variational autoencoder?

jaan.io·2d·Discuss: Hacker News