On the Relationship Between the Choice of Representation and In-Context Learning
arxiv.org·1d
🧠 Machine Learning
YouTube gets ~5% CTR lift on Shorts by replacing embedding tables with Semantic IDs
shaped.ai·1d
📊 Feed Optimization
A gentle introduction to Generative AI: Historical perspective
medium.com·4h·
Discuss: Hacker News
🧠 Learned Codecs
GNN Blind Spots: The Hidden Cost of Powerful Graph Models
dev.to·3h·
Discuss: DEV
🕸️ Graph Embeddings
Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention
huggingface.co·1d·
Discuss: Hacker News
🧠 Intelligence Compression
LogSTOP: Temporal Scores over Prediction Sequences for Matching and Retrieval
arxiv.org·2d
🧠 Learned Codecs
NExF: Learning Neural Exposure Fields for View Synthesis
m-niemeyer.github.io·22h·
Discuss: Hacker News
🧠 Neural Codecs
Doing Math with Embeddings for Better AI Ad Targeting
ethicalads.io·2d·
Discuss: Hacker News
📊 Feed Optimization
To Sink or Not to Sink: Visual Information Pathways in Large Vision-Language Models
arxiv.org·1d
🤖 Advanced OCR
Contrastive Weak-to-strong Generalization
arxiv.org·1d
⧗ Information Bottleneck
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.ai·1d·
Discuss: Hacker News
💻 Local LLMs
MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding
arxiv.org·1d
🧠 Learned Codecs
SliceFine: The Universal Winning-Slice Hypothesis for Pretrained Networks
arxiv.org·1d
🧠 Neural Codecs
CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning
arxiv.org·1d
🧮 Vector Embeddings
Optimal Stopping in Latent Diffusion Models
arxiv.org·1d
🧠 Machine Learning
In-Depth Analysis: "Attention Is All You Need"
dev.to·13h·
Discuss: DEV
🧠 Intelligence Compression
Show HN: 1M retail interior image dataset for computer vision (UK/US/EU)
groceryinsight.com·17h·
Discuss: Hacker News
🏺 Compression Museums
TransFIRA: Transfer Learning for Face Image Recognizability Assessment
arxiv.org·2d
🏛 Digital humanities
Gaze on the Prize: Shaping Visual Attention with Return-Guided Contrastive Learning
arxiv.org·1d
🧭 Content Discovery
Improving Temporal Understanding Logic Consistency in Video-Language Models via Attention Enhancement
arxiv.org·1d
⏱️ Interval Parsing