Gothic Scripts, Character Segmentation, Historical Text Recognition, Paleographic AI

The Best Ways to Digitize Your Notes
lifehacker.com·1d
📄Document Digitization
Show HN: 1M retail interior image dataset for computer vision (UK/US/EU)
groceryinsight.com·1d
🏺Compression Museums
Deep Learning Based Approach to Enhanced Recognition of Emotions and Behavioral Patterns of Autistic Children
arxiv.org·1d
🤖Advanced OCR
CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion
arxiv.org·2d
📊Learned Metrics
TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
arxiv.org·2d
🔨Compilers
The Hidden Bias: A Study on Explicit and Implicit Political Stereotypes in Large Language Models
arxiv.org·1d
🤖Grammar Induction
Addressing the ID-Matching Challenge in Long Video Captioning
arxiv.org·2d
📐Vector Similarity
LogSTOP: Temporal Scores over Prediction Sequences for Matching and Retrieval
arxiv.org·2d
📊Learned Metrics
I Built an AI Text Humanizer Tool That Makes Robotic Writing Sound 100% Human
dev.to·1d
🎙️Whisper
Machines in the Crowd? Measuring the Footprint of Machine-Generated Text on Reddit
arxiv.org·2d
🏛Digital humanities
ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping
arxiv.org·1d
🧮Kolmogorov Complexity
Homomorphism Problems in Graph Databases and Automatic Structures
arxiv.org·1d
🔗Graph Isomorphism
A Beginner's GAN Adventure with Digits
dev.to·4d
🤖Advanced OCR
Optimal Stopping in Latent Diffusion Models
arxiv.org·1d
🧠Machine Learning
Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
arxiv.org·1d
🕵️Vector Smuggling
Real-time Anomaly Detection in Financial Transactions via Hybrid Reinforcement Learning and Graph Neural Networks
dev.to·6h
🔍Vector Forensics
Enhancing Synthetic Data Generation via Adaptive Kernel Density Estimation with Bayesian Optimization
dev.to·2d
🧠Machine Learning
Unlock Deep Learning Stability: Navigate the Activation Function Galaxy with 9 Dimensions!
dev.to·2h
🧠Machine Learning