Document Processing, Neural OCR, Multilingual Archives, Computational Philology

Welcome to LIL’s Data.gov Archive Search
lil.law.harvard.edu·17h
💾Data Preservation
The Health Effects of Electromagnetic Radiation
huijzer.xyz·2h·
📄PostScript
AI receptionist that answers real phone calls
news.ycombinator.com·19h·
Discuss: Hacker News
🎙️Whisper
GPT-5 for AI-assisted discovery
johndcook.com·23h·
Discuss: Hacker News
🎯Performance Proofs
🚀 From Rejection to Reinvention: How I Built an AI That Finds My Jobs
dev.to·9h·
Discuss: DEV
🇨🇳Chinese Computing
CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning
arxiv.org·1d
🧮Vector Embeddings
Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations
arxiv.org·3d
🏛Digital humanities
MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding
arxiv.org·1d
🧠Learned Codecs
Evaluating Small Vision-Language Models on Distance-Dependent Traffic Perception
arxiv.org·1d
🧠Machine Learning
VideoNorms: Benchmarking Cultural Awareness of Video Language Models
arxiv.org·1d
📊Learned Metrics
Krish Naik: Complete RAG Crash Course With Langchain In 2 Hours
dev.to·1d·
Discuss: DEV
📊Multi-vector RAG
Homomorphism Problems in Graph Databases and Automatic Structures
arxiv.org·1d
🔗Graph Isomorphism
Vibe-Coding vs. AI-Assisted Development
adaptivealchemist.com·1d·
Discuss: Hacker News
Incremental Computation
GPT Translator vs Google Translate: Which One Understands You Better?
dev.to·4d·
Discuss: DEV
🇨🇳Chinese Computing
Stress-Testing Model Specs Reveals Character Differences among Language Models
arxiv.org·1d
📋Document Grammar
LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?
arxiv.org·1d
🔗Parser Combinators
Best Japanese to English Document Translation Software
dev.to·3d·
Discuss: DEV
🇯🇵Japanese Computing
Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data
arxiv.org·2d
🧠Learned Indexes
What to Look For in Image Annotation Services Today
dev.to·1d·
Discuss: DEV
🤖AI Curation