SciDaSynth: Interactive Structured Data Extraction from Sci Literature with LLM
onlinelibrary.wiley.comΒ·2hΒ·
Discuss: Hacker News
πŸ”Information Retrieval
Flag this post
Essential Chunking Techniques for Building Better LLM Applications
machinelearningmastery.comΒ·2d
πŸ“„Text Chunking
Flag this post
Adaptive Contrastive Learning via Dynamic Feature Masking for Fine-Grained Attribute Recognition
dev.toΒ·1hΒ·
Discuss: DEV
πŸ“ŠLearned Metrics
Flag this post
Segmenting Ancient Chinese-Japanese Texts for HTR (from the RDDS Blog)
uniqueatpenn.wordpress.comΒ·1d
πŸ‘οΈDocument OCR
Flag this post
From data to corpus: semiotic and documentary issues in audiovisual archives
arxiv.orgΒ·2d
πŸ”Archive Semantics
Flag this post
LightGBM Explained
yanisfalaki.comΒ·4hΒ·
Discuss: Hacker News
🧠Machine Learning
Flag this post
Archive extraction support in na_game_tool
codecs.multimedia.cxΒ·16h
πŸ“¦Archive Formats
Flag this post
Show HN: PyNIFE. 400-900Γ— speedup for embedding-based retrieval pipelines
github.comΒ·3hΒ·
Discuss: Hacker News
πŸŒ€Brotli Internals
Flag this post
Virtual Bodies and Flickering Signifiers | Katherine Hayles
web.archive.orgΒ·16h
🧲Magnetic Philosophy
Flag this post
DeepOCR – a free image β†’ text extractor,no signup
deepocr.ccΒ·1dΒ·
Discuss: Hacker News
πŸ‘οΈOCR Verification
Flag this post
Benchmarking the Most Reliable Document Parsing API
tensorlake.aiΒ·2dΒ·
Discuss: Hacker News
βš™οΈCompression Benchmarking
Flag this post
Building effective workflows for oral history projects: Collaboration, structure, and AI innovation
hangingtogether.orgΒ·2d
πŸ”„Archival Workflows
Flag this post
Automated Figure-Text Alignment & Knowledge Extraction for Scientific Literature
dev.toΒ·4dΒ·
Discuss: DEV
πŸ”Semantic Search
Flag this post
2025-11-08: Using Amazon SageMaker Ground Truth for Crowdsourcing
ws-dl.blogspot.comΒ·8hΒ·
πŸ€–Archive Automation
Flag this post
Normalized tensor train decomposition
arxiv.orgΒ·2d
πŸ•ΈοΈTensor Networks
Flag this post
How LLMs Read Docs
aiwiki.devΒ·23hΒ·
Discuss: Hacker News
⚑Proof Automation
Flag this post
Just know stuff (or, how to achieve success in a machine learning PhD) (2023)
kidger.siteΒ·15hΒ·
Discuss: Hacker News
πŸ“Linear Algebra
Flag this post
The Production Generative AI Stack: Architecture and Components
thenewstack.ioΒ·2d
πŸŒ€Brotli Internals
Flag this post