Using LangExtract and Elasticsearch
elastic.coยท18h
๐Ÿš€LanceDB
Optimizing for AI: How search engines power ChatGPT, Gemini and more
searchengineland.comยท5h
๐Ÿ“ŠFeed Optimization
Towards Knowledge-Aware Document Systems: Modeling Semantic Coverage Relations via Answerability Detection
arxiv.orgยท14h
๐Ÿ“„Semantic Chunking
Urlref: Website Bookmarking for Handwritten Notes
benjaminhollon.comยท9hยท
Discuss: Hacker News
๐Ÿ“‹Markdown
Overview of the DiskANN Project (2018โ€“present)
harsha-simhadri.orgยท21hยท
Discuss: Hacker News
๐Ÿ—‚๏ธVector Indexes
Building RAG systems at enterprise scale (20K+ docs): lessons from 10+ enterprise implementations
reddit.comยท2hยท
Discuss: r/LocalLLaMA
๐Ÿ“‡Indexing Strategies
Announcing New SQL Features in Public Preview to Optimize Snowflake Workloads
snowflake.comยท18h
โš™๏ธDatabase Internals
Semlib: LLM-powered Data Processing
anishathalye.comยท18hยท
Discuss: Lobsters
๐Ÿ†LLM Benchmarking
Advanced SEO schema markup strategies for 2025:
threadreaderapp.comยท23h
๐Ÿ“‡Indexing Strategies
The most interesting documents we've had to process as an OCR company
trycardinal.medium.comยท17hยท
Discuss: Hacker News
๐Ÿ“„Semantic Chunking
Judgeโ€™s Google Data Sharing Order Leaves Small Rivals Hanging
theinformation.comยท5h
๐Ÿ”—Hybrid Search
D1, Workers - D1 automatically retries read-only queries
developers.cloudflare.comยท18h
๐Ÿ’งLitestream
Choosing a model for a research platform with real data and metrics
maxirwin.comยท23hยท
Discuss: Hacker News
๐Ÿ†LLM Benchmarking
Web Scraping With Python
medium.comยท4hยท
Discuss: r/programming
๐Ÿ’ณContent Monetization
<p>๐Ÿ”— <a href="https://manuelmoreale.com/on-em-dashes">Manuel Moreale: On em dashes</a></p>
lmika.orgยท18h
๐Ÿ“‘Inverted Indexes
How Exabeam uses ClickHouse for scalable, searchable security analytics
clickhouse.comยท18h
๐Ÿ’งLitestream
A free tool to check website traffic and reverse Adsense IDs
sitedata.devยท12hยท
Discuss: Hacker News
๐Ÿ’ณContent Monetization
Real-Time Detection of Hallucinated Entities in Long-Form Generation
hallucination-probes.comยท20hยท
Discuss: Hacker News
๐Ÿง LLM Inference
The strategy is enquiry
gilest.orgยท11h
๐Ÿ’ซSearch UX
OpenAI, DeepSeek, and Google vary widely in identifying hate speech
techxplore.comยท5h
๐Ÿ›ก๏ธContent Moderation