๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ“„ Text Chunking

Semantic Segmentation, Context Windows, Document Boundaries, Retrieval Units

A.I. Is Homogenizing Our Thoughts
newyorker.comยท3h
๐Ÿ›Digital humanities
BPCLIP: A Bottom-up Image Quality Assessment from Distortion to Semantics Based on CLIP
arxiv.orgยท1d
๐Ÿ–ผ๏ธJPEG XL
What LLMs Know About Their Users
schneier.comยท7hยท
Discuss: Hacker News
๐Ÿ’ปLocal LLMs
Semantic-Aware Parsing for Security Logs
arxiv.orgยท1d
๐Ÿ“Log Parsing
Leveraging Large Language Models for Information Verification -- an Engineering Approach
arxiv.orgยท1d
๐Ÿ“‹Document Grammar
Could Open Table Formats End the Reign of Snowflake and Databricks?
prequel.coยท48mยท
Discuss: Hacker News
๐Ÿ“šMARC Evolution
Named Entity Recognition using Bidirectional LSTM and Conditional Random Fields
dev.toยท3dยท
Discuss: DEV
๐Ÿค–Grammar Induction
The collective waste caused by poor documentation
shanrauf.comยท16hยท
Discuss: Hacker News
๐Ÿ“ฆDeflate
Using an LLM for query planning in RAG โ€“> 40% better answer relevance
techcommunity.microsoft.comยท22hยท
Discuss: Hacker News
๐Ÿ”Information Retrieval
Show HN: Writer J โ€“ AI Content Generator with 7-Step SEO Workflow
writer-j.comยท1dยท
Discuss: Hacker News
๐Ÿ”ƒFeed Algorithms
V2T-CoT: From Vision to Text Chain-of-Thought for Medical Reasoning and Diagnosis
arxiv.orgยท14h
๐Ÿค–Advanced OCR
Why Your Next LLM Might Not Have A Tokenizer
towardsdatascience.comยท23h
๐Ÿค–Grammar Induction
10 FREE AI Tools Thatโ€™ll Save You 10+ Hours a Week
kdnuggets.comยท6h
๐ŸŽ™๏ธWhisper
Launch HN: Reducto Studio (YC W24) โ€“ Build accurate document pipelines, fast
news.ycombinator.comยท2dยท
Discuss: Hacker News
๐ŸŒ€Brotli Internals
Driving cost-efficiency and speed in claims data processing with Amazon Nova Micro and Amazon Nova Lite
aws.amazon.comยท1h
๐ŸŒŠStream Processing
LLMs for Customized Marketing Content Generation and Evaluation at Scale
arxiv.orgยท1d
๐Ÿ“ŠFeed Optimization
Text2Struct: A Machine Learning Pipeline for Mining Structured Data from Text
arxiv.orgยท1d
๐Ÿ”คCharacter Classification
Detect Narrative Threats with AI Personas
askrally.comยท2dยท
Discuss: Hacker News
๐Ÿ“กFeed Archaeology
SUTRA: Decoupling Concept & Language for Multilingual LLM Excellence
hackernoon.comยท2h
๐Ÿ’ปLocal LLMs
Kumo Surfaces Structured Data Patterns Generative AI Misses
thenewstack.ioยท4h
๐Ÿ“ŠGraph Databases
Loading...Loading more...
AboutBlogChangelogRoadmap