Indexical Collapse: How Predictive Systems Make Authority Without Reference
dev.toยท4hยท
Discuss: DEV
๐ŸŽฏGradual Typing
Indent: Indent and Format C Program Source
gnu.orgยท4hยท
Discuss: Hacker News
๐Ÿ“Text Compression
Expressing Text and Data Mining Rights with Datalogics PDF Optimizer + TDMRep
pdfa.orgยท2h
๐Ÿ“„Document Digitization
Using an LLM on the Advent of Code
funcall.blogspot.comยท4hยท
โš”๏ธLean Tactics
Death of the Novel(ty): Beyond n-Gram Novelty as a Metric for Textual Creativity
arxiv.orgยท18h
๐Ÿง Intelligence Compression
In Defense of Tokenizers
huggingface.coยท1dยท
Discuss: Hacker News
๐Ÿ“Text Parsing
A Practical Guide to Regular Expressions โ€“ Learn RegEx with Real Life Examples
freecodecamp.orgยท16h
๐Ÿ”RegEx Engines
Document Workflow Transformation: How Modern AI Models Transform CRM Systems
dev.toยท2hยท
Discuss: DEV
๐Ÿ”„Schema Evolution
Two Decades Of Hackaday In Words
hackaday.comยท8h
๐ŸงชCassette Hacks
How to Detect Forbidden Words in Text (Without Slowing Down) โ€“ Part II
blog.codeminer42.comยท3d
๐ŸŒณTrie Structures
Why and When to Use Sentence Embeddings Over Word Embeddings
machinelearningmastery.comยท3d
๐Ÿ“Text Embeddings
Oldness: how does learn new productive work-study practices?
notes.kateva.orgยท7hยท
๐ŸŒฑPersonal Wikis
AI Is Great at Parsing
keeb.devยท1dยท
Discuss: Hacker News
๐Ÿง Learned Codecs
General Pruning Criteria for Fast SBL
arxiv.orgยท18h
โง—Information Bottleneck
Known Anomalies in Unicode Character Names
unicode.orgยท2dยท
Discuss: Hacker News
๐Ÿ”คUnicode Normalization
Apply the Trie: Word Search
mmhaskell.comยท13hยท
Discuss: Hacker News
๐ŸŒณTrie Structures
Why, in old books, are dates often given with the years redacted? (2012)
english.stackexchange.comยท12hยท
Discuss: Hacker News
๐Ÿ“œText Collation
Rags for dummies
dev.toยท2hยท
Discuss: DEV
๐Ÿ“ŠMulti-vector RAG
ML on Apple ][+
mdcramer.github.ioยท5hยท
Discuss: Hacker News
๐ŸงฎKolmogorov Bounds
REMA: A Unified Reasoning Manifold Framework for Interpreting Large Language Model
arxiv.orgยท18h
๐Ÿ“‹Document Grammar