String Algorithms, Suffix Arrays, Burrows-Wheeler Transform, Text Processing
Rethinking Tokenization for Rich Morphology: The Dominance of Unigram over BPE and Morphological Alignment
arxiv.orgยท10h
Friday 22 August 2025 - 11.00
informatics.ed.ac.ukยท4h
Morpho-phonologically AI
languagelog.ldc.upenn.eduยท1d
The AI Was Fed Sloppy Code. It Turned Into Something Evil.
quantamagazine.orgยท35m
Training Kindai OCR with parallel textline images and self-attention feature distance-based loss
arxiv.orgยท10h
Loading...Loading more...