Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
๐ Tokenizer Performance
Lexical Analysis, Unicode Handling, SIMD Optimization, Streaming
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80606
posts in
220.5
ms
Coding A PoS
Tagger
from Scratch โ A Statistical Part-of-Speech
Tagger
|
NLP
pub.towardsai.net
ยท
18h
๐ค
Language Tokenizers
LOCA-bench
: Benchmarking Language Agents Under
Controllable
and Extreme Context Growth
arxiv.org
ยท
11h
๐
Incremental Parsers
Large Language Models for
Mortals
book released
crimede-coder.com
ยท
2h
ยท
Discuss:
Hacker News
๐ฑ
Minimal ML
Build Voice AI in Python: Complete Speech-to-Text Developer Guide (2026)
dev.to
ยท
1h
ยท
Discuss:
DEV
๐
Incremental Tokenizers
A Note on
Flat
Abstract
Syntax
Trees
gist.github.com
ยท
22h
ยท
Discuss:
Hacker News
๐ณ
Tree Walking
Adaptive
Protein
Tokenization
arxiv.org
ยท
1d
๐ค
Language Tokenizers
Taming the Regex Monster: Optimizing Massive
Literal
Alternations
modern-c.blogspot.com
ยท
4d
ยท
Discuss:
r/golang
๐ค
Regex Engines
Faster
AI Training
Unlocked
With New System For Massive Language Models
quantumzeitgeist.com
ยท
1d
โก
Tokenizer Optimization
Show HN: Deterministic
linguistic
enrichment
pipeline for Node.js
npmjs.com
ยท
5h
ยท
Discuss:
Hacker News
๐ค
Language Tokenizers
Show HN:
AetherLang
โ A
DSL
for building AI workflows with visual debugging
github.com
ยท
15h
ยท
Discuss:
Hacker News
โจ
Gleam
Colab
marketplace.visualstudio.com
ยท
2h
๐
Comby
Document
Clustering
with LLM Embeddings in
Scikit-learn
machinelearningmastery.com
ยท
5h
๐
Text Algorithms
25W06
. Learning a language with the machine
z1nz0l1n.com
ยท
2d
๐ฑ
Minimal ML
Sneaky
quokka
: Testing and debugging with LLMs
honnibal.dev
ยท
7h
๐งช
Parser Testing
Lucene
HNSW
performance: A deep dive into the OS page cache
opensearch.org
ยท
19h
โก
Cache-Aware Algorithms
StatLLM
: A Dataset for Evaluating the Performance of Large Language Models in
Statistical
Analysis
nature.com
ยท
4d
๐
LR Parsing
Using AI to
write
a
transpiler
dev.to
ยท
17h
ยท
Discuss:
DEV
๐ญ
Program Synthesis
What I've Learned From
Digitizing
20 Million
Historical
Documents
noahdasanaike.github.io
ยท
1d
ยท
Discuss:
r/LocalLLaMA
๐
Streaming Lexers
Modernizing
my 150-line Python search engine: Yahoo!
dumps
-> Hugging Face ๐ค
bart.degoe.de
ยท
21h
ยท
Discuss:
Hacker News
๐ฑ
Minimal ML
Domain
Specific
Languages
martinfowler.com
ยท
2h
๐จ
Domain-Specific Languages
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help