Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
๐ Tokenizer Performance
Lexical Analysis, Unicode Handling, SIMD Optimization, Streaming
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80168
posts in
1.29
s
Coding A PoS
Tagger
from Scratch โ A Statistical Part-of-Speech
Tagger
|
NLP
pub.towardsai.net
ยท
16h
๐ค
Language Tokenizers
LOCA-bench
: Benchmarking Language Agents Under
Controllable
and Extreme Context Growth
arxiv.org
ยท
9h
๐
Incremental Parsers
A Note on
Flat
Abstract
Syntax
Trees
gist.github.com
ยท
19h
ยท
Discuss:
Hacker News
๐ณ
Tree Walking
Adaptive
Protein
Tokenization
arxiv.org
ยท
1d
๐ค
Language Tokenizers
Using AI to
write
a
transpiler
dev.to
ยท
14h
ยท
Discuss:
DEV
๐ญ
Program Synthesis
Taming the Regex Monster: Optimizing Massive
Literal
Alternations
modern-c.blogspot.com
ยท
4d
ยท
Discuss:
r/golang
๐ค
Regex Engines
Colab
marketplace.visualstudio.com
ยท
5m
๐
Comby
Show HN: Deterministic
linguistic
enrichment
pipeline for Node.js
npmjs.com
ยท
2h
ยท
Discuss:
Hacker News
๐ค
Language Tokenizers
Faster
AI Training
Unlocked
With New System For Massive Language Models
quantumzeitgeist.com
ยท
1d
โก
Tokenizer Optimization
Show HN:
AetherLang
โ A
DSL
for building AI workflows with visual debugging
github.com
ยท
12h
ยท
Discuss:
Hacker News
โจ
Gleam
25W06
. Learning a language with the machine
z1nz0l1n.com
ยท
2d
๐ฑ
Minimal ML
Document
Clustering
with LLM Embeddings in
Scikit-learn
machinelearningmastery.com
ยท
3h
๐
Text Algorithms
StatLLM
: A Dataset for Evaluating the Performance of Large Language Models in
Statistical
Analysis
nature.com
ยท
4d
๐
LR Parsing
Sneaky
quokka
: Testing and debugging with LLMs
honnibal.dev
ยท
5h
๐งช
Parser Testing
Lucene
HNSW
performance: A deep dive into the OS page cache
opensearch.org
ยท
17h
โก
Cache-Aware Algorithms
What I've Learned From
Digitizing
20 Million
Historical
Documents
noahdasanaike.github.io
ยท
23h
ยท
Discuss:
r/LocalLLaMA
๐
Streaming Lexers
Modernizing
my 150-line Python search engine: Yahoo!
dumps
-> Hugging Face ๐ค
bart.degoe.de
ยท
19h
ยท
Discuss:
Hacker News
๐ฑ
Minimal ML
Building LLMs in
Resource-Constrained
Environments
: A Hands-On Perspective
infoq.com
ยท
1d
๐ช
Recursive Descent
AutoCleanML
โ Intelligent ML Data
preprocessing
automation (pip install
autocleanml
)
dev.to
ยท
1h
ยท
Discuss:
DEV
๐ฑ
Minimal ML
Grammar
and Spell Checker -
LanguageTool
โ Get this Extension for ๐ฆ Firefox (en-GB)
addons.mozilla.org
ยท
23h
ยท
Discuss:
r/firefox
๐ค
Language Tokenizers
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help