⚡ Tokenizer Optimization
SIMD Processing, State Machines, Unicode Handling, Performance
Scoured 112500 posts in 460.0 ms
The Script Tax: Measuring Tokenization-Driven Efficiency and Latency Disparities in Multilingual Language Models
arxiv.org · 14h
⚡ Tokenizer Benchmarks
Time-Optimal Construction of String Synchronizing Sets
arxiv.org · 14h
📝 String Interning
BalatroBench Benchmarks Large Language Models Playing Balatro
balatrobench.com · 8h · Discuss: Hacker News
🏁 Language Benchmarks
Lexer, Parser, Codegen
github.com · 13h · Discuss: DEV
📝 Lexer Generators
Breaking the Tractability Barrier: A Generic Low-Level Solver for NP-Hard Instances (N=63) on Commodity 64-Bit Silicon
zenodo.org · 13h · Discuss: r/programming
🧩 Constraint Solvers
AI Infra HPC
dev.to · 3h · Discuss: DEV
📊 Register Machines
Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration
machinelearning.apple.com · 19h
🏗️ MLIR
REPL-Driven Development Is Back (Thanks to AI)
llbbl.blog · 3h
💬 REPL Design
Contextual Memory Tools
trendhunter.com · 5h
📈 Earley Parsing
Technical "whitepaper" for afl-fuzz
lcamtuf.coredump.cx · 1d · Discuss: Lobsters
🎲 Parser Fuzzing
Zvec: SQLite-like simplicity in an embedded vector database (By Alibaba)
zvec.org · 1d · Discuss: Hacker News
💾 Minimal Databases
How AI Generates Brand Names: The Real Pipeline
dev.to · 18h · Discuss: DEV
⚡ Tokenizer Benchmarks
Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy
venturebeat.com · 21h · Discuss: r/LocalLLaMA
🗺️ Region Inference
facebookresearch/MUSE: A library for Multilingual Unsupervised or Supervised word Embeddings
github.com · 3h
🔤 Language Tokenizers
Recursive Language Models: Stop Stuffing the Context Window
nlp.elvissaravia.com · 23h
🪜 Recursive Descent
The Fourth Wave of Computing
lucibrowser.com · 9h · Discuss: Hacker News
🌱 Green Threads
Building an Embedding API with Rust, Arm, and EmbeddingGemma on AWS Lambda
sobolev.substack.com · 8h · Discuss: Substack
📋 JSON Parsing
How Andrej Karpathy Built a Working Transformer in 243 Lines of Code
analyticsvidhya.com · 1d
🪜 Recursive Descent
LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling
huggingface.co · 1d · Discuss: Hacker News
🔤 Language Tokenizers
CMU Flite: Speech Synthesizer
festvox.org · 1d
🧪 Minicompilers