Scour
🚀 Tokenizer Performance
Lexical Analysis, Unicode Handling, SIMD Optimization, Streaming
Scoured 111632 posts in 370.2 ms
Detecting Overflow in Compressed Token Representations for Retrieval-Augmented Generation
arxiv.org · 10h
🌊 Streaming Lexers
Random Access in Grammar-Compressed Strings: Optimal Trade-Offs in Almost All Parameter Regimes
arxiv.org · 1d
📐 Succinct Data Structures
Show HN: OCR Arena – A playground for OCR models
news.ycombinator.com · 1d · Discuss: Hacker News
⚡ Tokenizer Optimization
high error rate in text-embedding-3-small
status.openai.com · 1d
🧪 Parser Testing
Large Language Models for Mortals book
andrewpwheeler.com · 2d
🌱 Minimal ML
DeepSeek-V3.2 on GB300: Performance Breakthrough
blog.vllm.ai · 15h
🗺️ Region Inference
Built a Hybrid RAG API with FastAPI & Ollama – Sparse + Dense retrieval in action.
youtu.be · 1d · Discuss: DEV
🌳 Pattern Match Compilation
Show HN: WavNav, a desktop app to explore and search large sample libraries
maxgraf.space · 2h · Discuss: Hacker News
🌳 Parser Visualization
Show HN: Decoder – Static call graph analysis for Python
github.com · 5h · Discuss: Hacker News
📊 Call Graph Analysis
🔑 Beginner-Friendly Guide 'Longest Balanced Substring II' - Problem 3714 (C++, Python, JavaScript)
dev.to · 6h · Discuss: DEV
🔤 String Algorithms
Programming languages
mothcodes.bearblog.dev · 5h
🔬 programming language theory
wordchipper - my next-gen LLM tokenizer; looking for LTR release help
docs.rs · 3d · Discuss: r/rust
🔤 Language Tokenizers
A History of Large Language Models
gregorygundersen.com · 1d
🪜 Recursive Descent
Coding A PoS Tagger from Scratch: A Statistical Part-of-Speech Tagger | NLP
pub.towardsai.net · 3d
🔤 Language Tokenizers
How Andrej Karpathy Built a Working Transformer in 243 Lines of Code
analyticsvidhya.com · 1d
🪜 Recursive Descent
Unleash your ideas with ASCII
monosketch.io · 3h · Discuss: Hacker News
💬 Smalltalk VMs
Index Compression, Query Execution Improvements
marginalia.nu · 15h
📊 Query Optimizers
Scaling LLM Post-Training at Netflix
netflixtechblog.com · 7h
🗺️ Region Inference
Zvec: SQLite-like simplicity in an embedded vector database (By Alibaba)
zvec.org · 1d · Discuss: Hacker News
💾 Minimal Databases
[AINews] Z.ai GLM-5: New SOTA Open Weights LLM
latent.space · 1d
🏁 Language Benchmarks