Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
✂️ Tokenization
Text Splitting, Word Boundaries, NLP Pipeline, Lexical Analysis
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
83034
posts in
428.8
ms
Pratt
Parsers
: Expression Parsing Made Easy
journal.stuffwithstuff.com
·
10h
🏭
Code Generation
Rebuilding
the
spellchecker
zverok.space
·
9h
🌱
Stemming
LLMs -
Custom
Tokenizers
dev.to
·
2d
·
Discuss:
DEV
🌱
Stemming
Modelling the Morphology of Verbal
Paradigms
: A Case Study in the Tokenization of Turkish and
Hebrew
arxiv.org
·
1d
🌱
Stemming
The Machine
Learned
Our Language
medium.com
·
18h
·
Discuss:
r/programming
💬
Prompt Engineering
What are
tokens
and how to
count
them?
help.openai.com
·
2d
·
Discuss:
Hacker News
🌱
Stemming
**Abstract:** This paper introduces Hyperdimensional Semantic Alignment for Ancient Text Restoration and
Contextualization
(
HASATRC
), a novel framework lever...
freederia.com
·
3d
📚
Digital Humanities
Words
talktofa.com
·
1d
📖
n+1
Simple
BERT
Models for Relation Extraction and Semantic Role
Labeling
dev.to
·
1d
·
Discuss:
DEV
💬
Natural Language Processing
On
Linguistic
Precision
blog.firedrake.org
·
19h
🌱
Stemming
Practical
NLP
for Risk Modeling, Part II - Fine-tuning
DistilBERT
End-to-End on Tornado Narratives
jtrive.com
·
22h
💬
Natural Language Processing
Are you going to finish that? A Practical Study of the
Tokenization
Boundary
Problem
arxiv.org
·
5d
🌱
Stemming
hanig/engram
: Personal knowledge graph and automation system
github.com
·
3h
🗂️
Obsidian
So
whats
the next word, then? Almost-no-math
intro
to transformer models
matthias-kainer.de
·
1d
·
Discuss:
Hacker News
💬
Prompt Engineering
Count
Words
&
Correct
Writing
wordcounter.net
·
1d
🔁
Spaced Repetition
Helping
Writers
Find The Words
descriptionary.wordpress.com
·
3h
🌱
Digital Gardens
q3m
– A
what3words-like
geocoding library for France, using 3 French words
reddit.com
·
10h
·
Discuss:
r/golang
🗺️
OpenStreetMap
AI
Document
Processing with
Docling
Java and Spring Boot
thomasvitale.com
·
1d
💻
Creative Coding
How I
squeezed
a
BERT
sentiment analyzer into 1GB RAM on a $5 VPS
mohammedeabdelaziz.github.io
·
13h
·
Discuss:
Hacker News
🥶
Cold Start Problem
AI-powered text
correction
for
macOS
taipo.app
·
4h
·
Discuss:
Hacker News
🌱
Stemming
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help