Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
✂️ Tokenization
Text Splitting, Word Boundaries, NLP Pipeline, Lexical Analysis
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
200517
posts in
32.9
ms
Semantic
Reranking
at Inference Time for Hard Examples in
Rhetorical
Role Labeling
📝
TextRank
arxiv.org
·
2h
GiLT
:
Augmenting
Transformer Language Models with Dependency Graphs
🤖
Transformer Architecture
arxiv.org
·
1d
Artificial
Aphasias
in
Lesioned
Language Models
🤖
Transformer Architecture
arxiv.org
·
1d
Learning
Variable-Length
Tokenization
for Generative Recommendation
🔗
RAG
arxiv.org
·
2h
TokAlign
++: Advancing
Vocabulary
Adaptation via Better Token Alignment
🌱
Stemming
arxiv.org
·
5d
Effective Context in Transformers: An Analysis of
Fragmentation
and
Tokenization
🤖
Transformer Architecture
arxiv.org
·
5d
Tokenizer
Fertility
and Zero-Shot Performance of Foundation Models on Ukrainian Legal Text: A Comparative Study
🌱
Stemming
arxiv.org
·
4d
Pretraining Language Models with
Subword
Regularization: An Empirical Study of
BPE
Dropout in Low-Resource NLP
🤖
Transformer Architecture
arxiv.org
·
5d
Language Modeling with
Hyperspherical
Flows
🔢
Kolmogorov Complexity
arxiv.org
·
6d
Seed Bank, Co-op,
Stoop
Swap:
Metaphors
for Governing Language Model Data for Creative Writing
🧩
Cognitive Architecture
arxiv.org
·
5d
Spectral
Vision Transformer for Efficient
Tokenization
with Limited Data
📊
TF-IDF
arxiv.org
·
6d
Probabilistic
Calibration
Is a
Trainable
Capability in Language Models
🔢
Kolmogorov Complexity
arxiv.org
·
6d
Correct Answers from Sound Reasoning:
Verifiable
Process
Supervision
for Language Models
🧠
LLM Reasoning
arxiv.org
·
5d
Images in Sentences: Scaling
Interleaved
Instructions
for Unified Visual Generation
🤖
Transformer Architecture
arxiv.org
·
6d
Dywave
: Event-Aligned Dynamic
Tokenization
for Heterogeneous IoT Sensing Signal
🔢
Kolmogorov Complexity
arxiv.org
·
4d
Exploring Token-Space Manipulation in
Latent
Audio
Tokenizers
🌱
Stemming
arxiv.org
·
6d
Multi-Stream LLMs:
Unblocking
Language Models with Parallel Streams of Thoughts,
Inputs
and Outputs
🧠
LLM Reasoning
arxiv.org
·
6d
InsightTok
: Improving Text and Face Fidelity in Discrete Tokenization for
Autoregressive
Image Generation
🔗
RAG
arxiv.org
·
4d
Discrimination Is Generation:
Unifying
Ranking and Retrieval from a
Tokenizer
Perspective
🔗
RAG
arxiv.org
·
4d
Sampling from Flow Language Models via
Marginal-Conditioned
Bridges
🔢
Kolmogorov Complexity
arxiv.org
·
5d
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help