Word Normalization, Search Indexing, Language Processing, Token Reduction

Feeds to Scour
SubscribedAll
Scoured 80751 posts in 1.45 s
Multilexnorm++ Achieves 5-Language Asian Lexical Normalization Benchmark for Improved NLP
quantumzeitgeist.com·1d
✂️Tokenization
Preview
Report Post
Abdul Rahman Sibahi | Knuth's Linebreaking Algorithm for non-Programmers
blog.ar-ms.me·1h
✂️Tokenization
Preview
Report Post
Overview of the TREC 2025 Tip-of-the-Tongue track
arxiv.org·18h
📝TextRank
Preview
Report Post
Transformer-based relation extraction and concept normalization using an annotated clinical trials corpus
nature.com·1d
💬Natural Language Processing
Preview
Report Post
A knowledge management system inspired by plain-text accounting
thalo.rejot.dev·15h·
Discuss: Hacker News
🗂️Obsidian
Preview
Report Post
Information Retrieval Part 1: Disambiguation
searchenginejournal.com·1d
📊Search Ranking
Preview
Report Post
Allen School researchers earn EMNLP Best Paper Award for making Internet-scale texts efficiently searchable with infini-gram mini
news.cs.washington.edu·4h
📊TF-IDF
Preview
Report Post
Augmanitai Lexikon Core Words
dev.to·1h·
Discuss: DEV
📝TextRank
Preview
Report Post
Chinese Boost: Grammar and more
chineseboost.com·20h
✂️Tokenization
Preview
Report Post
Taxonomy of the Retrieval System Framework: Pitfalls and Paradigms
arxiv.org·18h
🔍Information Retrieval
Preview
Report Post
Index and Search Every File on Your Homelab Server using Sist2
noted.lol·10h
🥶Cold Start Problem
Preview
Report Post
Efficient String Compression for Modern Database Systems
cedardb.com·23h
🔢Kolmogorov Complexity
Preview
Report Post
RSSDeck- A TweetDeck inspired RSS Reader + AI + Telegram bot
github.com·13h·
Discuss: r/rss
📰RSS Reading Practices
Preview
Report Post
Classifying the Ways LLMs Summarise in Academic Search
aarontay.substack.com·1d·
Discuss: Substack
🔍Information Retrieval
Preview
Report Post
Confluence Data Center 10.2 | Atlassian Documentation
confluence.atlassian.com·1d
📡RSS
Preview
Report Post
A Brief History of Searching
blog.mojeek.com·23h
🫧Filter Bubbles
Preview
Report Post
From Pratt parsing to the Dijkstra shunting yard
matklad.github.io·10h·
Discuss: Hacker News
✂️Tokenization
Preview
Report Post
Designing OCR Pipelines for 95%+ Accuracy: AI Engineering Learnings from Production
visionparser.com·12h·
Discuss: r/programming
✂️Tokenization
Preview
Report Post
Creating a Kaggle-Winning Data Analysis Project
dev.to·1d·
Discuss: DEV
🥶Cold Start Problem
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help