Character Classification

Feeds to Scour
SubscribedAll
Scoured 80 posts in 18.7 ms

C++26: Cleaning up string literals

 🔤Coded character sets  Content type: Blog
sandordargo.com··Hacker News, r/cpp

Ezi_gex — a Unicode-aware regex engine for Zig, with comptime compilation and pluggable backends

 🔍RegEx Engines
ziggit.dev·

A Taxonomy of Real-World Asset Tokenization for Blockchain-Based Financial Infrastructure

 📄Text Mining  Content type: Academic
arxiv.org·

princezuda/-RequiemGPT-: Fully open source and open weights built and trained by fable five with one prompt. An experience in how AI actually works

 📄Text Mining  Content type: Code
github.com··Hacker News

Unicode Fonts and Tools for X11

 🌏Character Sets
cl.cam.ac.uk··Hacker News

How to convert bytes to grams (2023)

 🔤Coded character sets  Content type: Blog
bryanhu.com··Hacker News

How many consecutive hyphens can you have in a domain name?

 📋Format Specification  Content type: Blog
shkspr.mobi··Hacker News

Perfecting Terminal Character Width Using Correction Tables · Articles

 📟Terminals
jeffquast.com·

What Are Tokens in LLMs?

 🔤Coded character sets  Content type: Blog

Ordered key sharding in DynamoDB

 🗄️Database Sharding
death.andgravity.com·

A system programmer’s guide to LLM inference

 💻Local LLMs  Content type: Blog

Why REAL Finance sees infrastructure as the next phase of tokenized finance

 📄Text Mining  Content type: News
thenextweb.com·

convictional/souls-only: A font for humans not AI and keyboard firmware to type in it.

 🖋Typography  Content type: Code
github.com··Hacker News

MeshTok: Efficient Multi-Scale Tokenization for Scalable PDE Transformers

 📄Text Mining  Content type: Academic
arxiv.org·

LDARNet: DNA Adaptive Representation Network with Learnable Tokenization for Genomic Modeling

 📄Text Mining  Content type: Academic
arxiv.org·

DREAM: Dynamic Refinement of Early Assignment Mappings

 📄Text Mining  Content type: Academic
arxiv.org·

CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding

 📄Text Mining  Content type: Academic
arxiv.org·

ChannelTok: Efficient Flexible-Length Vision Tokenization

 📄Text Mining  Content type: Academic
arxiv.org·

Balancing Image Compression and Generation with Bootstrapped Tokenization

 📄Text Mining  Content type: Academic
arxiv.org·

LimiX-2M: Mitigating Low-Rank Collapse and Attention Bottlenecks in Tabular Foundation Models

 📄Text Mining  Content type: Academic
arxiv.org·

No more posts from matmat's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help