OCR Verification

Feeds to Scour
SubscribedAll
Scoured 16 posts in 37.0 ms

Overcoming Decoder Inconsistencies in Whisper for Dravidian and Low-Resource Languages

 🎙️Whisper  Content type: Academic
arxiv.org·

Higgs Audio v3 TTS 4B. Built for voice chat. Support 100 languages and inline control.

 🎓Academic Torrents
Less-relevant results

lbj96347/nemotron-3.5-asr-ios: On-device, offline speech recognition for iPhone/iPad using NVIDIA's Nemotron-3.5-ASR Streaming 0.6B (multilingual) via CoreML.SwiftUI app with mic capture + audio file import, RNN-Tdecoding, and live benchmark metrics (latency, RTF, memory).

 🔓Open Source Software  Content type: Code
github.com··Hacker News

ZTE showcases AI-driven project management innovations at the 14th IPMA Research Conference 2026

 📄Document Digitization  Content type: News
theregister.com·

Common Accessibility Challenges in PDF Documents

 📋Document Standards
pdfa.org·

The right’s culture war over prostate cancer screening is damaging trust in medicine | Polly Toynbee

 ⚖️Lossy Compression Ethics  Content type: News
theguardian.com·

End-to-End Training for Discrete Token LLM based TTS System

 🤖Grammar Induction  Content type: Academic
arxiv.org·

Discovery of Cold War-era rare Eastern Bloc computers in a German hangar

 ✈️Datasaab D2

Vision Language Model Helps Private Information De-Identification in Vision Data

 📄OCR  Content type: Academic
arxiv.org·

Why We Should Rethink the Term “Unstructured Data”

 🔢OCR Mathematics
info.aiim.org·

Hearing the Unspoken: Language Model Priors for Acoustic Adversarial Attacks

 📊Rate-Distortion Theory  Content type: Academic
arxiv.org·

Beyond WER: A Paired Acoustic Stress Test for Ambient Clinical Scribes

 🔊Acoustic Forensics  Content type: Academic
arxiv.org·

Handwriting Extraction and Analysis of Signature Lists in Swiss Popular Initiatives

 📄Document Digitization  Content type: Academic
arxiv.org·

Cross-Modal Masking for Robust Silent Speech Synthesis Using sEMG and Lipreading

 👂Psychoacoustic Coding  Content type: Academic
arxiv.org·

Real-Time Automatic License Plate Recognition Using YOLOv8, SORT Tracking, and Temporal Data Interpolation

 📄OCR  Content type: Academic
arxiv.org·

VTI-CoT: Visual-Textual Interleaved Chain of Thought for Video Reasoning

 📄OCR  Content type: Academic
arxiv.org·

No more posts from matmat's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help