Whisper

Feeds to Scour
SubscribedAll
Scoured 18 posts in 10.5 ms

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

 👁️OCR Verification  Content type: Blog
huggingface.co·

rccyx/asryx: Daemonless Linux native ASR binary (embedded via whisper.cpp C API, no dependencies beyond the standard C++ and Linux toolchain)

 🔌Operating system internals  Content type: Code
github.com··Hacker News

Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders

 🧠Learned Codecs  Content type: Academic
arxiv.org·

Show HN: AudioTap, record calls, transcribe and diarize locally

 🔌Offline-first Apps
audiotap.app··Hacker News

Build a local voice agent with Red Hat OpenShift AI

 Parallel Computing
developers.redhat.com·

How a user's feedback got me to finally use Apple's NaturalLanguage framework (for transcript anonymization)

 ⚛️Quantum Codes
Less-relevant results

Folder Heart Signal Korea S5, E08, 1080p, English Official Subtitles and OpenAI Whisper Machine Translation Subtitles

 📦MKV Containers

Show HN: Every Claw Deserves a Face

 🏠HomeLab
nyxclaw.ai··Hacker News

EvilIrving/ai-transcriber: Transcribe and summarize videos, podcasts, local files, and RSS feeds using Whisper and OpenAI-compatible LLMs.

 📡Feed Archaeology  Content type: Code
github.com··r/WebApps

Building "Customer Escape Room": A Voice-Powered Game That Teaches Customer Experience by Making You Live Through Support Hell

 ⚔️Lean Tactics
dailybuild.xyz··DEV

Subtitle-Aligned Fine-Tuning of Whisper for Swiss German ASR: Benchmark Contamination, Convention Mismatch, and an Honest Baseline at 25.6% WER (13.8% cWER)

 🎵Audio ML  Content type: Academic
arxiv.org·

Community Web UI (unofficial)

 🏠HomeLab
get-hermes.ai··Hacker News

Open Notebook’s AI-powered podcasts are a game-changer for productivity, provided you’re willing to configure them right

 🔓Open Source Software
xda-developers.com·

ibrahimqureshae/whisperx-transcriber: Offline AI transcription for Windows. Word-level timestamps. No cloud. No subscription. Free forever.

 🎬ffmpeg  Content type: Code
github.com··r/editors

OpenAI Whisper in 150 lines of NumPy

 🌊Streaming Compression  Content type: Code
github.com··Hacker News

chipmates/agoracosmica: A Living Library You Can Talk To. Open-source educational platform with 30 historical figures from philosophy, science, art, mysticism, and activism. Stories, dialogues, AI conversation, multi-figure councils. Nonprofit, BYOK, self-hostable, no behavioral tracking.

 🔓Free and open source  Content type: Code
github.com··Hacker News

tetherto/qvac: QVAC - Local AI SDK and libraries for building private, cross-platform, peer-to-peer AI applications. Run LLMs, speech-to-text, translation, and more locally on Linux, macOS, Windows, Android, and iOS.

 🏴‍☠️Piracy  Content type: Code
github.com·

Alradyin/wallie-V2: AI VTuber / streamer framework with real-time vision, personality engine, and lip-synced avatar — built for Twitch, YouTube, and Kick.

 📊Spectral Analysis  Content type: Code
github.com··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help