🎵 Audio ML - matmat · Scour

Towards Robust Arabic Speech Emotion Recognition with Deep Learning

🌊Digital Signal Processing Academic

Your fingers deserve a break — Voibe dictation lifetime access is on sale for $50

🎧Vorbis Encoding

EvilIrving/ai-transcriber: Transcribe and summarize videos, podcasts, local files, and RSS feeds using Whisper and OpenAI-compatible LLMs.

🎙️Whisper Code

github.com··r/WebApps

Build a local voice agent with Red Hat OpenShift AI

developers.redhat.com·

OpenCode Plugin by Aito's Intelligence

🐚Shell Automation

interrupt.camaramagic.com··r/selfhosted

Subtitle-Aligned Fine-Tuning of Whisper for Swiss German ASR: Benchmark Contamination, Convention Mismatch, and an Honest Baseline at 25.6% WER (13.8% cWER)

🎙️Whisper Academic

LocalClicky - 通过语音在本地控制您的 Mac

💻Local LLMs Code

GNSS-FM: A Self-Supervised Foundation Model for Daily GNSS Displacement Time Series

🌀Riemannian Computing Academic

ibrahimqureshae/whisperx-transcriber: Offline AI transcription for Windows. Word-level timestamps. No cloud. No subscription. Free forever.

🎙️Whisper Code

github.com··r/editors

Log in to enable infinite scrolling