๐Ÿฟ๏ธ ScourBrowse
๐ŸŽ™๏ธ Whisper

speech-to-text, model, local, whisper, whisper.cpp, input, voice, recognition, ai

DLL Injections and System API Notes
standinglynx.com · 14h
🧩 Low-code
AsmJit: Lightweight C++ library for low-latency machine code generation
asmjit.com · 1d · Discuss: Hacker News
Static Analysis
Meta Beats Copyright Suit From Authors Over AI Training on Books
tech.slashdot.org · 41m
CMS
Show HN: A different kind of AI Video generation
news.ycombinator.com · 1h · Discuss: Hacker News
🎚️ Voice AI Systems
Vibe Learning to Fearlessly Explore Unfamiliar Tech
kaveh.page · 2d · Discuss: Hacker News
✨ vibe-coding
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
arxiv.org · 1d
🎚️ Voice AI Systems
Practical tips to optimize documentation for LLMs, AI agents, and chatbots
biel.ai · 1d · Discuss: Hacker News
🏗️ AI Infrastructure
GCP Fundamentals: Contact Center AI Platform API
dev.to · 1d · Discuss: DEV
🧠 AI
SUTRA: Decoupling Concept & Language for Multilingual LLM Excellence
hackernoon.com · 10h
💻 Local LLMs
Agentic AI: Implementing Long-Term Memory
towardsdatascience.com · 1d
💻 Local LLMs
GCP Fundamentals: Cloud Text-to-Speech API
dev.to · 4d · Discuss: DEV
🗣️ Voice Coding
A Robust Method for Pitch Tracking in the Frequency Following Response using Harmonic Amplitude Summation Filterbank
arxiv.org · 22h
🎚️ Voice AI Systems
Semantic Scene Graph for Ultrasound Image Explanation and Scanning Guidance
arxiv.org · 22h
🏗️ AI Infrastructure
PlanMoGPT: Flow-Enhanced Progressive Planning for Text to Motion Synthesis
arxiv.org · 1d
🎚️ Voice AI Systems
LOGICPO: Efficient Translation of NL-based Logical Problems to FOL using LLMs and Preference Optimization
arxiv.org · 1d
💻 Local LLMs
LLM-driven Medical Report Generation via Communication-efficient Heterogeneous Federated Learning
arxiv.org · 1d
💻 Local LLMs
Conversational Intent-Driven GraphRAG: Enhancing Multi-Turn Dialogue Systems through Adaptive Dual-Retrieval of Flow Patterns and Context Semantics
arxiv.org · 22h
🎤 Voice Interfaces
"I understand why I got this grade": Automatic Short Answer Grading with Feedback
arxiv.org · 1d
🏗️ AI Infrastructure
Multilingual innovation in LLMs: How open models help unlock global communication
developers.googleblog.com · 2d
💻 Local LLMs
MemeMind: A Large-Scale Multimodal Dataset with Chain-of-Thought Reasoning for Harmful Meme Detection
arxiv.org · 22h
🏗️ AI Infrastructure