🔊 TTS - bloknayrb · Scour

docs: document remaining agent tool tests · openclaw/openclaw@f39aff1

⌨️CLI Tools Code

Cross-Modal Masking for Robust Silent Speech Synthesis Using sEMG and Lipreading

🎤Voice Interfaces Academic

legostin/learn-almost-anything: Local desktop AI tutor: designs personalized courses on any topic with articles, interactive widgets, comprehension tests, homework review, and TTS lectures. Free — uses your Claude Pro/Max or ChatGPT Plus/Pro subscription. Tauri + React + Claude Agent SDK / Codex SDK.

🎭Anthropic Claude Code

github.com··r/SideProject

dots.tts Technical Report

📊Embeddings Academic

Test-Time Scaling in Multimodal Foundation Models: A Comprehensive Survey of Generation and Reasoning

💬LLMs Academic

Audio-Oscar: A Multi-Agent System for Complex Audio Scene Generation, Orchestration, and Refinement

🤝Multi-Agent Systems Academic

Task-Vector Arithmetic for Emotional Expressivity Control in Language-Model-Based Text-to-Speech

🎤Voice Interfaces Academic

UniVoice: A Unified Model for Speech and Singing Voice Generation

🎤Voice Interfaces Academic

GLASS: GRPO-Trained LoRA for Acoustic Style Steering in Zero-Shot Text-to-Speech

📷Photography Academic

FoeGlass: Simple In-Context Learning Is Enough for Red Teaming Audio Deepfake Detectors

🤖LLM Academic

Log in to enable infinite scrolling