Show HN: Interview Transcription with AI Quote Extraction and Q&A Format
Voice AI Systems
VoxScribe: A platform to test Opensource Speech-to-Text models
blog.devops.dev·2d
Speech Synthesis
All You Need for Object Detection: From Pixels, Points, and Prompts to Next-Gen Fusion and Multimodal LLMs/VLMs in Autonomous Vehicles
arxiv.org·6h
Edge AI
Bringing Vision-Language Intelligence to RAG with ColPali
towardsdatascience.com·1d
Low-code
A Senior Developer’s Guide to Vibe Coding and Deep AI Integration in Cursor
Low-code
PORTool: Tool-Use LLM Training with Rewarded Tree
arxiv.org·6h
Local LLMs
AI Front End Generator Comparison: Claude Code vs. v0 vs. Lovable vs. Replit
WebAssembly
Implicature in Interaction: Understanding Implicature Improves Alignment in Human-LLM Interaction
arxiv.org·1d
Voice Interfaces
Model Inversion with Layer-Specific Modeling and Alignment for Data-Free Continual Learning
arxiv.org·6h
Edge AI
Why do AI models use so many em-dashes?
vibe-coding
What's In My Human Feedback? Learning Interpretable Descriptions of Preference Data
arxiv.org·6h
Self-hosted AI
Contribution-Guided Asymmetric Learning for Robust Multimodal Fusion under Imbalance and Noise
arxiv.org·6h
Local LLMs
EchoMind: An Interrelated Multi-level Benchmark for Evaluating Empathetic Speech Language Models
arxiv.org·3d
Voice Interfaces
Arabic Little STT: Arabic Children Speech Recognition Dataset
arxiv.org·3d
Voice AI Systems
Testing Cross-Lingual Text Comprehension In LLMs Using Next Sentence Prediction
arxiv.org·1d
Local LLMs