Swift Sings: MDX-Net Vocal Splits and RVC Voice Conversion On-Device with ONNX/CoreML
web.navan.dev·2d
🎧Learned Audio
Playing around with org-db-v3 and consult: vector search of my blog post Org files, with previews
sachachua.com·8h
⏱️Interval Archives
In Search of Better Search
cacm.acm.org·18h
🤖AI Curation
How to Transcribe Lectures Longer Than 2 Hours Without Time Limits?
📄Document Phonetics
The Mirror Loop: Recursive Non-Convergence in Generative Reasoning Systems
arxiv.org·1d
🔗Parser Combinators
SentiMaithili: A Benchmark Dataset for Sentiment and Reason Generation for the Low-Resource Maithili Language
arxiv.org·1d
⚙️Compression Benchmarking
MedXplain-VQA: Multi-Component Explainable Medical Visual Question Answering
arxiv.org·1d
📄OCR
Understanding Reader Perception Shifts upon Disclosure of AI Authorship
arxiv.org·4h
🏰Manuscript Networks
AI-Driven Optimization of Gel-Type Air Freshener Formulations for Enhanced Olfactory Longevity and Stability
🧠Machine Learning
GRAD: Real-Time Gated Recurrent Anomaly Detection in Autonomous Vehicle Sensors Using Reinforced EMA and Multi-Stage Sliding Window Techniques
arxiv.org·1d
🧠Machine Learning
Open Korean Historical Corpus: A Millennia-Scale Diachronic Collection of Public Domain Texts
arxiv.org·4h
🔤Character Classification
GroupSHAP-Guided Integration of Financial News Keywords and Technical Indicators for Stock Price Prediction
arxiv.org·1d
🧠Machine Learning
CDrugRed: A Chinese Drug Recommendation Dataset for Discharge Medications in Metabolic Diseases
arxiv.org·2d
🔍Information Retrieval
Automated Prioritization of Rare Disease Clinical Trials via Multi-Modal Data Fusion and HyperScore Evaluation
🤖Archive Automation
Automated Tinnitus Detection Through Dual-Modality Neuroimaging: EEG Microstate Analysis and Resting-State fMRI Classification Using Deep Learning
arxiv.org·1d
🌈Spectral Audio
Beyond IVR Touch-Tones: Customer Intent Routing using LLMs
arxiv.org·1d
🎙️Whisper
ChessQA: Evaluating Large Language Models for Chess Understanding
arxiv.org·4h
🧠Intelligence Compression
AnyECG-Lab: An Exploration Study of Fine-tuning an ECG Foundation Model to Estimate Laboratory Values from Single-Lead ECG Signals
arxiv.org·1d
🎵Audio ML