Swift Sings: MDX-Net Vocal Splits and RVC Voice Conversion On-Device with ONNX/CoreML
web.navan.dev·2d
🎧Learned Audio
Calligraphers and Storytellers
🇯🇵Japanese Computing
Scientists Need a Positive Vision for AI
spectrum.ieee.org·2h
🔲Cellular Automata
On-Policy Distillation
💻Local LLMs
Quantifying Investor Sentiment & Dynamic Capital Allocation using Multi-Modal Bayesian Networks
🎛️Feed Filtering
AI Transcription for Students on a Budget — Say Goodbye to Per-Minute Fees
✅FLAC Verification
Towards actionable hypotension prediction- predicting catecholamine therapy initiation in the intensive care unit
arxiv.org·11h
🧠Machine Learning
Algorithmic Choreography: Generative Sonic Landscapes via Neural Field Synthesis
🎧Learned Audio
K-DAREK: Distance Aware Error for Kurkova Kolmogorov Networks
arxiv.org·1d
🧮Kolmogorov Complexity
Adaptive Transformer Architecture Optimization via Hyper-parameter Exploration and Reinforcement Learning
⚡Z3 Optimization
1-Minute Ghibli Style Transformation: Unleash Your Creative Potential with Fast Image AI
⟷Bidirectional Programming
A geometric and deep learning reproducible pipeline for monitoring floating anthropogenic debris in urban rivers using in situ cameras
arxiv.org·11h
🌀Differential Geometry
A beginner's guide to the Video-Utils model by Nicolascoutureau on Replicate
🎬Video Codecs
Accurate and Scalable Multimodal Pathology Retrieval via Attentive Vision-Language Alignment
arxiv.org·1d
🧮Vector Embeddings
Evaluating LLMs on Generating Age-Appropriate Child-Like Conversations
arxiv.org·11h
📝ABNF Extensions
Enhanced Predictive Modeling of Brazil Nut Effect Distribution via Dynamic Network Rescaling
🔲Cellular Automata
Unsupervised Machine-Learning Pipeline for Data-Driven Defect Detection and Characterisation: Application to Displacement Cascades
arxiv.org·11h
💾Floppy Imaging
Code-enabled language models can outperform reasoning models on diverse tasks
arxiv.org·2d
🧠Intelligence Compression