Microsoft AI’s first in-house image generator MAI-Image-1 is now available
Learned Audio
[D][P] PKBoost v2 is out! An entropy-guided boosting library with a focus on drift adaptation and multiclass/regression support.
Brotli Dictionary
Readable Code Is Unreadable
APL Heritage
AI Summarization Optimization
Feed Optimization
Good abstractions for humans turn out to be good abstractions for LLMs
Effect Handlers
AI Progress Should Be Measured by Capability-Per-Resource, Not Scale Alone: A Framework for Gradient-Guided Resource Allocation in LLMs
arxiv.org · 19h
Intelligence Compression
HyFormer-Net: A Synergistic CNN-Transformer with Interpretable Multi-Scale Fusion for Breast Lesion Segmentation and Classification in Ultrasound Images
arxiv.org · 19h
Machine Learning
Cross-Corpus Validation of Speech Emotion Recognition in Urdu using Domain-Knowledge Acoustic Features
arxiv.org · 1d
Audio ML
"Less is More": Reducing Cognitive Load and Task Drift in Real-Time Multimodal Assistive Agents for the Visually Impaired
arxiv.org · 19h
Tactile Computing
Hyper Hawkes Processes: Interpretable Models of Marked Temporal Point Processes
arxiv.org · 19h
Vector Forensics
FreeSliders: Training-Free, Modality-Agnostic Concept Sliders for Fine-Grained Diffusion Control in Images, Audio, and Video
arxiv.org · 19h
Learned Metrics
Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries
arxiv.org · 19h
Proof Tactics
MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts
arxiv.org · 19h
Format Verification
Wave-Particle (Continuous-Discrete) Dualistic Visual Tokenization for Unified Understanding and Generation
arxiv.org · 19h
Advanced OCR
Generative human motion mimicking through feature extraction in denoising diffusion settings
arxiv.org · 19h
Learned Metrics
Loquetier: A Virtualized Multi-LoRA Framework for Unified LLM Fine-tuning and Serving
arxiv.org · 19h
Local LLMs
ParaScopes: What do Language Models Activations Encode About Future Text?
arxiv.org · 19h
Monadic Parsing