My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·1d·
Discuss: Hacker News
🎯GPU Kernels
Hyper Hawkes Processes: Interpretable Models of Marked Temporal Point Processes
arxiv.org·3h
🔄ONNX
End-to-End Framework Integrating Generative AI and Deep Reinforcement Learning for Autonomous Ultrasound Scanning
arxiv.org·3h
🧮cuDNN
Weakly Supervised Concept Learning with Class-Level Priors for Interpretable Medical Diagnosis
arxiv.org·3h
📉Model Quantization
CueBench: Advancing Unified Understanding of Context-Aware Video Anomalies in Real-World
arxiv.org·3h
🧮cuDNN
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
arxiv.org·3h
📊Gradient Accumulation
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
arxiv.org·3h
⚡ONNX Runtime
MoSa: Motion Generation with Scalable Autoregressive Modeling
arxiv.org·3h
📊Gradient Accumulation
Loquetier: A Virtualized Multi-LoRA Framework for Unified LLM Fine-tuning and Serving
arxiv.org·3h
⚡ONNX Runtime
Automated Personalized Chemotherapy Optimization via Multi-Modal Data Fusion & Reinforcement Learning
dev.to·1d·
Discuss: DEV
⚡ONNX Runtime
NOMAD - Navigating Optimal Model Application to Datastreams
arxiv.org·3h
⚡ONNX Runtime
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
arxiv.org·3h
🧮cuDNN
Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries
arxiv.org·3h
⚡ONNX Runtime
Active transfer learning for structural health monitoring
arxiv.org·1d
🎓Model Distillation
When to Trust the Answer: Question-Aligned Semantic Nearest Neighbor Entropy for Safer Surgical VQA
arxiv.org·3h
📉Model Quantization