R²D²: Perception-Guided Task & Motion Planning for Long-Horizon Manipulation
developer.nvidia.com·1d
🔺Geometric Learning
Flag this post
Augmenting learning in neuro-embodied systems through neurobiological first principles
arxiv.org·15h
🔍Grad-CAM
Flag this post
A Multi-tiered Human-in-the-loop Approach for Interactive School Mapping Using Earth Observation and Machine Learning
arxiv.org·1d
🛰️Satellite Imagery
Flag this post
HyFormer-Net: A Synergistic CNN-Transformer with Interpretable Multi-Scale Fusion for Breast Lesion Segmentation and Classification in Ultrasound Images
arxiv.org·15h
👁️Vision Transformers
Flag this post
Explore More, Learn Better: Parallel MLLM Embeddings under Mutual Information Minimization
arxiv.org·15h
🤖Machine learning
Flag this post
Design of quasi phase matching crystal based on differential gray wolf algorithm
arxiv.org·15h
🔢NumPy
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·1d
🧠OpenAI
Flag this post
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org·15h
🧠OpenAI
Flag this post
FairAIED: Navigating Fairness, Bias, and Ethics in Educational AI Applications
arxiv.org·15h
🧠OpenAI
Flag this post
Discriminately Treating Motion Components Evolves Joint Depth and Ego-Motion Learning
arxiv.org·15h
🔺Geometric Learning
Flag this post
Benchmarking individual tree segmentation using multispectral airborne laser scanning data: the FGI-EMIT dataset
arxiv.org·15h
☁️Point Cloud Processing
Flag this post
Multi-refined Feature Enhanced Sentiment Analysis Using Contextual Instruction
arxiv.org·15h
🧠OpenAI
Flag this post
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
arxiv.org·15h
🔍Grad-CAM
Flag this post
Thought-For-Food: Reasoning Chain Induced Food Visual Question Answering
arxiv.org·15h
🧠OpenAI
Flag this post
Loading...Loading more...