Make Sense of Video Analytics by Integrating NVIDIA AI Blueprints
developer.nvidia.com·3d
Real-time AI Systems
Flag this post
CoT-Saliency: Unified Chain-of-Thought Reasoning for Heterogeneous Saliency Tasks
arxiv.org·2d
🧠Large Language Models (LLMs)
Flag this post
Eyes on Target: Gaze-Aware Object Detection in Egocentric Video
arxiv.org·2d
Real-time AI Systems
Flag this post
Fleming-VL: Towards Universal Medical Visual Reasoning with Multimodal LLMs
arxiv.org·2d
🧠Large Language Models (LLMs)
Flag this post
VISTA Score: Verification In Sequential Turn-based Assessment
arxiv.org·3d
🧠Large Language Models (LLMs)
Flag this post
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
arxiv.org·3d
🧠Large Language Models (LLMs)
Flag this post
Region-Aware Reconstruction Strategy for Pre-training fMRI Foundation Model
arxiv.org·2d
🔍Retrieval-augmented generation
Flag this post
Thought-For-Food: Reasoning Chain Induced Food Visual Question Answering
arxiv.org·2d
🧠Large Language Models (LLMs)
Flag this post
Adding New Capability in Existing Scientific Application with LLM Assistance
arxiv.org·2d
🧠Large Language Models (LLMs)
Flag this post
Condition-Invariant fMRI Decoding of Speech Intelligibility with Deep State Space Model
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Coverage Analysis and Optimization of FIRES-Assisted NOMA and OMA Systems
arxiv.org·2d
🔧Systems-level optimizations for LLM serving
Flag this post
A Comparative Analysis of LLM Adaptation: SFT, LoRA, and ICL in Data-Scarce Scenarios
arxiv.org·2d
🧠Large Language Models (LLMs)
Flag this post