🌐 Distributed LLM Systems - pleto · Scour

🌐 Distributed LLM Systems

Load Balancing, Cluster Management, Fault Tolerance, Scaling Strategies

Cell-Free Massive MIMO SWIPT with Beyond Diagonal Reconfigurable Intelligent Surfaces

arxiv.org·4d

⚙️AI Infrastructure Automation

RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents

arxiv.org·5d

🧠Large Language Models (LLMs)

SpatioTemporal Difference Network for Video Depth Super-Resolution

arxiv.org·21h

⚡Real-time AI Systems

Composable Effect Handling for Programming LLM-integrated Scripts

arxiv.org·6d

💬Prompt optimizations for LLM serving

RecUserSim: A Realistic and Diverse User Simulator for Evaluating Conversational Recommender Systems

arxiv.org·4d

🧠Large Language Models (LLMs)

Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey

arxiv.org·4d

🧠Large Language Models (LLMs)

A Mixed User-Centered Approach to Enable Augmented Intelligence in Intelligent Tutoring Systems: The Case of MathAIde app

arxiv.org·1d

🔍Retrieval-augmented generation

Knowledge-Guided Memetic Algorithm for Capacitated Arc Routing Problems with Time-Dependent Service Costs

arxiv.org·6d

🔧Systems-level optimizations for LLM serving

Can LLMs Reason About Trust?: A Pilot Study

arxiv.org·6d

🧠Large Language Models (LLMs)

Good Learners Think Their Thinking: Generative PRM Makes Large Reasoning Model More Efficient Math Learner

arxiv.org·4d

🧠Large Language Models (LLMs)

SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation

arxiv.org·1d

🔍Retrieval-augmented generation

Rethinking Multimodality: Optimizing Multimodal Deep Learning for Biomedical Signal Classification

arxiv.org·21h

⚡Real-time AI Systems

RedCoder: Automated Multi-Turn Red Teaming for Code LLMs

arxiv.org·5d

⚙️AI Infrastructure Automation

TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models

arxiv.org·1d

🧠Large Language Models (LLMs)

SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation

arxiv.org·1d

🔍Retrieval-augmented generation

TRIDENT: Benchmarking LLM Safety in Finance, Medicine, and Law

arxiv.org·6d

🚀LLM serving frameworks

Interpretable Anomaly-Based DDoS Detection in AI-RAN with XAI and LLMs

arxiv.org·6d

🧠Large Language Models (LLMs)

Efficient handover based on Near-field and Far-field RIS for seamless connectivity

arxiv.org·5d

⚙️AI Infrastructure Automation

Efficient Neural Combinatorial Optimization Solver for the Min-max Heterogeneous Capacitated Vehicle Routing Problem

arxiv.org·6d

⚙️AI Infrastructure Automation

The Repeated-Stimulus Confound in Electroencephalography

arxiv.org·1d

🔍Retrieval-augmented generation

Loading more...