🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🌐 Distributed LLM Systems

Load Balancing, Cluster Management, Fault Tolerance, Scaling Strategies

Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey
arxiv.org·3d
🧠Large Language Models (LLMs)
PointGauss: Point Cloud-Guided Multi-Object Segmentation for Gaussian Splatting
arxiv.org·7h
🔢Quantization of LLMs
TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models
arxiv.org·7h
🧠Large Language Models (LLMs)
Cell-Free Massive MIMO SWIPT with Beyond Diagonal Reconfigurable Intelligent Surfaces
arxiv.org·3d
⚙️AI Infrastructure Automation
RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
arxiv.org·4d
🧠Large Language Models (LLMs)
Composable Effect Handling for Programming LLM-integrated Scripts
arxiv.org·5d
💬Prompt optimizations for LLM serving
RecUserSim: A Realistic and Diverse User Simulator for Evaluating Conversational Recommender Systems
arxiv.org·3d
🧠Large Language Models (LLMs)
Ultra-Low-Latency Edge Inference for Distributed Sensing
arxiv.org·6d
📊AI Performance Profiling
Knowledge-Guided Memetic Algorithm for Capacitated Arc Routing Problems with Time-Dependent Service Costs
arxiv.org·5d
🔧Systems-level optimizations for LLM serving
SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation
arxiv.org·7h
🔍Retrieval-augmented generation
Can LLMs Reason About Trust?: A Pilot Study
arxiv.org·5d
🧠Large Language Models (LLMs)
The Prosody of Emojis
arxiv.org·7h
🧠Large Language Models (LLMs)
Good Learners Think Their Thinking: Generative PRM Makes Large Reasoning Model More Efficient Math Learner
arxiv.org·3d
🧠Large Language Models (LLMs)
Exploring the Feasibility of Deep Learning Techniques for Accurate Gender Classification from Eye Images
arxiv.org·7h
🔍Retrieval-augmented generation
RedCoder: Automated Multi-Turn Red Teaming for Code LLMs
arxiv.org·4d
⚙️AI Infrastructure Automation
Towards Cognitive Synergy in LLM-Based Multi-Agent Systems: Integrating Theory of Mind and Critical Evaluation
arxiv.org·5d
🤖Agents using LLMs
TRIDENT: Benchmarking LLM Safety in Finance, Medicine, and Law
arxiv.org·5d
🚀LLM serving frameworks
Interpretable Anomaly-Based DDoS Detection in AI-RAN with XAI and LLMs
arxiv.org·5d
🧠Large Language Models (LLMs)
An Algorithm-to-Contract Framework without Demand Queries
arxiv.org·6d
💬Prompt optimizations for LLM serving
Efficient handover based on Near-field and Far-field RIS for seamless connectivity
arxiv.org·4d
⚙️AI Infrastructure Automation
Loading...Loading more...
AboutBlogChangelogRoadmap