🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🌐 Distributed LLM Systems

Load Balancing, Cluster Management, Fault Tolerance, Scaling Strategies

Knowledge-Guided Memetic Algorithm for Capacitated Arc Routing Problems with Time-Dependent Service Costs
arxiv.org·4d
🔧Systems-level optimizations for LLM serving
Can LLMs Reason About Trust?: A Pilot Study
arxiv.org·4d
🧠Large Language Models (LLMs)
Good Learners Think Their Thinking: Generative PRM Makes Large Reasoning Model More Efficient Math Learner
arxiv.org·2d
🧠Large Language Models (LLMs)
Automated Catamorphism Synthesis for Solving Constrained Horn Clauses over Algebraic Data Types
arxiv.org·5d
✨Model optimizations in LLMs
RedCoder: Automated Multi-Turn Red Teaming for Code LLMs
arxiv.org·3d
⚙️AI Infrastructure Automation
Towards Cognitive Synergy in LLM-Based Multi-Agent Systems: Integrating Theory of Mind and Critical Evaluation
arxiv.org·4d
🤖Agents using LLMs
TRIDENT: Benchmarking LLM Safety in Finance, Medicine, and Law
arxiv.org·4d
🚀LLM serving frameworks
Adversarial-Guided Diffusion for Multimodal LLM Attacks
arxiv.org·2d
🧠Large Language Models (LLMs)
Interpretable Anomaly-Based DDoS Detection in AI-RAN with XAI and LLMs
arxiv.org·4d
🧠Large Language Models (LLMs)
An Algorithm-to-Contract Framework without Demand Queries
arxiv.org·5d
💬Prompt optimizations for LLM serving
Efficient handover based on Near-field and Far-field RIS for seamless connectivity
arxiv.org·3d
⚙️AI Infrastructure Automation
Efficient Neural Combinatorial Optimization Solver for the Min-max Heterogeneous Capacitated Vehicle Routing Problem
arxiv.org·4d
⚙️AI Infrastructure Automation
StaffPro: an LLM Agent for Joint Staffing and Profiling
arxiv.org·4d
🤖Agents using LLMs
Quantize Once, Train Fast: Allreduce-Compatible Compression with Provable Guarantees
arxiv.org·4d
🔢Quantization of LLMs
Exploring LLM-generated Culture-specific Affective Human-Robot Tactile Interaction
arxiv.org·2d
🧠Large Language Models (LLMs)
From Propagator to Oscillator: The Dual Role of Symmetric Differential Equations in Neural Systems
arxiv.org·2d
⚡Real-time AI Systems
A Scalable Pipeline for Estimating Verb Frame Frequencies Using Large Language Models
arxiv.org·3d
🧠Large Language Models (LLMs)
Deep Learning-based Prediction of Clinical Trial Enrollment with Uncertainty Estimates
arxiv.org·2d
🧠Large Language Models (LLMs)
Assessing Value of Renewable-based VPP Versus Electrical Storage: Multi-market Participation Under Different Scheduling Regimes and Uncertainties
arxiv.org·3d
✨Model optimizations in LLMs
AI paradigm for solving differential equations: first-principles data generation and scale-dilation operator AI solver
arxiv.org·2d
📊AI Performance Profiling
Loading...Loading more...
AboutBlogChangelogRoadmap