🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🔧 Systems-level optimizations for LLM serving
Beyond Manually Designed Pruning Policies with Second-Level Performance Prediction: A Pruning Framework for LLMs
arxiv.org·18h
🧠Large Language Models (LLMs)
Optimal Scheduling Algorithms for LLM Inference: Theory and Practice
arxiv.org·18h
🧠Large Language Models (LLMs)
NVIDIA vGPU 19.0 Enables Graphics and AI Virtualization on NVIDIA Blackwell GPUs
developer.nvidia.com·22h
📊AI Performance Profiling
DBAIOps: A Reasoning LLM-Enhanced Database Operation and Maintenance System using Knowledge Graphs
arxiv.org·18h
🧠Large Language Models (LLMs)
Autonomous Penetration Testing: Solving Capture-the-Flag Challenges with LLMs
arxiv.org·18h
💬Prompt optimizations for LLM serving
ProCut: LLM Prompt Compression via Attribution Estimation
arxiv.org·18h
💬Prompt optimizations for LLM serving
AlignGuard-LoRA: Alignment-Preserving Fine-Tuning via Fisher-Guided Decomposition and Riemannian-Geodesic Collision Regularization
arxiv.org·18h
🧠Large Language Models (LLMs)
MOPrompt: Multi-objective Semantic Evolution for Prompt Optimization
arxiv.org·18h
💬Prompt optimizations for LLM serving
CAPO: Towards Enhancing LLM Reasoning through Verifiable Generative Credit Assignment
arxiv.org·18h
🧠Large Language Models (LLMs)
Machine Learning Pipeline for Software Engineering: A Systematic Literature Review
arxiv.org·1d
⚙️AI Infrastructure Automation
L3M+P: Lifelong Planning with Large Language Models
arxiv.org·18h
🧠Large Language Models (LLMs)
Refine-n-Judge: Curating High-Quality Preference Chains for LLM-Fine-Tuning
arxiv.org·18h
🧠Large Language Models (LLMs)
LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points
arxiv.org·18h
🧠Large Language Models (LLMs)
A Methodological Framework for LLM-Based Mining of Software Repositories
arxiv.org·18h
🧠Large Language Models (LLMs)
Improving performance of content-centric networks via decentralized coded caching for multi-level popularity and access
arxiv.org·18h
⚙️AI Infrastructure Automation
Path-LLM: A Shortest-Path-based LLM Learning for Unified Graph Representation
arxiv.org·18h
🧠Large Language Models (LLMs)
AutoML-Med: A Framework for Automated Machine Learning in Medical Tabular Data
arxiv.org·18h
🧠Large Language Models (LLMs)
TRACEALIGN -- Tracing the Drift: Attributing Alignment Failures to Training-Time Belief Sources in LLMs
arxiv.org·18h
🧠Large Language Models (LLMs)
TriP-LLM: A Tri-Branch Patch-wise Large Language Model Framework for Time-Series Anomaly Detection
arxiv.org·1d
🧠Large Language Models (LLMs)
KCR: Resolving Long-Context Knowledge Conflicts via Reasoning in LLMs
arxiv.org·18h
🧠Large Language Models (LLMs)
Loading...Loading more...
AboutBlogChangelogRoadmap