Towards Sub-millisecond Latency and Guaranteed Bit Rates in 5G User Plane
arxiv.org·1d
🔧Systems-level optimizations for LLM serving
Multimodal Detection of Fake Reviews using BERT and ResNet-50
arxiv.org·1d
🧠Large Language Models (LLMs)
UnifiedFL: A Dynamic Unified Learning Framework for Equitable Federation
arxiv.org·5d
🧠Large Language Models (LLMs)
End-to-End Framework Integrating Generative AI and Deep Reinforcement Learning for Autonomous Ultrasound Scanning
arxiv.org·1d
⚡Real-time AI Systems
DynBERG: Dynamic BERT-based Graph neural network for financial fraud detection
arxiv.org·1d
🧠Large Language Models (LLMs)
Empowering RepoQA-Agent based on Reinforcement Learning Driven by Monte-carlo Tree Search
arxiv.org·5d
💬Prompt optimizations for LLM serving
OceanAI: A Conversational Platform for Accurate, Transparent, Near-Real-Time Oceanographic Insights
arxiv.org·1d
⚙️AI Infrastructure Automation
How to Predict Biomolecular Structures Using the OpenFold3 NIM
developer.nvidia.com·1d
🔧Systems-level optimizations for LLM serving
Online Energy Storage Arbitrage under Imperfect Predictions: A Conformal Risk-Aware Approach
arxiv.org·1d
✨Model optimizations in LLMs
Contrastive Knowledge Transfer and Robust Optimization for Secure Alignment of Large Language Models
arxiv.org·2d
🧠Large Language Models (LLMs)
Pelican-VL 1.0: A Foundation Brain Model for Embodied Intelligence
arxiv.org·1d
🧠Large Language Models (LLMs)
Saliency-Guided Domain Adaptation for Left-Hand Driving in Autonomous Steering
arxiv.org·1d
🧠Large Language Models (LLMs)
High Resolution Seismic Waveform Generation using Denoising Diffusion
arxiv.org·1d
🔢Quantization of LLMs
What is the Return on Investment of Digital Engineering for Complex Systems Development? Findings from a Mixed-Methods Study on the Post-production Design Chang...
arxiv.org·1d
⚙️AI Infrastructure Automation
FLoRA: Fused forward-backward adapters for parameter efficient fine-tuning and reducing inference-time latencies of LLMs
arxiv.org·1d
🧠Large Language Models (LLMs)
The Curvature Rate λ: A Scalar Measure of Input-Space Sharpness in Neural Networks
arxiv.org·1d
🧠Large Language Models (LLMs)
SciTextures: Collecting and Connecting Visual Patterns, Models, and Code Across Science and Art
arxiv.org·1d
🔍Retrieval-augmented generation
Deep Learning-Accelerated Shapley Value for Fair Allocation in Power Systems: The Case of Carbon Emission Responsibility
arxiv.org·1d
⚙️AI Infrastructure Automation
Melanoma Classification Through Deep Ensemble Learning and Explainable AI
arxiv.org·1d
⚡Real-time AI Systems
Can SAEs reveal and mitigate racial biases of LLMs in healthcare?
arxiv.org·1d
🧠Large Language Models (LLMs)