Large Language Models (LLMs)

TrustMargin: Training-Free Arbitration between Parametric Memory and Retrieved Evidence in Large Language Models

 🔍Retrieval-augmented generation  Content type: Academic
arxiv.org·

When Roleplaying, Do Models Believe What They Say?

 Model optimizations in LLMs  Content type: Academic
arxiv.org·

Customer-Agent: Overcoming Context Limitations in Ultra-Long Shopping Trajectories via Tool-Augmented Agents and RLVR

 🤖Agents using LLMs  Content type: Academic
arxiv.org·

GuardNet: Ensemble Strategies of Shallow Neural Networks for Robust Prompt Injection and Jailbreak Detection

 📊AI Performance Profiling  Content type: Academic
arxiv.org·

A Unifying Lens on Reward Uncertainty in RLHF

 Model optimizations in LLMs  Content type: Academic
arxiv.org·

Quantifying Subliminal Behavioral Transfer Ratios in Language Model Distillation

 Model optimizations in LLMs  Content type: Academic
arxiv.org·

Does Topic Sentiment Cause Perceived Ideology? Comparing Human and LLM Annotations in Political News Articles

 Model optimizations in LLMs  Content type: Academic
arxiv.org·

TICoder: A Repository-Level Code Generation Framework with Test-Driven Planning and Implementation-Aware Reuse

 🔍Retrieval-augmented generation  Content type: Academic
arxiv.org·

What Do People Actually Want From AI? Mapping Preference Plurality

 📊AI Performance Profiling  Content type: Academic
arxiv.org·

nD-RoPE: A Generalized RoPE for n-Dimensional Position Embedding

 🔢Quantization of LLMs  Content type: Academic
arxiv.org·

End-to-End Context Compression at Scale

 🔧Systems-level optimizations for LLM serving  Content type: Academic
arxiv.org·

When Vision Misleads, Let Location Speak: A Worldwide Image Geo-Localization Method via Location Attention Mechanism and Large Multimodal Models

 🔍Retrieval-augmented generation  Content type: Academic
arxiv.org·

Synthetic Contrastive Reasoning for Multi-Table Q&A

 🔍Retrieval-augmented generation  Content type: Academic
arxiv.org·

What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems

 🤖Agents using LLMs  Content type: Academic
arxiv.org·

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

 🔧Systems-level optimizations for LLM serving  Content type: Academic
arxiv.org·

SpectrumKV: Per-Token Mixed-Precision KV Cache Transfer for Prefill-Decode Disaggregated LLM Serving

 🔧Systems-level optimizations for LLM serving  Content type: Academic
arxiv.org·

From Self to Other: Evaluating Demographic Perspective-Taking in LLM Hate Speech Annotation

 Model optimizations in LLMs  Content type: Academic
arxiv.org·

GRPO Does Not Close the Multi-Agent Coordination Gap

 🤖Agents using LLMs  Content type: Academic
arxiv.org·

Evidence Graph Consistency in Retrieval-Augmented Generation: A Model-Dependent Analysis of Hallucination Detection

 🔍Retrieval-augmented generation  Content type: Academic
arxiv.org·

Time-Series Foundation Model Embeddings for Remaining Useful Life Estimation

 Real-time AI Systems  Content type: Academic
arxiv.org·
