Prompt Engineering

Feeds to Scour
SubscribedAll
Scoured 2977 posts in 11.5 ms

AuRA: Internalizing Audio Understanding into LLMs as LoRA

 🧠LLMs  Content type: Academic
arxiv.org·

When LLMs Invent Rust Crates: An Empirical Study of Hallucination Patterns and Mitigation

 Effect Systems  Content type: Academic
arxiv.org·

Benchmarking and Exploring the Capabilities of LLMs for Attack Investigations

 📏Model Evaluation  Content type: Academic
arxiv.org·

Time Series as Language: A Universal Tokenizer for General-Purpose Time Series Foundation Models

 🧠LLMs  Content type: Academic
arxiv.org·

Automatic Extraction of Structured Information from Brain MRI Reports Using an Open-Weight Large Language Model

 📊ML Research  Content type: Academic
arxiv.org·

IDP-Bench: Benchmarking ability of LLMs to protect personal information in interdependent privacy contexts

 TLA+  Content type: Academic
arxiv.org·

Distilling Safe LLM Systems via Soft Prompts for On Device Settings

 TLA+  Content type: Academic
arxiv.org·

"I understand your perspective": LLM Persuasion and Sycophancy through the Lens of Communicative Action Theory

 🤖LLM Agents  Content type: Academic
arxiv.org·

Are Large Language Models Suitable for Graph Computation? Progress and Prospects

 🧠LLMs  Content type: Academic
arxiv.org·

SePO: Self-Evolving Prompt Agent for System Prompt Optimization

 💻AI Coding  Content type: Academic
arxiv.org·

Defending Jailbreak Attacks on Large Language Models via Manifold Trajectory Kinetics

 TLA+  Content type: Academic
arxiv.org·

Detecting Differences Is Not Understanding Structure: Large Language Models Fail at Graph Isomorphism

 🧠LLMs  Content type: Academic
arxiv.org·

A Komi-Yazva--Russian Parallel Corpus and Evaluation Protocol for Zero- and Few-Shot LLM Translation

 🧠LLMs  Content type: Academic
arxiv.org·

Elmes*: Automated Construction of Fine-Grained Evaluation Rubrics for Large Language Models in Long-Tail Educational Scenarios

 🔄Agentic Workflows  Content type: Academic
arxiv.org·

Phun-Bench: Evaluating LLMs on Phonological Understanding in Chinese

 🧠LLMs  Content type: Academic
arxiv.org·

Cross-LLM Consistency in Inference: Evidence from Shared Interactions

 🧠LLMs  Content type: Academic
arxiv.org·

Caliper: Probing Lexical Anchors versus Causal Structure in LLMs

 🧠LLMs  Content type: Academic
arxiv.org·

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

 📏Model Evaluation  Content type: Academic
arxiv.org·

IR3DE: A Linear Router for Large Language Models

 📊ML Research  Content type: Academic
arxiv.org·

QBugLM: An Agentic Benchmarking Framework for LLM-based Quantum Software Debugging

 🔄Agentic Workflows  Content type: Academic
arxiv.org·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help