🧠 LLMs
large language models, GPT, Claude, foundation models
🔗 LLM Orchestration · arxiv.org · 6 days ago
Are LLM-based Chatbots Good Enough to Support Computer Science Students in Multiple-Choice Exercises?
✍️ Prompt Engineering · arxiv.org · 5 days ago
Mind Companion: An Embodied Conversational Agent for Process-Based Psychotherapy
✍️ Prompt Engineering · arxiv.org · 6 days ago
Do LLMs Reliably Identify Correct Information Units in Aphasic Discourse?
🔗 LLM Orchestration · arxiv.org · 4 days ago
CAPRA: Scaling Feedback on Software Architecture Deliverables with a Multi-Agent LLM System
🛠️ MLOps · arxiv.org · 6 days ago
Heteroskedastic Signals in Budgeted LLM Verification: Structural Heterogeneity Limits Optimization Gains
✍️ Prompt Engineering · arxiv.org · 4 days ago
PEC-Home: Interpretation of Progressively Elliptical Commands in Smart Homes
Covered by ai-brief.liziran.com
✍️ Prompt Engineering · arxiv.org · 6 days ago
Not All Skills Help: Measuring and Repairing Agent Knowledge
📚 RAG · arxiv.org · 6 days ago
Encode Errors: Representational Retrieval of In-Context Demonstrations for Multilingual Grammatical Error Correction
🛠️ MLOps · arxiv.org · 5 days ago
Unintended Effects of Geographic Conditioning in Large Language Models
✍️ Prompt Engineering · arxiv.org · 6 days ago
Is Your Agent Playing Dead? Deployed LLM Agents Exhibit Constraint-Evasive Fabrication and Thanatosis
✍️ Prompt Engineering · arxiv.org · 6 days ago
Compositional Reasoning Depth Predicts Clinical AI Failure: Empirical Evidence Consistent with Transformer Compositionality Limits in Electronic Health Record Q...
Covered by 何夕2077的个人站
✍️ Prompt Engineering · arxiv.org · 5 days ago
Structural Role Injection in Handlebars-Templated LLM Prompts: Triple-Brace Interpolation, Delimiter Family, and the Limits of HTML Auto-Escaping
Covered by 何夕2077的个人站
🔗 LLM Orchestration · arxiv.org · 4 days ago
ARIADNE: Agnostic Routing for Inference-time Adapter DyNamic sElection
✍️ Prompt Engineering · arxiv.org · 6 days ago
Sycophancy as Material Failure under Pushback Loading: A Multi-Axis Characterization Across Three Loading Cases and up to Seventeen Material Charges
📚 RAG · arxiv.org · 5 days ago
When AI Says "I have been in similar situations": Synthetic Lived Experience in Peer-Like Caregiver Support
📚 RAG · arxiv.org · 6 days ago
From Refusal Geometry to Safety Geometry: Harmfulness--Refusal Coupling under Dynamic Adversarial Fine-Tuning
🛠️ MLOps · arxiv.org · 6 days ago
Comparing Human Gaze and Vision-Language Model Attention in Safety-Relevant Environments
🛠️ MLOps · arxiv.org · 6 days ago
Binary Tracking for Spatial QA and Navigation with Open Vision-Language Models
✍️ Prompt Engineering · arxiv.org · 6 days ago
Frame-Conditioned Moral Computation in LLaMA 3.1-8B-Instruct: A Mechanistic Interpretability Audit of Ethical Reasoning
🛠️ MLOps · arxiv.org · 6 days ago
The BD-LSC Dataset: Facilitating the Benchmarking of Models for Lexical Semantic Change Detection in Slang and Standard Usage