Prompt Engineering

Feeds to Scour
SubscribedAll
Scoured 95 posts in 10.5 ms

Evaluating Advanced Prompting on Gemini Flash for Multi-Hop Biomedical QA

 🧠LLM  Content type: Academic
arxiv.org·

Would an LLM tell you if it’s gaming your eval? Often, no. But we can still catch the model thinking about it.

 🧠LLM
threadreaderapp.com·

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

 💬LLMs
pub.towardsai.net
·

Meet Hades: The malware that lies to AI security agents

 🔐InfoSec  Content type: News

A wild idea: Abstract reality using ontology

 🕸️Knowledge Graphs  Content type: Discussion

ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models

 💬NLP  Content type: Academic
arxiv.org·

LangChain Explained: Understanding Models, Prompts, Chains, Memory, Indexes, and Agents

 🤖Large Language Models
pub.towardsai.net
·

LLM-Based Code Documentation Generation and Multi-Judge Evaluation

 🤖Large Language Models  Content type: Academic
arxiv.org·

The Silent Killer of LLM Accuracy: Why Forcing Direct JSON Outputs is Costing You Precision

 🤖Large Language Models
pub.towardsai.net
·

TVI-CoT: Text-Visual Interleaved Chain-of-Thought Reasoning for Multimodal Understanding

 🧠LLM  Content type: Academic
arxiv.org·

Dropout-GRPO: Variational Stochasticity for Continuous Latent Reasoning

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

What Actually Happens When You Send a Prompt to Claude A Full Breakdown

 💬LLMs
pub.towardsai.net
·

Operationalizing Linguistic Methods through Prompt-Engineering Skills: An Automatic Chinese Web Neologism Detection Pipeline

 🤖Large Language Models  Content type: Academic
arxiv.org·

Automatic Extraction of Structured Information from Brain MRI Reports Using an Open-Weight Large Language Model

 💬NLP  Content type: Academic
arxiv.org·

Beyond Retrieval: Learning Compact User Representations for Scalable LLM Personalization

 🧠LLM  Content type: Academic
arxiv.org·

When LLMs Invent Rust Crates: An Empirical Study of Hallucination Patterns and Mitigation

 🤖Large Language Models  Content type: Academic
arxiv.org·

Mutation Without Variation: Convergence Dynamics in LLM-Driven Program Evolution

 🧠LLM  Content type: Academic
arxiv.org·

UrduMMLU: A Massive Multitask Benchmark for Urdu Language Understanding

 🧠LLM  Content type: Academic
arxiv.org·

Tight Sample Complexity of Transformers

 💬LLMs  Content type: Academic
arxiv.org·

A Komi-Yazva--Russian Parallel Corpus and Evaluation Protocol for Zero- and Few-Shot LLM Translation

 🧠LLM  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help