Minimal Prompt Induction of Self-Talk in Base LLMs
lesswrong.com·17h
Flag this post

Published on October 15, 2025 1:15 AM GMT

Note: This is my first LessWrong post. I’m sharing initial observations of a small empirical study on open-source LLM behavior. These observations concern linguistic dynamics rather than literal agency, and I welcome replication, critique, and other pointers around this kind of research.

These are empirical notes on basal language dynamics, attractors, and how we might induce early goal-seek language patterns in base models as opposed to instruction-tuned model outputs.

Summary

Across ~20 iterations per condition, the base model produced no structured output under empty or single-token prompts, with structured role-based language appearing consistently only after minimal instruction priming. Future anal…

Similar Posts

Loading similar posts...