Computer Science > Computation and Language
arXiv:2601.00797 (cs)
Abstract: A central challenge in social science is to generate rich qualitative hypotheses about how diverse social groups might interpret new information. This article introduces and illustrates a novel methodological approach for this purpose: sociological persona simulation using Large Language Models (LLMs), which we frame as a "qualitative laboratory". We argue that for this specific task, persona simulation offers a distinct advantage over established methods. By generating naturalistic discourse, it overcomes the lack of discursive depth common in vignette surveys, and by operationalizing complex worldviews through natural language, it bypasses the formalization bottleneck of rule-based agent-based models (ABMs). To demonstrate this potential, we present a protocol where personas derived from a sociological theory of climate reception react to policy messages. The simulation produced nuanced and counter-intuitive hypotheses (such as a conservative persona’s rejection of a national security frame) that challenge theoretical assumptions. We conclude that this method, used as part of a "simulation then validation" workflow, represents a superior tool for generating deeply textured hypotheses for subsequent empirical testing.
| Comments: | 26 pages, 3 tables. Manuscript submitted for peer-reviewed journal publication |
| Subjects: | Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Multiagent Systems (cs.MA) |
| ACM classes: | J.4; I.6.8; I.2.7 |
| Cite as: | arXiv:2601.00797 [cs.CL] (or arXiv:2601.00797v1 [cs.CL] for this version) |
| DOI: | https://doi.org/10.48550/arXiv.2601.00797 (arXiv-issued DOI via DataCite) |
Submission history
From: Hugues Draelants [v1] Tue, 25 Nov 2025 08:31:48 UTC (555 KB)