Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning
arxiv.org·6h
📝Concrete Syntax
Preview
Report Post

View PDF HTML (experimental)

Abstract:Recent advances in large language models (LLMs) show impressive performance in open-ended story generation, but fine-grained stylistic control remains limited. Existing methods often rely on shallow cues (e.g., names or topics) to simulate authorial style, without robust evaluation. In this work, we present a training framework for style-conditioned story generation using Group Relative Policy Optimization (GRPO) and a custom multi-reward setup. The style reward is derived from a fine-tuned sentence transformer using authorship verification (AV) signals, combined with content and completeness scores to stabilize long-form narrative generation. We conduct exper…

Similar Posts

Loading similar posts...