Preprint · Under review at ICLR 2026
Genre Lock-In in Autonomous Language Agents: When Authority Framing Overrides Epistemic Correctness
Rohith Namboothiri · ICLR 2026 Conference
Abstract
Genre Lock-In is a failure mode in which autonomous agents infer an interaction genre from authority-framed prompts and prioritize genre coherence over epistemic correctness.
When an agent settles into an authoritative voice, that genre can become a soft prior that resists correction. Even when the system instructs refusal or the user introduces contradicting evidence, the agent may continue fabricating state to preserve the frame.
The paper distinguishes genre lock-in from ordinary hallucination, misalignment, and sycophancy, motivating narrative-neutral and epistemically gated architectures for agent state exchange.
Keywords
Agents · Hallucination · Persona · Calibration · Safety