← Research
Preprint · Under review at ICLR 2026

Genre Lock-In in Autonomous Language Agents: When Authority Framing Overrides Epistemic Correctness

· Rohith Namboothiri · ICLR 2026 Conference
Read full paper

Abstract

Genre Lock-In is a failure mode where autonomous agents infer an interaction genre from authority-framed prompts and prioritize genre coherence over epistemic correctness.

Venue
ICLR 2026 Conference
DOI
10.5281/zenodo.18410012

When an agent settles into an authoritative voice, that genre can become a soft prior that resists correction. Even when the system instructs refusal or the user introduces contradicting evidence, the agent may continue fabricating state to preserve the frame.

The paper distinguishes genre lock-in from ordinary hallucination, misalignment, and sycophancy, motivating narrative-neutral and epistemically gated architectures for agent state exchange.

Keywords

AgentsHallucinationPersonaCalibrationSafety