Preprint · Under review at ICLR 2026

Genre Lock-In in Autonomous Language Agents: When Authority Framing Overrides Epistemic Correctness

2026-01-29 · Rohith Namboothiri · ICLR 2026 Conference

Abstract

Genre Lock-In is a failure mode in which autonomous agents infer an interaction genre from authority-framed prompts and then prioritize genre coherence over epistemic correctness.

Venue

ICLR 2026 Conference

DOI

10.5281/zenodo.18410012

When an agent settles into an authoritative voice, that genre can become a soft prior that resists correction. Even when the system instructs refusal or the user introduces contradicting evidence, the agent may continue fabricating state to preserve the frame.

The paper distinguishes genre lock-in from ordinary hallucination, misalignment, and sycophancy, motivating narrative-neutral and epistemically gated architectures for agent state exchange.
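
As a rough illustration of what an epistemically gated, narrative-neutral state exchange might look like, the sketch below passes agent state as structured claims rather than free prose and forwards only claims backed by independently verified evidence. This is not the paper's architecture. The names `StateClaim`, `gate`, and `verified_evidence`, as well as the example claims, are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class StateClaim:
    """A single structured assertion about agent state (hypothetical type)."""
    key: str
    value: str
    # Identifier of supporting evidence, e.g. a tool-call ID; None means unverified.
    evidence: Optional[str] = None


def gate(claims, verified_evidence):
    """Forward only claims whose evidence ID is in the verified set.

    Returns (accepted, withheld): accepted maps key -> value, withheld lists
    the keys of claims that lacked verifiable evidence.
    """
    accepted, withheld = {}, []
    for claim in claims:
        if claim.evidence is not None and claim.evidence in verified_evidence:
            accepted[claim.key] = claim.value
        else:
            withheld.append(claim.key)
    return accepted, withheld


# Hypothetical example: one claim is grounded in a tool call, one is not.
claims = [
    StateClaim("deploy_status", "succeeded", evidence="tool-call-17"),
    StateClaim("rollback_plan", "approved by the director"),
]
accepted, withheld = gate(claims, verified_evidence={"tool-call-17"})
print(accepted)
print(withheld)
```

In this sketch the authoritative-sounding but unsupported claim is withheld rather than passed along, so a downstream agent receives only state that was checked against evidence and has no narrative frame to continue.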

Keywords

Agents, Hallucination, Persona, Calibration, Safety
