Research

Papers on secure AI systems, agents, and learning intelligence.

01
System and Method for Provenance-Chained Retrieval-Augmented Generation with Cryptographic Integrity Verification
A method and system for retrieval-augmented generation in which every retrieved chunk carries a cryptographically verifiable chain of provenance back to its source event, allowing
Patent Pending · 202641035477
02
Authorization-First Retrieval: Enforcing Least Privilege in Multi-Agent RAG Systems
Authorization-first retrieval enforces least privilege before semantic retrieval, ensuring that documents never enter a multi-agent RAG prompt unless the requesting user or agent i
Accepted PaperACL TrustNLP 2026SAGAI 2026 · IEEE S&P colocated
03
Ghost Context: Cross-Context Hallucination in Long-Context Language Models
Ghost context describes cross-context hallucination in long-context language models, where information from earlier turns or neighboring documents bleeds into answers for unrelated
Accepted PaperACL TrustNLP 2026
04
Genre Lock-In in Autonomous Language Agents: When Authority Framing Overrides Epistemic Correctness
Genre Lock-In is a failure mode where autonomous agents infer an interaction genre from authority-framed prompts and prioritize genre coherence over epistemic correctness.
Preprint · Under review at ICLR 2026
05
SoK: Authorization in Multi-Agent Retrieval-Augmented Generation Systems
A systematization of authorization failure modes and mitigation families in agentic RAG, organized around the correctness property of Authorization-First Retrieval.
SoK · Under review at USENIX Security 2026
06
Position: Authorization Must Be an Architectural Primitive in Multi-Agent RAG Systems
A position paper arguing that authorization must be treated as an architectural primitive in multi-agent RAG, enforced before sensitive data reaches retrieval or generation.
Position Paper · Under review at ICML 2026
07
Measuring Teaching Efficacy Through Comprehension Delta: Large Language Models as Standardized Learner Proxies
We propose using language models as standardized learner proxies to measure teaching efficacy along a 'comprehension delta', the change in a calibrated proxy's understanding before
Submitted · Teaching in Higher Education