Research

Papers on secure AI systems, agents, and learning intelligence.

System and Method for Provenance-Chained Retrieval-Augmented Generation with Cryptographic Integrity Verification

A method and system for retrieval-augmented generation in which every retrieved chunk carries a cryptographically verifiable chain of provenance back to its source event, allowing …

Patent Pending · 202641035477

Authorization-First Retrieval: Enforcing Least Privilege in Multi-Agent RAG Systems

Authorization-first retrieval enforces least privilege before semantic retrieval, ensuring that documents never enter a multi-agent RAG prompt unless the requesting user or agent i…

Accepted PaperACL TrustNLP 2026SAGAI 2026 · IEEE S&P colocated

Ghost Context: Cross-Context Hallucination in Long-Context Language Models

Ghost context describes cross-context hallucination in long-context language models, where information from earlier turns or neighboring documents bleeds into answers for unrelated…

Accepted PaperACL TrustNLP 2026

Genre Lock-In in Autonomous Language Agents: When Authority Framing Overrides Epistemic Correctness

Genre Lock-In is a failure mode where autonomous agents infer an interaction genre from authority-framed prompts and prioritize genre coherence over epistemic correctness.…

Preprint · Under review at ICLR 2026

SoK: Authorization in Multi-Agent Retrieval-Augmented Generation Systems

A systematization of authorization failure modes and mitigation families in agentic RAG, organized around the correctness property of Authorization-First Retrieval.…

SoK · Under review at USENIX Security 2026

Position: Authorization Must Be an Architectural Primitive in Multi-Agent RAG Systems

A position paper arguing that authorization must be treated as an architectural primitive in multi-agent RAG, enforced before sensitive data reaches retrieval or generation.…

Position Paper · Under review at ICML 2026

Measuring Teaching Efficacy Through Comprehension Delta: Large Language Models as Standardized Learner Proxies

We propose using language models as standardized learner proxies to measure teaching efficacy along a 'comprehension delta', the change in a calibrated proxy's understanding before…

Submitted · Teaching in Higher Education

← Back to home