← Back to Home

Agents and Metastable Failures

Ongoing MS Thesis2026

Agents and Metastable Failures
Multi-AgentEvaluationBenchmarks

Progress Starvation in Agentic Loops: The vulnerability of delegated authority.

The Liveness Violation

Drawing from Lamport (1977)...
we investigate why agentic systems fail to satisfy the property of "eventual progress." By utilizing indirect prompt injection, we demonstrate how an agent can be coerced into a perpetual state of epistemic starvation, where it continues to consume resources (e.g., repeating a 2+2=? request) while remaining computationally unable to reach termination.

Delegation as a Structural Flaw

We frame Delegation—whether to a RAG corpus, an external tool, or a peer agent—as the primary attack surface. This "delegated control" allows an adversary to influence the agent's internal logic from the outside, effectively capturing the control flow without direct instruction modification.

Experimental Focus: Benchmarks and Mitigations

Our benchmarks map how specific adversarial inputs induce Progress Failureacross various agent architectures. We propose a Gated Defense Branching
framework to break these loops, ensuring that agents can differentiate between legitimate uncertainty and synthetic stalling patterns.

// State Analysis:
Threat Model: Epistemic Deception
Failure Mode: Eternal Deferral
Countermeasure: branching-based Decision Gates