> Writing
Building and breaking agents
agent-systemsai-securityarchitectureprocessred-teamingreliability
What red-teaming an agent actually looks like
May 14, 2026 · 1 min readNot a jailbreak leaderboard. A fixed clock, a severity-weighted plan, and findings that ship as deterministic reproductions.
#red-teaming#ai-security#processA practical threat model for tool-using agents
May 6, 2026 · 1 min readFour surfaces, ranked by how much a single success costs you. Prompt injection is on the list. It is not at the top.
#ai-security#red-teaming#agent-systemsAgent reliability is a systems problem, not a prompt problem
April 22, 2026 · 1 min readMost agent failures I see in production are not bad prompts. They are missing boundaries, missing observability, and a planning loop nobody can inspect.
#agent-systems#reliability#architecture