> Writing

Building and breaking agents

RSS · JSON Feed

agent-systemsai-securityarchitectureprocessred-teamingreliability

What red-teaming an agent actually looks like
May 14, 2026 · 1 min read
Not a jailbreak leaderboard. A fixed clock, a severity-weighted plan, and findings that ship as deterministic reproductions.
#red-teaming#ai-security#process
A practical threat model for tool-using agents
May 6, 2026 · 1 min read
Four surfaces, ranked by how much a single success costs you. Prompt injection is on the list. It is not at the top.
#ai-security#red-teaming#agent-systems
Agent reliability is a systems problem, not a prompt problem
April 22, 2026 · 1 min read
Most agent failures I see in production are not bad prompts. They are missing boundaries, missing observability, and a planning loop nobody can inspect.
#agent-systems#reliability#architecture