Architecture

JUNE 20, 2026

The False Green Baseline: When a Passing Test Suite Hides a Broken Type-Check

How an AI coding agent's type-check gate misattributed pre-existing LSP errors to a correct rename, and why gating on the change delta is the answer.

JUNE 18, 2026

Raising the Floor under the Coder: Generalizing an AI Coding Agent across Ticket Types

An AI coding agent went from one ticket type to six, with four architectural moves that raised the floor to deliver zero-Coder results on four types.

MAY 20, 2026

Agent Pipeline Grounding Chat: From Free-Form Q&A to Typed Fields

Two open quality items are closed. Operator design decisions flow through typed handoff fields. The grounding chat has a structural defense against LLM drift.

MAY 18, 2026

From LLM Author to LLM Reviewer: An AI Coding Agent Authors a Production Feature With Zero LLM Code Generation

The pipeline completed a multi-layer TypeScript feature with zero LLM code generation: 6/6 ticket tests, 959/959 suite, $0.308 versus $0.681.

MAY 14, 2026

Grounding an Autonomous Engineering Pipeline in Operator Design: The Pre-Autonomous Chat Stage

A new pre-autonomous chat stage lets the operator ground design decisions in the registry before the pipeline runs, adding a new top level to the trust hierarchy.

MAY 9, 2026

From LLM Luck to Structurally Guaranteed: One Ticket Across Four Architectural Eras

Seven pipeline runs, one ticket, four architectural eras. Per-test cost dropped from $0.385 to $0.074 by replacing LLM guesswork with structural derivation.

MAY 6, 2026

Engineering Around LLM Non-Determinism: The Architectural Follow-Up to 248 Runs

What shipped in the three weeks after the 248-run hallucination ceiling: removing the LLM from computable decisions and validating everything else.

APRIL 24, 2026

What the Symbol Registry Stores, and How It Stays Fresh

The data model behind the symbol registry: per-symbol records, file-level hashes, call-graph edges, and the invalidation strategy that keeps it current.

FEBRUARY 20, 2026

What Calls This Function? Why AI Coding Agents Need a Language Server

Tree-sitter tells you where a symbol is defined. It cannot tell you where it is called. That gap cost one pipeline run 33,000 tokens to find out.

FEBRUARY 13, 2026

The Quality Gate That Passed When It Failed

A Haiku optimization made the L2 quality gate silently pass on every run. The fix was removing the LLM call entirely.

FEBRUARY 6, 2026

Why AI Coding Agents Fail on Evolving Codebases

It is not a model capability problem. An agent working from the wrong codebase version produces output that is plausible but wrong in ways that are hard to catch.