Plumbline · Field notes

Field Notes

Short, technical write-ups on the gap between a passing test and software that actually works — the failures that hide there, and how to find them before they ship.

Jun 2026 · Reproducibility

It worked on my machine — that’s the bug.

Code that worked, but only on state nobody wrote down — a warm cache, a hand-installed tool. It passes every test until the first clean rebuild.

Read →
Jun 2026 · Falsifiability

Your test passed. Would it have failed if the feature were broken?

A passing test is only evidence if a broken system would have failed it. Three real cases — and the one question to ask of every green check.

Read →
Stay in the loop

Get new field notes.

Short, technical write-ups on verifying AI-built software — sent when a new one’s ready. No spam, no cadence promises.

You’re on the list. Talk soon.