Frank at 2am: same fix, half the cost

Narrative angle on the firefighter benchmark. Same bug, same 8/10 fix, same passing build. Haiku-with-memory cost $0.075. Sonnet-without cost $0.138. The whole cost story compressed into one on-call shift.

Sources: /blog/memory-not-smarter-cheaper-models · /blog/model-routing-single-developer

X / Twitter

Post 1 272 / 280

2am. Frank gets paged. Auth bypass live in production. He has never seen the codebase.

Same bug. Same 8/10 fix. Same passing build.

Frank with Hydrate (Haiku): $0.075
Frank without (Sonnet): $0.138

Memory did not make him smarter. It made the cheaper model sufficient.

Hook tweet. Best posted Tuesday to Thursday, 8 to 10am UK time. Reply with the LinkedIn version as a thread continuation.

Post 2 284 / 280

The mechanism is just cache.

Sonnet fresh input: $3 per million tokens.
Sonnet cached read: $0.30 per million. Ten times.

Without memory, every session starts cold. Cache hit rate sits around 67%.
With memory, context is warm before turn one. 97%.

Whole cost story in two numbers.

Stat-led follow-up. Can stand alone or thread under the Frank tweet.

Post 3 267 / 280

Sonnet for new decisions.
Haiku for established implementation.

That is the whole rule.

Hydrate is the bridge that makes Haiku viable on warm context. After your first sprint, roughly 80% of your sessions do not need frontier reasoning. They need frontier context.

Standalone rule tweet. Quotable. Pin candidate.

Frank at 2am: same fix, half the cost

X / Twitter

LinkedIn

Bluesky

Mastodon

Reddit

Title

Body

Title

Body

Hacker News

Title

URL

First comment

dev.to / Hashnode

Title

Tags

Body (markdown)

Images

Image

Save the generated image as

Prompt

Image

Save the generated image as

Prompt

Image

Save the generated image as

Prompt