The demo-to-production gap

There's a moment with every AI feature where it works in the demo and everyone gets excited. That moment is a trap. The demo proves the happy path exists. It says nothing about the hundred unhappy paths a real user will find in the first afternoon.

The gap, concretely

The demo handles the input you typed. Production handles:

the empty input, the 50-page input, the input in the wrong language
the model being down, slow, or rate-limited
the answer that's confidently wrong, and that you can only catch if it comes with a citation
the user who pastes secrets into the box

None of that is glamorous. All of it is the actual product.

What closes it

The teams that ship don't have better models. They have better plumbing around the model: input validation, graceful failure, a retry budget, an eval suite that runs on every change, and a human-in-the-loop escape hatch for when the machine isn't sure.
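
To make that concrete, here is a minimal sketch of how a few of those pieces (input validation, a retry budget, graceful failure, and the human escape hatch) might fit around a single model call. Everything named here is an assumption for illustration: `call_model` is a hypothetical client function that returns the text plus a confidence score, `ModelUnavailable` is a made-up exception standing in for outages, timeouts, and rate limits, and the limits and thresholds are placeholders, not recommendations.

```python
import time
from dataclasses import dataclass
from typing import Callable

MAX_INPUT_CHARS = 20_000  # assumed limit; tune to your model's context window


class ModelUnavailable(Exception):
    """Hypothetical error for an outage, timeout, or rate limit."""


@dataclass
class Result:
    status: str  # "ok", "rejected", "needs_human", or "unavailable"
    text: str = ""


def answer(
    prompt: str,
    call_model: Callable[[str], tuple[str, float]],  # hypothetical: (text, confidence 0..1)
    max_attempts: int = 3,        # the retry budget
    min_confidence: float = 0.7,  # assumed threshold for escalating to a person
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> Result:
    # Input validation: reject bad input before spending a model call.
    cleaned = prompt.strip()
    if not cleaned:
        return Result("rejected", "Please enter some text.")
    if len(cleaned) > MAX_INPUT_CHARS:
        return Result("rejected", "That input is too long.")

    # Retry budget: a bounded number of attempts with exponential backoff.
    for attempt in range(max_attempts):
        try:
            text, confidence = call_model(cleaned)
        except ModelUnavailable:
            if attempt < max_attempts - 1:
                sleep(base_delay * 2 ** attempt)
            continue
        # Escape hatch: low-confidence answers go to a human, not the user.
        if confidence < min_confidence:
            return Result("needs_human", text)
        return Result("ok", text)

    # Graceful failure: the budget is spent, so say so plainly.
    return Result("unavailable", "We couldn't generate an answer right now.")
```

The caller branches on `status` instead of catching exceptions, which keeps every unhappy path visible in one place. An eval suite would then exercise `answer` with a stubbed `call_model` on every change.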

I'll walk through the exact plumbing, with code, in the members' how-tos. But even if you never read those: assume the demo is the easy 10%, and budget for the other 90% up front.