Discussion about this post

User's avatar
H. Floyd's avatar

The tax-prep agent automated 7,000 returns at 97%. But the 3% it got wrong is where the real design problem lives. Did it know those returns were wrong? If it didn't, no amount of transparency or control in the interface would have caught them.

The first question for any agentic product is not how to design the handoff. It is how to detect when a handoff is needed. The agent will not tell you it is about to make a mistake. You have to build that detection separately.

Gabe Michael's avatar

Spot on Jonas.

10 more comments...

No posts

Ready for more?