
Analysis of 2.4 billion enterprise API calls confirms the blended cost of AI fell 67% year-over-year, from $18.40 to $6.07 per million tokens between Q1 2025 and Q1 2026.
That number is real. It is also almost entirely irrelevant to what is actually happening to enterprise AI invoices.
Total spend equals price per unit multiplied by volume consumed. The first variable is falling. The second is growing faster than any budget model accounted for.
The FinOps Foundation's 2026 State of FinOps Report found that 73% of enterprises reported AI costs exceeded original projections.
J.R. Storment, executive director of the FinOps Foundation, described what happened inside finance teams this spring: "In April and May, I started hearing from companies: 'Oh my god, we are 3x over our entire 2026 token budget and it's only April.'
The whole conversation shifted from tokenmaxxing and 'go fast' to 'we need guardrails, how do we control this?'"
Why cheaper tokens produce bigger bills
The problem is structural. Most enterprise AI budgets were built on per-seat or per-subscription logic. That logic made sense when AI meant a SaaS tool with a fixed monthly price.
Agentic workflows consume tokens in ways that no traditional budget model anticipated:
The companies already managing this
Uber blew through its entire 2026 AI coding budget in four months. Microsoft revoked internal Claude Code licenses months after enabling them. One company ran up a $500 million Claude bill in a single month after forgetting to set usage limits.
The Linux Foundation responded by announcing the Tokenomics Foundation, a new standards body aimed at bringing the same cost discipline to AI tokens that FinOps brought to cloud spending.
The pattern is consistent: deploy first, discover the bill second, introduce controls third. For enterprises trying to skip steps two and three, the requirement is real-time per-user, per-model token tracking before the invoice arrives.
CloudEagle.ai provides exactly that, visibility into AI token consumption across tools, by user and department, so finance and IT teams can set thresholds and act before the damage is done.
Let the headlines come straight to you with Access Granted — a monthly edition of Okta announcements, expert perspectives, analysis, and more.
