Shopify built a leaderboard to get employees excited about using AI. It worked a little too well.
Farhan Thawar, Shopify’s VP and Head of Engineering, has confirmed the company “killed” its original token leaderboard after discovering it created a perverse incentive: employees were burning through AI tokens to climb the rankings, not to actually accomplish anything useful. The phenomenon has a name now. It’s called “tokenmaxxing,” and it’s exactly what it sounds like, competitive overconsumption of AI resources for bragging rights.
The fix? Shopify replaced the leaderboard with a usage dashboard designed to measure utility rather than volume. The company also implemented spending alerts that trigger whenever a single user’s daily token spend exceeds $250.
From vanity metrics to value metrics
Shopify’s AI push began in 2025, around the same time CEO Tobi Lutke issued a memo establishing AI use as a baseline expectation across the company. The leaderboard was part of that cultural push, a way to normalize and encourage adoption.
Shopify also built an internal LLM proxy for bulk token purchasing, centralizing how the company buys and distributes AI capacity. Combined with the $250 daily spend alert per user, the infrastructure now acts as both a supply chain and a guardrail.
Smaller models, bigger savings
Shopify is investing heavily in what it calls its Universal Distillation Platform, or UDP, a system for building smaller, task-specific AI models through a process called model distillation. Instead of routing every request through massive frontier models like GPT or Claude, Shopify trains compact models that are purpose-built for specific jobs. These distilled models can be 2x to 30x cheaper than calling frontier APIs, according to Thawar, while actually improving accuracy and reducing latency for the tasks they’re designed to handle.
What this means for the AI adoption playbook
Shopify’s experience reflects a pattern: mandate AI adoption company-wide, gamify it, discover gamification created waste, then rebuild the measurement system around outcomes. Discussions around tokenmaxxing have grown more prominent in 2026, with Shopify increasingly cited as a corrective model for the broader industry. Meta, among others, has reportedly wrestled with comparable incentive alignment problems in its own AI metrics.
The core lesson is deceptively simple: what you measure is what you get. Volume-based metrics encourage volume. Utility-based metrics encourage utility. Thawar’s public candor about the leaderboard’s failure is unusual in the industry.
Disclosure: This article was edited by Editorial Team. For more information on how we create and review content, see our Editorial Policy.

1 hour ago
21









English (US) ·