Daily token quota planner
A handful of heavy users can quietly consume most of your API budget. Per-user token quotas address this: each user gets a fair daily allowance, which keeps total spend predictable. This planner takes your daily budget, the number of free and paid users, and a blended token price, and computes the daily token quota for each tier, with paid users getting a configurable multiple of the free allowance.
How it works
The planner converts your daily budget into a total token allowance, then splits
it across users weighted by tier. Here blended_price is the cost per million
tokens, and paid users get ratio× the free quota:
total_tokens = (daily_budget / blended_price) × 1,000,000
weight_sum = free_users × 1 + paid_users × ratio
free_quota = total_tokens / weight_sum
paid_quota = free_quota × ratio
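Because free_users × free_quota + paid_users × paid_quota = total_tokens, a day in which every user spends exactly their quota lands exactly on budget, provided the blended price matches what you actually pay per token. Users who stay under their quota only lower the total, so with hard caps the budget is a ceiling.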
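The formulas above can be sketched in Python as follows. This is a minimal illustration, not the planner's actual code: the function name plan_quotas, the QuotaPlan type and the example inputs are hypothetical, and blended_price is assumed to be the cost per million tokens, as the × 1,000,000 factor implies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QuotaPlan:
    total_tokens: float  # tokens the daily budget buys
    free_quota: float    # daily tokens per free user
    paid_quota: float    # daily tokens per paid user


def plan_quotas(daily_budget: float, blended_price: float,
                free_users: int, paid_users: int, ratio: float) -> QuotaPlan:
    """Split a daily budget into per-user token quotas.

    Assumption: blended_price is the cost per 1,000,000 tokens,
    in the same currency as daily_budget.
    """
    if blended_price <= 0:
        raise ValueError("blended_price must be positive")
    # Each free user counts as 1 share, each paid user as `ratio` shares.
    weight_sum = free_users * 1 + paid_users * ratio
    if weight_sum <= 0:
        raise ValueError("need at least one user with positive weight")
    total_tokens = (daily_budget / blended_price) * 1_000_000
    free_quota = total_tokens / weight_sum
    paid_quota = free_quota * ratio
    return QuotaPlan(total_tokens, free_quota, paid_quota)


# Hypothetical inputs: budget of 100 per day, price of 2 per million tokens.
plan = plan_quotas(daily_budget=100.0, blended_price=2.0,
                   free_users=800, paid_users=40, ratio=5.0)
print(plan)
```

With these inputs the weights sum to 1,000, so the free quota is 50,000 tokens and the paid quota is 250,000 tokens. As a check, 800 × 50,000 + 40 × 250,000 = 50,000,000 tokens, which is the full daily allowance.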
Tips and notes
- Set quotas slightly below the budget-exact figure to leave headroom for spikes and for users whose usage runs above the average the plan assumes (for example, requests that cost more per token than the blended price).
- Reset quotas on a per-user rolling 24-hour window rather than at midnight UTC, so that capped users do not all resume traffic at the same instant (a thundering herd).
- Pair quotas with an upgrade prompt: a free user repeatedly hitting their cap is your warmest paid-conversion signal.
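The first two tips can be sketched in Python. This is one possible reading, not a prescribed design: with_headroom and RollingQuota are hypothetical names, the 10% default margin is an arbitrary example, and the rolling window is implemented here as a sliding window in which each spend expires 24 hours after it was recorded.

```python
from collections import deque

WINDOW_SECONDS = 24 * 60 * 60


def with_headroom(quota: float, headroom: float = 0.10) -> int:
    """Shrink a budget-exact quota by a fractional safety margin."""
    if not 0.0 <= headroom < 1.0:
        raise ValueError("headroom must be in [0, 1)")
    return int(quota * (1.0 - headroom))


class RollingQuota:
    """Tracks one user's token usage over the trailing 24 hours."""

    def __init__(self, quota: int) -> None:
        self.quota = quota
        self.events = deque()  # (timestamp_seconds, tokens), oldest first
        self.used = 0          # tokens spent inside the current window

    def _expire(self, now: float) -> None:
        # Drop spends that are 24 hours old or older.
        while self.events and self.events[0][0] <= now - WINDOW_SECONDS:
            _, tokens = self.events.popleft()
            self.used -= tokens

    def try_spend(self, tokens: int, now: float) -> bool:
        """Record the spend and return True if it fits under the quota."""
        self._expire(now)
        if self.used + tokens > self.quota:
            return False
        self.events.append((now, tokens))
        self.used += tokens
        return True


# Timestamps are passed in explicitly (seconds) to keep the example deterministic.
user = RollingQuota(quota=100)
first = user.try_spend(60, now=0)        # fits: 60 of 100 used
second = user.try_spend(50, now=10)      # rejected: would reach 110
third = user.try_spend(60, now=86_400)   # fits: the spend from t=0 has expired
```

Because each spend ages out individually, a user's capacity returns gradually, 24 hours after each request, so no single moment releases every capped user at once.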