[{"data":1,"prerenderedAt":74},["ShallowReactive",2],{"term-t\u002Ftoken-budget":3,"related-t\u002Ftoken-budget":59},{"id":4,"title":5,"acronym":6,"body":7,"category":40,"description":41,"difficulty":42,"extension":43,"letter":44,"meta":45,"navigation":46,"path":47,"related":48,"seo":53,"sitemap":54,"stem":57,"subcategory":6,"__hash__":58},"terms\u002Fterms\u002Ft\u002Ftoken-budget.md","Token Budget",null,{"type":8,"value":9,"toc":33},"minimark",[10,15,19,23,26,30],[11,12,14],"h2",{"id":13},"eli5-the-vibe-check","ELI5 — The Vibe Check",[16,17,18],"p",{},"A token budget is the cap on how many tokens a request, session, or user can consume. Like a food budget but for AI. Without budgets, one runaway agent can rack up thousands of dollars in an hour.",[11,20,22],{"id":21},"real-talk","Real Talk",[16,24,25],{},"A token budget is an enforced limit on token consumption per request, session, user, or time window. It is implemented at the application layer or via API rate limits. It is critical for multi-tenant AI products, where it prevents cost explosions from single users or runaway agents. It is typically paired with graceful degradation when the budget is exhausted.",[11,27,29],{"id":28},"when-youll-hear-this","When You'll Hear This",[16,31,32],{},"\"Set a 50k token budget per session and enforce it.\" \u002F \"Token budgets prevent one bad actor from bankrupting the service.\"",{"title":34,"searchDepth":35,"depth":35,"links":36},"",2,[37,38,39],{"id":13,"depth":35,"text":14},{"id":21,"depth":35,"text":22},{"id":28,"depth":35,"text":29},"ai","A token budget is the cap on how many tokens a request, session, or user can consume. Like a food budget but for AI.","beginner","md","t",{},true,"\u002Fterms\u002Ft\u002Ftoken-budget",[49,50,51,52],"Token Burn","Token Tax","Rate Limiting","Cost Per Token",{"title":5,"description":41},{"changefreq":55,"priority":56},"weekly",0.7,"terms\u002Ft\u002Ftoken-budget","HxDGrxYUx5WguCCIvnk4WF1RBtWCd_infhouUHxATwo",[60,63,68,71],{"title":52,"path":61,"acronym":6,"category":40,"difficulty":42,"description":62},"\u002Fterms\u002Fc\u002Fcost-per-token","Cost per token is how much each token (input or output) costs with a given AI provider. Flagship models cost more per token than cheap ones.",{"title":51,"path":64,"acronym":6,"category":65,"difficulty":66,"description":67},"\u002Fterms\u002Fr\u002Frate-limiting","backend","intermediate","Rate limiting is like a bouncer who says 'you can come in 100 times per hour, then you wait.'",{"title":49,"path":69,"acronym":6,"category":40,"difficulty":42,"description":70},"\u002Fterms\u002Ft\u002Ftoken-burn","Token burn is how fast your AI bill climbs because the model keeps re-reading the same context. Every turn of a long chat costs more.",{"title":50,"path":72,"acronym":6,"category":40,"difficulty":42,"description":73},"\u002Fterms\u002Ft\u002Ftoken-tax","Token tax is the ongoing cost of running AI features in production. Every API call costs tokens. Every request the user makes. It never sleeps.",1776518319312]