[{"data":1,"prerenderedAt":72},["ShallowReactive",2],{"term-m\u002Fmodel-routing":3,"related-m\u002Fmodel-routing":58},{"id":4,"title":5,"acronym":6,"body":7,"category":40,"description":41,"difficulty":42,"extension":43,"letter":44,"meta":45,"navigation":46,"path":47,"related":48,"seo":52,"sitemap":53,"stem":56,"subcategory":6,"__hash__":57},"terms\u002Fterms\u002Fm\u002Fmodel-routing.md","Model Routing",null,{"type":8,"value":9,"toc":33},"minimark",[10,15,19,23,26,30],[11,12,14],"h2",{"id":13},"eli5-the-vibe-check","ELI5 — The Vibe Check",[16,17,18],"p",{},"Model routing is dynamically choosing which AI model to call based on task complexity, cost, or latency — the smart switchboard for LLMs. Simple question? Route to Haiku. Complex reasoning? Escalate to Opus. Time-sensitive? Pick the fastest. You're not locked into one model — you're running a strategy that matches task requirements to model capabilities. Like having three employees with different skill levels and knowing which one to call.",[11,20,22],{"id":21},"real-talk","Real Talk",[16,24,25],{},"Model routing sits in front of an LLM layer and makes dispatch decisions based on classifiers, heuristics, or a lightweight \"router model\" that evaluates the incoming prompt. OpenRouter, LiteLLM, and RouteLLM are purpose-built routing layers. Organizations use routing to control cost (cheap models for easy tasks), latency (faster models for UX-critical paths), and capability (specialized models for code, math, or multimodal tasks).",[11,27,29],{"id":28},"when-youll-hear-this","When You'll Hear This",[16,31,32],{},"\"We implemented model routing — simple queries hit Haiku, complex ones escalate to Sonnet.\" \u002F \"Model routing cut our LLM costs by 60% without touching response quality.\"",{"title":34,"searchDepth":35,"depth":35,"links":36},"",2,[37,38,39],{"id":13,"depth":35,"text":14},{"id":21,"depth":35,"text":22},{"id":28,"depth":35,"text":29},"ai","Model routing is dynamically choosing which AI model to call based on task complexity, cost, or latency — the smart switchboard for LLMs.","advanced","md","m",{},true,"\u002Fterms\u002Fm\u002Fmodel-routing",[49,50,51],"LLM","Agent","Orchestration",{"title":5,"description":41},{"changefreq":54,"priority":55},"weekly",0.7,"terms\u002Fm\u002Fmodel-routing","hDm623GoWNfDuj3p6lCIaCs_zfCk8AvOCitQzsRldlE",[59,63,68],{"title":50,"path":60,"acronym":6,"category":40,"difficulty":61,"description":62},"\u002Fterms\u002Fa\u002Fagent","intermediate","An AI agent is an LLM that doesn't just answer questions — it takes actions.",{"title":49,"path":64,"acronym":65,"category":40,"difficulty":66,"description":67},"\u002Fterms\u002Fl\u002Fllm","Large Language Model","beginner","An LLM is a humongous AI that read basically the entire internet and learned to predict what words come next, really really well.",{"title":51,"path":69,"acronym":6,"category":70,"difficulty":61,"description":71},"\u002Fterms\u002Fo\u002Forchestration","cicd","Orchestration is the process of automatically managing, coordinating, and scheduling where your containers run.",1775560914131]