Articles tagged “model-routing”
2 articles

Agent Architecture·14 min read read
Everyone Benchmarks Opus. Your Chatbot Runs on Haiku.
Haiku 4.5, GPT-5 Mini, Gemini Flash at the $1/MTok tier that powers CX. Tool-call accuracy, first-token latency, structured-output reliability, blended cost math.
Read More

Agent Architecture·13 min read read
Your Agent Should Use Three Models, Not One
Production CX agents route tasks by difficulty, not brand loyalty. The planner/router/summarizer pattern, a concrete rubric, support-deflection cost math, and the failure modes nobody warns you about.
Read More
The Signal Briefing
One email a week. How leading CS, revenue, and AI teams are turning conversations into decisions. Benchmarks, playbooks, and what's working in production.