Articles tagged “gpt-5”
3 articles

Industry & Strategy·16 min read read
Your CX Agent Doesn't Care Who Won SWE-Bench. Here's Who Actually Wins.
SWE-bench crowns a coding king. Customer experience agents answer to a different benchmark, tau-bench, and the rankings flip. The head-to-head that actually predicts production reliability.
Read More

Agent Architecture·13 min read read
Your Agent Should Use Three Models, Not One
Production CX agents route tasks by difficulty, not brand loyalty. The planner/router/summarizer pattern, a concrete rubric, support-deflection cost math, and the failure modes nobody warns you about.
Read More

Operations·14 min read read
Reasoning Tokens Are Showing Up on the Bill
GPT-5 and Claude thinking tokens bill as output and stay invisible. A 200-token reply can hide 8,000 billable ones. How to measure, cap, and budget.
Read More
The Signal Briefing
Un email por semana. Cómo los equipos líderes de CS, ingresos e IA están convirtiendo conversaciones en decisiones. Benchmarks, playbooks y lo que funciona en producción.