The Chanl Blog
Insights on building, connecting, and monitoring AI agents for customer experience — from the teams shipping them.
All Articles
235 articles · Page 6 of 20

Your Agent Should Use Three Models, Not One
Production CX agents route tasks by difficulty, not brand loyalty. The planner/router/summarizer pattern, a concrete rubric, support-deflection cost math, and the failure modes nobody warns you about.

Reasoning Tokens Are Showing Up on the Bill
GPT-5 and Claude thinking tokens bill as output and stay invisible. A 200-token reply can hide 8,000 billable ones. How to measure, cap, and budget.

When to Use a Supervisor, When to Let Agents Swarm
Supervisor burns 20-40% more tokens per run. Swarm hits a quality cliff past 8-10 handoffs. Start supervisor, graduate to swarm when latency bites.

Stop Using SWE-Bench to Pick Your CX Model
SWE-Bench scores 85% or 23% depending on the harness, and neither measures customer experience. Why tau-bench, tau2-bench, and pass^k matter for CX agents.

The Modern Data Stack Wasn't Built for Agents
Snowflake, dbt, and Fivetran were built for humans asking batch questions. Agents need streaming signals, per-entity memory in under 100ms, and write-back.

Correlation Killed Your Retention Model. Causal AI Fixes It.
Your churn model says support calls cause retention. They don't. Build a causal pipeline with DoWhy, EconML, and propensity matching in Python.

Stop Storing Transcripts. Start Modeling Signals.
A JSON blob of transcripts works at 1k calls and collapses at 50k. Design a Signal schema with entity/event split, confidence, provenance, and versioning.

Every Conversation Is an Experiment You Didn't Run
Your agent already ran the A/B test you're scoping. Here's how to read the results in your logs with propensity matching, synthetic control, and diff-in-diff.

Stop Building Dashboards. Start Shipping Signal.
Dashboards tell VPs what happened last quarter. Signal tells them which account to call today, and why. How CX is exiting the post-dashboard era in 2026.

Is AI Better Than Your Humans? Score Both on One Rubric
Most teams can't say whether AI beats humans because they score them differently. One rubric, run on both, sliced by segment, gives you an honest answer.

Every Failed Call Is a Test Case You Haven't Written Yet
The gap between staging and production for AI agents is measured in surprise. Here's how to close the loop from live failure to regression gate.

Your Conversations Are Already CRM Data. Here's How to Use Them.
Every customer call carries churn risk, expansion intent, and compliance signal. Most teams toss it. Here's how to turn conversations into live CRM data.
Learn Agentic AI
Weekly. Patterns for shipping agents that work — MCP, scorecards, regression tests, prompts, model comparisons.