ChanlChanl

The Chanl Blog

Insights on building, connecting, and monitoring AI agents for customer experience — from the teams shipping them.

All Articles

235 articles · Page 3 of 20

A glowing terminal prompt floats above an AWS cloud diagram, agent tool calls fanning out to S3, DynamoDB, Bedrock, and Lambda nodes
Tools & MCP·12 min read read

AWS just gave your agent 15,000 cloud tools

The AWS MCP Server is now GA. One tool call reaches any of 15,000+ AWS APIs, sandboxed Python execution lets agents run multi-step operations, and Agent Skills replace heavyweight SOPs with on-demand guidance. Here's what changed and how to wire it.

Read More
A warm-lit dashboard showing token usage breakdown with a large orange bar labeled 'System Prompt' dominating the chart
Operations·13 min read read

Your agent re-reads its own manual on every call

Datadog's 2026 State of AI Engineering report found that 69% of input tokens go to system prompts, yet only 28% of LLM calls use prompt caching. Here's how to diagnose the problem and fix it without rewriting your agent.

Read More
An interactive booking confirmation card rendered inline inside an AI chat conversation
Tools & MCP·14 min read

MCP Apps: Build UIs That Render Inside AI Chat

MCP Apps let your tools return interactive HTML dashboards, forms, and visualizations that render inline in Claude, ChatGPT, and VS Code. Here's how to build them for CX agents.

Read More
A flowchart showing an agent's step-by-step decision path with one step flagged as diverging from the expected trajectory
Testing & Evaluation·13 min read

Trajectory Eval: Catch Agent Bugs Output Scoring Misses

Final-output scoring misses 20-40% of agent regressions. Trajectory evaluation scores every step an agent takes -- tool calls, reasoning decisions, order of operations -- and catches the bugs that output-only evals can't see.

Read More
Two parallel agent workflows running side by side, one labeled live and one labeled shadow, with metrics comparison
Operations·13 min read

Shadow Mode: Deploy AI Agent Updates Without Risk

Shadow mode runs your new agent version in parallel with production, comparing behavior before customers ever see it. Here's how to build the full deployment pipeline from shadow to canary to 100%.

Read More
Diagram showing an AI agent resuming a multi-step workflow from the last checkpoint after a crash
Agent Architecture·14 min read

Your CX Agent Crashes Mid-Task. Here's the Fix.

When your CX agent crashes mid-refund or mid-booking, the customer is stuck. Durable execution guarantees long-running agent tasks survive failures. Here's how to build it.

Read More
A browser dashboard showing live agent tool calls and state updates streaming in real time
Tools & MCP·13 min read

AG-UI: The Protocol That Connects Agents to UIs

AG-UI is the open event-based protocol that streams AI agent state to any frontend in real time. Here's how it works, what events it defines, and how to wire it up in TypeScript.

Read More
A graph diagram showing agent state transitions with named nodes and typed edges
Agent Architecture·14 min read

Your Agent Is Already a State Machine. Make It Explicit.

Every production AI agent is secretly a state machine. Making it explicit gives you checkpointing, testable paths, and observable state transitions -- without rewriting your agent logic.

Read More
A dashboard showing rich telemetry data on one side and a blank trend chart on the other, representing observability without measurement
Testing & Evaluation·11 min read

Your Agent Has Observability. It Doesn't Have Measurement.

89% of AI teams added observability. 52% added evals. But only 31% can say whether their agent is getting better or worse. Here's the difference between watching your agent and actually measuring it.

Read More
A timeline showing a completed conversation on the left and failed downstream tasks on the right, with a gap between them
Agent Architecture·13 min read

Why CX Agents Fail Between Conversations

Your AI agent handles the call perfectly and still fails your customer. The problem isn't the conversation -- it's everything that happens after it. Here's how async task queues fix the gap.

Read More
Dashboard showing AI agent KPI tiles for task completion rate, escalation rate, cost per successful outcome, and CSAT delta
Testing & Evaluation·13 min read

AI Agent KPIs: What to Measure Before You Ship

Only 31% of teams have a measurement framework for their AI agents. Here's how to define task completion rate, escalation rate, cost per outcome, and CSAT delta before your first production interaction.

Read More
Diagram showing an MCP server with OAuth 2.0 token validation, per-tenant tool scoping, and multi-tenant isolation layers
Tools & MCP·15 min read

MCP Auth in Production: Scopes, Tokens, and Tenant Isolation

Most MCP servers ship with no auth. Here's how to add OAuth 2.0 scopes, per-tenant tool sets, and client isolation before your MCP server becomes load-bearing production infrastructure.

Read More

Learn Agentic AI

Weekly. Patterns for shipping agents that work — MCP, scorecards, regression tests, prompts, model comparisons.

500+ builders subscribed