Articles tagged “tool-calling”
8 articles

How MCP Tool Descriptions Break Your Agent
New research shows 97% of MCP tool descriptions have quality issues that hurt agent accuracy. Here's what the smells look like, why they matter, and how to fix them.

MCP Apps: Build UIs That Render Inside AI Chat
MCP Apps let your tools return interactive HTML dashboards, forms, and visualizations that render inline in Claude, ChatGPT, and VS Code. Here's how to build them for CX agents.

AG-UI: The Protocol That Connects Agents to UIs
AG-UI is the open event-based protocol that streams AI agent state to any frontend in real time. Here's how it works, what events it defines, and how to wire it up in TypeScript.

Circuit Breakers for AI Agents: Stop the 3 AM Meltdown
One retry loop at 11 PM becomes $437 by 7 AM. Here's how to implement circuit breakers for AI agent tool calls, LLM calls, and external APIs, with TypeScript patterns that stop cascading failures before they start.

Your CX Agent Doesn't Care Who Won SWE-Bench. Here's Who Actually Wins.
SWE-bench crowns a coding king. Customer experience agents answer to a different benchmark, tau-bench, and the rankings flip. The head-to-head that actually predicts production reliability.

Everyone Benchmarks Opus. Your Chatbot Runs on Haiku.
Haiku 4.5, GPT-5 Mini, and Gemini Flash, compared at the $1/MTok tier that powers CX: tool-call accuracy, first-token latency, structured-output reliability, and blended cost math.

MCP SSE Is Deprecated. Here's How to Migrate
SSE transport is being deprecated across major MCP platforms in 2026. Here's a practical migration guide from HTTP+SSE to Streamable HTTP, with TypeScript examples and a phased rollout strategy.

MCP Streamable HTTP: The Transport Layer That Makes AI Agents Production-Ready
MCP's Streamable HTTP transport replaced the original SSE transport to fix critical production gaps. This guide covers what changed, why it matters, and how to implement it in TypeScript with code examples.