Blog/Tags/agent-architecture

agent-architecture

Browse 24 articles tagged with “agent-architecture”.

Articles tagged “agent-architecture”

24 articles

Event-Driven Architecture Diagram Showing an Ambient AI Agent Subscribing to Multiple CX Data Streams

Agent Architecture·14 min read

How to Build Ambient AI Agents for Always-On CX

Most AI agents wait for prompts. Ambient agents watch event streams and act first. Here's how to build always-on CX intelligence that catches problems before customers notice them.

A Traffic Light Showing Amber Beside a Circuit Board Pattern, Representing a Deliberate Pause in an Automated Workflow

Agent Architecture·16 min read

How to Build Agent Interrupt and Approval Checkpoints

How to pause an AI agent before high-stakes actions, persist full state through the approval window, and resume cleanly. Covers interrupt gates, approval queues, checkpointing, and EU AI Act compliance for production CX agents.

A Control Panel With a Retry Button That Returns the Same Green Checkmark on Every Press, Showing Idempotent Operations

Best Practices·14 min read

How to Build Idempotent Tool Calls for AI Agents

Naive retry logic charges customers twice, sends duplicate emails, and fires double webhooks. Here's how to build idempotent tool calls for AI agents with idempotency keys, deduplication, and safe retries.

An AI Agent Conversation and a Human Agent Screen-Pop With Full Context, Side by Side

Voice & Conversation·15 min read

How to Build the Context Package for AI-to-Human Handoffs

AI agents escalate every day, and most send the human in blind. Here's how to build the context package that makes handoffs invisible to customers.

Structured agent specification document with capability and constraint sections next to a chat interface

Best Practices·16 min read

How to Write an Agent Spec Before You Write the Prompt

Inconsistent agent behavior isn't a prompt problem. It's a missing-spec problem. Here's the seven-section document that fixes it before code.

Side-by-side timeline showing sequential tool calls stacking up to 450ms versus parallel speculative execution finishing in 220ms

Agent Architecture·14 min read

Pre-Execute Tool Calls to Cut Agent Latency 48%

Sequential tool calls quietly kill your agent's response time. PASTE shows you can pre-execute likely tool calls during LLM thinking time and cut latency 48% without touching your model.

Diagram of an event-driven MCP server receiving webhook notifications and pushing updates to an AI agent

Tools & MCP·19 min read

MCP Webhooks: Build Event-Driven Agents That React in Real Time

MCP's request-response model breaks when agents need to react to external events. Here's how to build event-driven agents today using stateless HTTP plus webhooks, and what the June 2026 spec will make native.

Diagram showing an AI agent resuming a multi-step workflow from the last checkpoint after a crash

Agent Architecture·14 min read

Your CX Agent Crashes Mid-Task. Here's the Fix.

When your CX agent crashes mid-refund or mid-booking, the customer is stuck. Durable execution guarantees long-running agent tasks survive failures. Here's how to build it.

A graph diagram showing agent state transitions with named nodes and typed edges

Agent Architecture·14 min read

Your Agent Is Already a State Machine. Make It Explicit.

Every production AI agent is secretly a state machine. Making it explicit gives you checkpointing, testable paths, and observable state transitions -- without rewriting your agent logic.

A timeline showing a completed conversation on the left and failed downstream tasks on the right, with a gap between them

Agent Architecture·13 min read

Why CX Agents Fail Between Conversations

Your AI agent handles the call perfectly and still fails your customer. The problem isn't the conversation -- it's everything that happens after it. Here's how async task queues fix the gap.

Developer console with a grid of tool tiles fading out as a routing accuracy curve declines past tool 50

Tools & MCP·10 min read

Past 50 tools, function-calling accuracy falls off a cliff

Past 50 tools, function-calling accuracy falls off a cliff. Measure the curve on your own agent and recover accuracy with per-turn toolset scoping.

AI-generated illustration for long context vs rag cx agents -- Soul (2020) style, Terra Cotta palette

Technical Guide·17 min read

1M-Token Context or RAG? How to Pick for Your CX Agent

Gemini's 1M-token window is real but not free. A practical decision framework for choosing between long-context and RAG for customer experience agents, with cost numbers, code, and the hybrid pattern most production teams land on.

Three Routed Paths Splitting From a Single Customer Message, Each Labeled With a Different AI Model Tier

Agent Architecture·13 min read read

Your Agent Should Use Three Models, Not One

Production CX agents route tasks by difficulty, not brand loyalty. The planner/router/summarizer pattern, a concrete rubric, support-deflection cost math, and the failure modes nobody warns you about.

Two Agent Topologies Side by Side, a Hub-and-Spoke Supervisor and a Peer-to-Peer Swarm, With a Dotted Graduation Arrow Between Them

Agent Architecture·13 min read

When to Use a Supervisor, When to Let Agents Swarm

Supervisor burns 20-40% more tokens per run. Swarm hits a quality cliff past 8-10 handoffs. Start supervisor, graduate to swarm when latency bites.

Architecture diagram of an agentic data layer with event log, signal extraction, entity store, and improvement loop

Agent Architecture·14 min read

The Modern Data Stack Wasn't Built for Agents

Snowflake, dbt, and Fivetran were built for humans asking batch questions. Agents need streaming signals, per-entity memory in under 100ms, and write-back.

Watercolor illustration of an engineer at a desk reviewing wall of screens with charts and signals — schema design in the Arrival-inspired sage & olive palette

Agent Architecture·14 min read read

Stop Storing Transcripts. Start Modeling Signals.

A JSON blob of transcripts works at 1k calls and collapses at 50k. Design a Signal schema with entity/event split, confidence, provenance, and versioning.

Diagram showing MCP as a foundational protocol layer with agent configuration, memory, testing, and observability stacked above it

Tools & MCP·16 min read

MCP Is Now Open Infrastructure: Build for What's Next

MCP was donated to the Linux Foundation and the AAIF just held its first summit. What does the protocol becoming open infrastructure mean for what you build on top of it?

A massive warehouse of filing cabinets stretching into fog, with one person sitting at a clean desk with three folders under warm lamplight

Agent Architecture·14 min read read

Your MCP server is a monolith. Here's how to fix it

MCP servers dump every tool into the context window, burning tokens before your agent reasons. Four patterns to fix it: decompose, filter, gateway, facade.

Person surrounded by many tools but looking at an empty notebook

Agent Architecture·5 min read

50 Tools, Zero Memory. The Biggest Gap in AI Agents Today

AI agents can call 50 APIs but can't remember what you said yesterday. The tool layer is years ahead of the memory layer, and customers are paying the price.

Office workers are busy working on computers. - Photo by TECNIC Bioprocess Solutions on Unsplash

Agent Architecture·14 min read

The Buffering Bug That Quietly Breaks Voice Agent Latency

SSE streams fine locally, then tokens batch into 500ms bursts in production. Here's why, how to fix it, and why pipeline parallelism matters more than model speed.

Two men filming a scene outdoors with artwork. - Photo by Luke Thornton on Unsplash

Testing & Evaluation·12 min read

Zero-Shot or Zero Chance? How AI Agents Handle Calls They've Never Seen Before

When a customer calls with a request your AI agent has never encountered, what actually happens? We break down the mechanics of zero-shot handling, and how to test for it before it fails in production.

A network of connected nodes representing protocol communication between AI systems

Tools & MCP·11 min read

MCP Is Now the Industry Standard for AI Agent Integrations. Here's What That Means

MCP standardizes how AI agents connect to tools and data, replacing fragile, proprietary integrations with a universal protocol. Here's what it means for your agents.

selective focus of black and white quadrone - Photo by Kenny Eliason on Unsplash

Agent Architecture·7 min read

Conversational AI vs. Agentic AI: What's the Difference, and Why It Matters for CX Teams

Conversational AI follows scripts. Agentic AI pursues goals. Here's the exact difference, with a side-by-side comparison and a practical guide to choosing the right approach for customer experience.

Watercolor illustration of two interlocking systems — tools and behavioral instructions — powering an AI agent

Tools & MCP·14 min read read

Your agent has 30 tools and no idea when to use them

MCP tools give agents external capabilities. Skills give agents behavioral expertise. Learn the architecture of both, build them in TypeScript, and understand when to use each — and when you need both.

Learn Agentic AI

Weekly. Patterns for shipping agents that work — MCP, scorecards, regression tests, prompts, model comparisons.

500+ builders subscribed