agent-infrastructure
Browse 10 articles tagged with “agent-infrastructure”.
Articles tagged “agent-infrastructure”
10 articles

74% of Production Agents Still Rely on Human Evaluation
A survey of 306 practitioners reveals most production agents are far simpler than expected. The eval gap isn't a tooling problem. It's a trust problem.

What to Trace When Your AI Agent Hits Production
OpenTelemetry GenAI conventions are the production standard for agent tracing. What to instrument, what to skip, and what breaks — from a 2 AM debugging war story.

Your AI Agent Costs $13K/Month. Here's the Fix.
A production customer-service agent burned $13,247 in one month. Prompt caching, model routing, batch processing, and plan-and-execute architecture cut it to $1,100. Real pricing math for every technique.

Why Browser Agents Waste 89% of Their Tokens
Browser agents burn 1,500-2,000 tokens per screenshot. Chrome 146's navigator.modelContext API lets websites expose structured tools instead, cutting token usage by 89% and raising task accuracy to 98%. Here's how WebMCP works.

AI Agent Observability: What to Monitor When Your Agent Goes Live
Build a production observability pipeline for AI agents. Covers latency, token usage, tool success rates, conversation quality, drift detection, structured logging, alerting strategies, and the critical difference between LLM and agent observability.

AI Agent Testing: How to Evaluate Agents Before They Talk to Customers
A practical guide to testing AI agents before production — scenario-based testing with AI personas, scorecard evaluation, regression suites, edge case generation, and CI/CD integration.

Herramientas para Agentes de IA: MCP, OpenAPI y Gestión de Herramientas que Realmente Escala
Cómo los agentes de IA en producción descubren, ejecutan y gestionan herramientas: desde el protocolo MCP hasta la importación automática de OpenAPI, sandboxing de seguridad e infraestructura de herramientas multi-tenant.

MCP Deep Dive: Advanced Patterns for Agent Tool Integration
Production MCP patterns for teams who've built their first server and need to scale it — OAuth 2.1 with PKCE, Streamable HTTP transport, gateways, sampling, dynamic tool registration, and multi-tenant security.

Multimodal AI Agents: Voice, Vision, and Text in Production
How to architect multimodal AI agents that process voice, vision, and text simultaneously — from STT→LLM→TTS pipelines to vision integration, latency budgets, and production fusion strategies.

How Multimodal Voice AI Works: From Audio-Only to Vision-Aware Agents
How multimodal voice AI combines speech, vision, and text into a single agent — architecture patterns, latency tradeoffs, and TypeScript code you can run.
Aprende IA Agéntica
Una lección por semana: técnicas prácticas para construir, probar y lanzar agentes IA. Desde ingeniería de prompts hasta monitoreo en producción. Aprende haciendo.