Articles tagged “context-window”
4 articles

1M-Token Context or RAG? How to Pick for Your CX Agent
Gemini's 1M-token window is real but not free. A practical decision framework for choosing between long-context and RAG for customer experience agents, with cost numbers, code, and the hybrid pattern most production teams land on.

Stop Loading All Your MCP Tools at Once
Loading 50 MCP tools burns 72K tokens before your agent says a word. Progressive tool discovery fixes that: smaller context, sharper decisions, real code patterns.

Your MCP server is a monolith. Here's how to fix it
MCP servers dump every tool into the context window, burning tokens before your agent reasons. Four patterns to fix it: decompose, filter, gateway, facade.

Your AI Agent's Context Window Is Already Half Full
System prompts, tool schemas, MCP descriptions, memory injection, conversation history. They all eat tokens before the user says a word. Learn where your context budget goes and how to manage it.