Articles tagged “performance”
2 articles

Agent Architecture·14 min read
Pre-Execute Tool Calls to Cut Agent Latency 48%
Sequential tool calls quietly kill your agent's response time. PASTE shows you can pre-execute likely tool calls during LLM thinking time and cut latency 48% without touching your model.
Read More

Testing & Evaluation·15 min read
Performance Benchmarks for AI Agents: What Actually Matters Beyond Word Error Rate
Most enterprises obsess over Word Error Rate while missing the metrics that actually predict success. Here's what to measure instead.
Read More
Learn Agentic AI
Weekly. Patterns for shipping agents that work — MCP, scorecards, regression tests, prompts, model comparisons.