Blog/Tags/performance

performance

Browse 2 articles tagged with “performance”.

Articles tagged “performance”

2 articles

Side-by-side timeline showing sequential tool calls stacking up to 450ms versus parallel speculative execution finishing in 220ms

Agent Architecture·14 min read

Pre-Execute Tool Calls to Cut Agent Latency 48%

Sequential tool calls quietly kill your agent's response time. PASTE shows you can pre-execute likely tool calls during LLM thinking time and cut latency 48% without touching your model.

A blurry image of a green and white background - Photo by Logan Voss on Unsplash

Testing & Evaluation·15 min read

Performance Benchmarks for AI Agents: What Actually Matters Beyond Word Error Rate

Most enterprises obsess over Word Error Rate while missing the metrics that actually predict success. Here's what to measure instead.

Learn Agentic AI

Weekly. Patterns for shipping agents that work — MCP, scorecards, regression tests, prompts, model comparisons.

500+ builders subscribed