Chanl

The Chanl Blog

Insights on building, connecting, and monitoring AI agents for customer experience — from the teams shipping them.

All Articles

215 articles · Page 8 of 18

Illustration of a balance scale tilted by invisible weights, representing hidden biases in AI evaluation systems
Learning AI·18 min read

12 Ways Your LLM Judge Is Lying to You

Research identifies 12 systematic biases in LLM-as-a-judge systems. Learn to detect and mitigate each one before they corrupt your eval pipeline.

A filing cabinet with most drawers empty and papers scattered on the floor, watercolor illustration in muted blue tones
Knowledge & Memory·12 min read

Your Agent Completed the Task. It Also Forgot 87% of What It Knew.

Task completion hides a silent failure: agents forget 87% of stored knowledge under complexity. New research reveals why standard evals miss this entirely.

Watercolor illustration of a split dashboard showing human reviewers on one side and automated scoring metrics on the other
Operations·15 min read

74% of Production Agents Still Rely on Human Evaluation

A survey of 306 practitioners reveals most production agents are far simpler than expected. The eval gap isn't a tooling problem. It's a trust problem.

Watercolor illustration of a digital fortress under siege with abstract red and blue waves representing adversarial AI testing
Testing & Evaluation·15 min read

NIST Red-Teamed 13 Frontier Models. All of Them Failed.

NIST ran 250K+ attacks against every frontier model. None survived. Here's what the results mean for teams shipping AI agents to production today.

Visualization of the widening gap between AI agent capability scores and reliability metrics across model generations
Learning AI·15 min read

Your Agent Is Getting Smarter. It's Not Getting More Reliable.

Reliability improves at half the rate of accuracy. Three 85%+ tools combine to just 74%. Here's the math, the research, and the testing protocols that close the gap.

Auto repair shop garage with a phone ringing on the counter while a mechanic works under a lifted car
Side Hustle·14 min read

The Auto Shop That Knows Your Car Better Than You Do

Build an AI phone agent for auto repair shops that answers calls, quotes brake jobs, remembers every vehicle, and sends maintenance reminders.

Warm illustration of a friendly AI assistant at a dental clinic front desk, answering calls while the office is dark and empty at night
Side Hustle·14 min read

A Dental Receptionist That Works Nights and Weekends

Build an AI receptionist for dental clinics that answers insurance questions, books appointments, and captures after-hours leads. Five clients pay $1,500/month.

HVAC service van parked outside a house at night with a phone screen showing an AI assistant booking an appointment
Side Hustle·14 min read

The HVAC Company That Never Misses a Call

Build an AI receptionist that answers HVAC calls 24/7, triages emergencies, and books appointments. Then sell it as a service for $400-500/mo per client.

A phone screen showing a text conversation qualifying a home buyer at 2 AM, with a sleeping real estate agent in the background
Side Hustle·14 min read

The Real Estate Agent Who Qualified Leads While Sleeping

Build an AI lead qualifier for real estate agents. Respond in under 60 seconds, score by budget and timeline, match listings, and book showings automatically.

Warm watercolor illustration of a cozy Italian restaurant kitchen with a phone ringing on the counter
Side Hustle·14 min read

Build a Restaurant AI That Remembers Every Regular

Build an AI phone agent for a local restaurant that takes orders, answers menu questions, and remembers regulars. A developer side hustle worth $400/month per client.

Person surrounded by many tools but looking at an empty notebook
Agent Architecture·5 min read

50 Tools, Zero Memory. The Biggest Gap in AI Agents Today

AI agents can call 50 APIs but can't remember what you said yesterday. The tool layer is years ahead of the memory layer, and customers are paying the price.

Person exploring geometric shapes representing vector space
Learning AI·20 min read

Embeddings Turn Text Into Meaning. Here's the Math and the Code

What embeddings are, how similarity search works under the hood, and how to build a semantic search engine, from cosine similarity math to production vector databases.

