ChanlChanl
Blog/Tags/llm

llm

Browse 16 articles tagged with “llm”.

Articles tagged “llm

16 articles

Illustration of a balance scale tilted by invisible weights, representing hidden biases in AI evaluation systems
Learning AI·18 min read

12 Ways Your LLM Judge Is Lying to You

Research identifies 12 systematic biases in LLM-as-a-judge systems. Learn to detect and mitigate each one before they corrupt your eval pipeline.

Read More
Developer comparing small and large AI model outputs on a monitor
Learning AI·18 min read

A 7B Domain Model Beat Everything We Tried

Domain-specific language models are beating trillion-parameter generalists on vertical tasks. Here's when a 7B model is the right call, how the training pipeline works, and what production teams are shipping today.

Read More
Illustration of a neural network with low-rank adapter matrices injected between layers, showing only a small percentage of parameters highlighted for training
Learning AI·19 min read

Fine-Tune a 7B Model for $1,500 (Not $50,000)

Full fine-tuning costs $50K in H100s. QLoRA on an RTX 4090 costs $1,500. Learn how LoRA and QLoRA let you train only 0.1-1% of parameters with nearly identical results, with working code for fine-tuning models that understand your agent's tool schemas.

Read More
Neural network distillation visualization showing a large teacher model transferring knowledge to a compact student model
Learning AI·16 min read

A 1B Model Just Matched the 70B. Here's How.

How to distill frontier LLMs into small, cheap models that retain 98% accuracy on agent tasks. The teacher-student pattern, NVIDIA's data flywheel, and the Plan-and-Execute architecture that cuts agent costs by 90%.

Read More
Small chip outperforming a rack of servers
Learning AI·14 min read

Why Your AI Bill Is 30x Too High

Small language models match GPT-3.5 at 2% of the size and 95% less cost. Benchmarks, code, and a migration story from $13K/month to $400.

Read More
Watercolor illustration of descending cost bars alongside token streams flowing through an optimization pipeline
Operations·16 min read read

Your AI Agent Costs $13K/Month. Here's the Fix.

A production customer-service agent burned $13,247 in one month. Prompt caching, model routing, batch processing, and plan-and-execute architecture cut it to $1,100. Real pricing math for every technique.

Read More
Two men filming a scene outdoors with artwork. - Photo by Luke Thornton on Unsplash
Testing & Evaluation·12 min read

Zero-Shot or Zero Chance? How AI Agents Handle Calls They've Never Seen Before

When a customer calls with a request your AI agent has never encountered, what actually happens? We break down the mechanics of zero-shot handling, and how to test for it before it fails in production.

Read More
Claude AI agent development tools with code on a developer workspace
Agent Architecture·20 min read read

Claude 4.6 broke our production agent in two hours — here's what's worth the migration

A practical developer guide to Claude 4.6 — adaptive thinking, 1M context, compaction API, tool search, and structured outputs. Real code examples in TypeScript and Python for building production AI agents.

Read More
selective focus of black and white quadrone - Photo by Kenny Eliason on Unsplash
Agent Architecture·7 min read

Conversational AI vs. Agentic AI: What's the Difference, and Why It Matters for CX Teams

Conversational AI follows scripts. Agentic AI pursues goals. Here's the exact difference, with a side-by-side comparison and a practical guide to choosing the right approach for customer experience.

Read More
Watercolor illustration of converging streams representing voice, vision, and text flowing into an AI agent system
Agent Architecture·28 min read read

Multimodal AI Agents: Voice, Vision, and Text in Production

How to architect multimodal AI agents that process voice, vision, and text simultaneously — from STT→LLM→TTS pipelines to vision integration, latency budgets, and production fusion strategies.

Read More
Developer comparing two approaches on a whiteboard
Knowledge & Memory·20 min read

Fine-tuning vs RAG: why most teams pick wrong and how to decide

When to fine-tune, when to use RAG, and when you need both — with hands-on LoRA fine-tuning and RAG implementation on the same task to show the difference.

Read More
Illustration of two people reviewing an improvement chart together at a standing desk
Learning AI·20 min read

How to Evaluate AI Agents: Build an Eval Framework from Scratch

Build a working AI agent eval framework in TypeScript and Python. Covers LLM-as-judge, rubric scoring, regression testing, and CI integration.

Read More
Illustration of a person writing thoughtfully at a desk with sticky notes and a warm lamp
Learning AI·25 min read

Prompt Engineering from First Principles: 12 Techniques Every AI Developer Needs

Master 12 essential prompt engineering techniques with real TypeScript examples. From zero-shot to ReAct, build better AI agents from first principles.

Read More
man in blue dress shirt sitting on black office rolling chair - Photo by David Schultz on Unsplash
Agent Architecture·22 min read

How Multimodal Voice AI Works: From Audio-Only to Vision-Aware Agents

How multimodal voice AI combines speech, vision, and text into a single agent — architecture patterns, latency tradeoffs, and TypeScript code you can run.

Read More
text - Photo by Artur Shamsutdinov on Unsplash
Agent Architecture·16 min read

How LLMs Changed Agent Training Forever: From Writing Rules to Writing Prompts

LLMs didn't just improve agent training. They changed the entire discipline. Here's what actually shifted, what works in production, and what the industry still gets wrong.

Read More
a man and a woman standing in front of a whiteboard - Photo by Walls.io on Unsplash
Knowledge & Memory·16 min read

Prompt engineering vs. context engineering: What's the next step for voice AI?

While prompt engineering focuses on perfecting inputs, context engineering optimizes the entire conversation environment. Discover why context engineering is becoming the key differentiator in voice AI.

Read More

Learn Agentic AI

One lesson a week — practical techniques for building, testing, and shipping AI agents. From prompt engineering to production monitoring. Learn by doing.

500+ engineers subscribed