Escaping ouroboros: build systems that maximize interaction

The early adopters of the AI wave had a gift they didn't fully appreciate: clean ground truth. The pre-2018 web was messy but human. The early quants traded against human intuition. But that world is fading.

We have entered the era of the data ouroboros: models trained on the outputs of models. If you keep training on the raw firehose, you aren't learning reality. You're photocopying a photocopy.

The way out is not more data. It's better interaction.

This piece explains why the convergence of quant shops and AI labs matters, and how to build systems that maximize interaction to escape the loop.

Backpressure: making subagents 10x more effective

Subagents are great at exploring, but terrible at stopping. If you've ever spun up a helper agent and watched it spray dozens of tool calls, you know the pain: high latency, noisy output, and a growing chance it drifts into nonsense.

The fix isn't more prompts. It's backpressure.

In this article, I'll walk through how I made subagents roughly 10x more effective by applying three levers:

  • Reduce tool calls without hurting accuracy
  • Enrich error logs so agents recover faster
  • Cap total steps before runaway behavior sets in
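The first and third levers can be sketched as a budget wrapper around tool calls, with the second lever showing up as an enriched error log. This is a minimal illustration, not the article's actual implementation; every name here (`ToolRunner`, `ToolBudgetExceeded`) is hypothetical.

```python
class ToolBudgetExceeded(Exception):
    """Raised once a subagent has spent its tool-call budget."""


class ToolRunner:
    """Wraps tool calls with a hard budget and an enriched error log."""

    def __init__(self, max_calls: int):
        self.max_calls = max_calls
        self.calls = 0
        self.log: list[str] = []

    def call(self, tool, *args, **kwargs):
        # Backpressure: refuse to run once the budget is spent, and hand
        # back the log so the parent agent can decide what to do next.
        if self.calls >= self.max_calls:
            raise ToolBudgetExceeded(
                f"budget of {self.max_calls} calls spent; log: {self.log}"
            )
        self.calls += 1
        try:
            result = tool(*args, **kwargs)
            self.log.append(f"{tool.__name__}: ok")
            return result
        except Exception as exc:
            # Enriched error log: record what failed and why, so the agent
            # can recover instead of retrying blindly.
            self.log.append(f"{tool.__name__}: {type(exc).__name__}: {exc}")
            raise
```

The key design choice is that the budget failure is loud and carries the full call history, rather than letting the subagent silently drift through dozens of extra calls.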

Drawing on projects I've worked on, you'll learn what it takes to evolve an out-of-the-box subagent into a successful, token-efficient one.

How to perform LLM Evals

Evaluating improvements to LLM pipelines and workflows is key to improving our AI systems. With limited time and resources, the blocker is often overthinking them. In this article, I'll talk through a couple of simple evals for benchmarking your improvements, based on work I've done previously.

The Curse of Overthinking

I've learnt that evals really can be as simple as an assert statement. The goal is to run quick "smoke tests" that ensure your pipeline is working as expected, whilst accounting for stochasticity. From that point, complexity is incrementally earned by structuring your evals around your most common and important failure modes.

If these failure modes aren't immediately apparent to you yet, then hunt for them first.

Working with Voice AI is Easy, Try This

Getting started with Voice AI can be easy. It's important to start simple and progressively build an understanding of what happens under the hood. By doing so, we build a solid foundation for more complex applications. In addition, starting simple and adding complexity slowly helps us compare and appreciate the delta between demo-land and production deployments.

Let's explore the following:

  1. Simple Speech-to-Text (STT)
  2. STT + Structured Outputs (e.g. JSON, Pydantic)
  3. Evals for Audio

For starters, let's keep it simple with a basic speech-to-text provider.
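Here is a provider-agnostic sketch of steps 1 and 2: transcribe audio, then coerce the transcript into a structured record. `fake_transcribe`, `CallerRecord`, and the parsing logic are all illustrative placeholders; in practice the transcription is your STT provider's SDK call, and the extraction step is typically an LLM call with structured outputs (JSON mode or a Pydantic schema).

```python
from dataclasses import dataclass


def fake_transcribe(audio_path: str) -> str:
    # Stand-in for a real STT provider call on `audio_path`.
    return "My name is Ada and my order number is 1234."


@dataclass
class CallerRecord:
    """The structured output we want from the raw transcript."""
    name: str
    order_number: str


def extract_record(transcript: str) -> CallerRecord:
    # Toy extraction: in a real pipeline this would be an LLM call
    # constrained to a schema, not string surgery.
    words = transcript.rstrip(".").split()
    name = words[words.index("is") + 1]
    return CallerRecord(name=name, order_number=words[-1])


record = extract_record(fake_transcribe("call.wav"))
```

The point of the shape, not the toy parser: keep transcription and structuring as separate, swappable stages, so you can eval each one on its own before wiring in a real provider.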