[ Insights ]

Field notes from production AI.

Long-form essays on what actually works when AI meets enterprise reality — written by the engineers who ship it.

Architecture · Featured

Why most AI agents fail in production

The gap between a demo agent and a reliable one isn't the model — it's the eval harness, the tool contracts, and the recovery loops you build around it.

March 2026 · 8 min read

Engineering

Designing tool surfaces LLMs actually use well

Function-calling reliability collapses past roughly twelve tools. We share the routing pattern we use to scale agent toolkits past eighty.

February 2026 · 6 min read

Strategy

The autonomy curve: sequencing AI adoption

Enterprises that win with AI sequence capability in four phases. Skipping a phase is the most common — and most expensive — mistake.

January 2026 · 11 min read

Engineering

Observability for non-deterministic systems

Your existing APM stack wasn't built for agents. Here's the telemetry layer we've standardized across every engagement.

December 2025 · 9 min read

Architecture

RAG is a system, not a feature

Retrieval-augmented generation looks simple until your corpus crosses one million documents. We map the architectural cliffs.

November 2025 · 7 min read

Strategy

Build vs buy: when an off-the-shelf agent is enough

Not every workflow deserves custom engineering. A framework for deciding when to license, when to fine-tune, and when to build.

October 2025 · 5 min read