By Phil Heller — 12 Jun 2026

The Narrow Road to the Deep North

Most people in AI are trying to make the road ever wider.

More powerful models. More tokens. More context. More memory. More tools. More agents. More expensive inference. The direction is understandable. A frontier model can often absorb bad prompts, noisy documents, vague instructions, bloated context, and unclear workflows, then still return something useful.

But that is not the only way.

Matsuo Bashō gives us another road. Bashō was a seventeenth-century Japanese poet and one of the great masters of haiku. His best-known work, The Narrow Road to the Deep North, originally Oku no hosomichi, is a poetic travelogue published in 1694. It records a journey he began in 1689, when he left his home near Edo and traveled on foot through the northern provinces of Japan. The work combines prose, travel observation, historical memory, literary allusion, emotional response, and haiku into one of the major works of classical Japanese literature.

Bashō’s narrow road was not just physical. It was a spiritual discipline. He was not trying to contain everything he saw. He was trying to reduce experience until the essential thing remained.

That is also the discipline of haiku. A haiku is not powerful because it says a lot. It is powerful because it makes very little carry weight. The form gives the poet almost nowhere to hide. Every word has to matter.

That is the bridge to AI.

The lesson is not that less is always more. It is that limits can force better design. A rule, a budget, a deadline, a smaller canvas, a fixed number of words - these create pressure. Under that pressure, the builder has to decide what matters and what can be left out.

AI products now face the same question.

The industry’s default answer is abundance. When a task is hard, use a bigger model. When context is messy, send more of it. When the answer is weak, add more tokens. When the workflow is unclear, let the model infer it. A powerful model can make a weak product look better than it is. It can compensate for bad retrieval, vague instructions, and poor workflow design. But compensation is not architecture.

The better question is not always: what is the strongest model available? The better question is: how much intelligence should live in the product layer before the most efficient model is called?

If the product layer is doing real work, much of the task should already be shaped before it reaches the model. The system should understand what the user is trying to do. It should retrieve the right material, not all material. It should know whether the task is extraction, comparison, drafting, synthesis, planning, or judgment. It should choose the workflow before asking the model to complete it.

Then the model receives a cleaner task. And much of the time, a smaller model should be enough.

That is where Claude Haiku becomes interesting. Not because Haiku is better than the larger frontier models. It is not. Not because every task should be routed to the cheapest model. It should not. Haiku is interesting because it acts like a constraint. It tests whether the product layer has done its job.

A strong AI product should not need the largest model for every step. It should use the large model when the work truly requires it: deep reasoning, hard synthesis, subtle judgment, complex drafting, long-horizon planning. But it should not use raw model power to compensate for every failure upstream.

If the product has already found the right context, framed the task, selected the workflow, and reduced the problem to its essential shape, then Haiku can often do meaningful work. It can classify, extract, format, summarize, route, check, transform, draft simple sections, and execute subtasks because the product has already made the work legible.

That is the practical value of constraint. A haiku forces the poet to decide what belongs. A smaller model forces the product to decide what work should happen outside the model.

In both cases, the limitation is not the point. The choices forced by the limitation are the point.

This is not an argument against frontier models. Some problems deserve the biggest engine. The best systems will know when to use it. But they will also know when not to.

In AI, the breakthrough may not be only more power. It may be learning how to need less of it.

Subscribe to Orthogonal