Back to posts
2026-05-25 · 9 min read

Anti-Convergence Design: Why Biology Doesn't Let You Stand Still — and What That Teaches AI Agents

Biology and AI engineering have independently arrived at the same design principle: no system that needs to adapt can be allowed to converge permanently. The mechanisms behind hedonic adaptation in humans mirror the mechanisms behind exploration schedules in agents. Lessons for how you build organizations, products, and yourself.

TuanBy @tuan

There's a question I've spent weeks thinking about.

Why can't we hold a good state? A relationship at its most romantic. A success at its peak. A joy at its most complete. The strongest feeling.

They all fade. Without external cause. Without anyone breaking them. The nervous system itself pulls every state back to baseline.

This isn't accidental. It's design.

And the interesting part: when we build AI agents, we've been rediscovering the same design — not on purpose, but because systems without anti-convergence mechanisms fail in predictable ways.

Homeostasis — the foundational principle

Biology operates on a basic principle: homeostasis. Every biological system has a normal range and actively pulls back toward it when pushed.

Body temperature stays around 37°C. Blood sugar stays within a narrow band. Blood pressure, electrolyte concentrations, blood pH — all kept in tight ranges.

When you're hungry, blood sugar drops, the system pushes you to eat. When you've eaten too much, blood sugar spikes, the system secretes insulin. Never stable at abnormally high or abnormally low.

What's less commonly understood: emotional homeostasis exists too. And it operates the same way.

Hedonic adaptation

Daniel Kahneman and colleagues showed a strange phenomenon: lottery winners and people who lost limbs in accidents, after 1-2 years, have roughly equivalent mean happiness and roughly equivalent to their pre-event baseline.

This is hedonic adaptation — the brain adjusts so that extreme emotional states cannot persist.

The mechanism: dopamine receptors downregulate when exposed to high dopamine. Stress hormone receptors up- or down-regulate depending on conditions. The brain actively rewires its own biochemistry so any high-intensity state becomes the new background after a while.

This is why:

Deep love after 18-24 months loses the early intensity — not because love died, but because the receptors changed.

Chronic pain sufferers can function — the body adjusted so pain no longer paralyzes.

People long-accustomed to wealth don't experience wealth as special — it became baseline.

Hedonic adaptation doesn't distinguish good from bad. It just pulls toward baseline.

Why evolution designed it this way

The natural question: why? Why doesn't evolution let us hold good states?

The answer is survival function.

One, signals need contrast. If everything is always wonderful, nothing is wonderful. Joy has meaning because it contrasts with baseline. Pain has meaning because it contrasts with no-pain. A system at a permanent extreme loses signal capacity.

Two, high energy is expensive. Intense emotions — positive and negative — burn biochemical resources. The system can't sustain high intensity without depletion. Evolution prefers systems that return to economy after events.

Three, environments change. Organisms holding a fixed state when environments shift will die. Organisms that adapt to new conditions — even when old conditions were better — survive.

Four, organisms must keep searching. If we felt complete forever, we'd stop seeking food, mates, or threats. Evolution designed every fullness to fade, creating action drive.

This is anti-convergence design. Biology doesn't let any state become the endpoint. Every peak falls. Every trough rises. Life is oscillation around baseline, not a journey to a summit.

AI agents — why we have to rediscover the same design

When we started building production AI agents seriously this past year, certain lessons couldn't be theorized — they had to be learned through failure.

One of those lessons: agents without anti-convergence mechanisms fail predictably.

Specifically:

Agent loops without escape conditions. Our early agent ran plan-execute-observe-replan. Beautiful in theory. In practice, the agent fell into local maxima — "good enough" solutions it couldn't escape. It kept optimizing around the same solution, never exploring other regions of the space.

The fix: temperature schedules. Force the agent to sometimes pick a non-optimal action, to escape false convergence points. This is exactly anti-convergence — forcing the system not to stabilize in one state.

This isn't new. It's a re-implementation of exploration vs exploitation — the old reinforcement-learning problem. But more importantly, it's the same principle biology solved: a system needs a mechanism that forces non-permanent convergence.

Memory without decay. Our agent's initial long-term memory weighted every event equally. After a few hundred conversations, the context window filled with old, irrelevant information. The agent made worse decisions because the past was noise.

The fix: memory decay. Old information fades unless re-activated. This is exactly how the human brain works — unretrieved memories weaken, retrieved memories strengthen.

Why? Because keeping everything forever is a burden. A resource-bounded system must have a deletion mechanism. Both brains and AI agents.

Reward functions without saturation. An agent optimizing a single metric (say, revenue) will converge on extreme behavior that maximizes it. Goodhart's law: when a measure becomes a target, it stops being a good measure.

The fix: diminishing returns in the reward function. Each unit increase in the metric yields decreasing reward. This forces the agent not to converge on a single strategy.

This is how biology works. Eat until full — past full, dopamine from eating drops. Drink until satisfied — past satisfied, more drinking feels like nothing. The system is designed so every satisfaction saturates.

State without context refresh. An agent holding a state too long without refresh will decide based on stale assumptions. In a changing environment, this fails.

The fix: forced context refresh on a schedule. Make the agent revisit state, check assumptions, update. This is the opposite of convergence — forcing the system to question itself.

In biology, this is the function of sleep and REM. Every night, the brain reprocesses the day's experiences, updates its model of the world, consolidates some memories and discards others. People in prolonged sleep deprivation suffer severe cognitive decline — not from tiredness alone, but because the system has no update mechanism.

The lesson from a two-way parallel

When biology and AI engineering converge on the same design principle, that's a strong signal the principle isn't accidental. It's a property of the problem.

The problem: how does a system maintain adaptive capacity in a changing environment.

The shared answer — biology's and AI's: don't allow permanent convergence. There must be a mechanism that forces exit from every stable state, including the good ones.

This is a hard truth for anyone wanting to optimize their life:

There is no "optimal stable." Any state that feels perfect will fade. Not because anything is wrong. Because this is how the system is designed.

There is no "arrived." Every achievement becomes baseline. Continuing to need new goals isn't greed — it's biology rejecting standstill.

There is no "permanent emotional stability." People who appear constantly balanced haven't reached a final state — they're oscillating in a narrow band, with a low baseline.

Consequences for how you build organizations

This post seems about technical design. But it has large consequences for leadership.

Organizations have homeostasis too. No culture can hold a high state forever. Every reorg that inspires at first will fade. Every grand vision becomes background after 18-24 months.

This isn't bad leadership. It's how social systems work — the same principle that pulls body temperature to 37°C.

Consequence: leaders must continuously inject new energy to keep the organization in the desired state. Not because the organization is "broken" — because converging to baseline is the design.

Don't expect employees to maintain peak forever. An employee who's brilliant in their first project won't be brilliant at the same intensity in their fifth. They didn't get worse — hedonic adaptation applies to work.

The right structure: cycles. Peak → normal → recovery → next peak. Not permanent peak.

KPIs need to change over time. A good metric for this year will become Goodhart in 2-3 years — the organization learns to optimize the metric instead of optimizing the real outcome. This isn't because the organization is bad. It's a property of optimizing systems — they converge on the metric.

Good leaders change metrics often enough to prevent convergence. Not every quarter — too fast. Not for 10 years — too slow. Change on the natural rhythm of hedonic adaptation: 18-24 months.

Crisis is a self-correcting mechanism. When an organization hits a crisis, that's not failure — it's the system being forced to update. As fever in the body isn't disease but an immune mechanism. Good leaders don't avoid crisis — they use crisis to update assumptions.

Consequences for individuals

Don't try to hold the high. Joy will fade. Romantic love will become background. Success will become baseline. Not because anything is wrong — because this is the design. Enjoy the high state when it's there, don't pour energy into freezing it.

Don't believe the low is permanent. Grief will fade. Pain will lose intensity. Depression will change shape. This is also design. Invest in the process, not in the belief that the current state is the end.

Cooperate with the design, don't fight it. Trying to hold a positive state forever fights biology. Trying to escape a negative state instantly also fights biology. Let oscillation happen — oscillation itself is health.

Don't artificially extend high intensity. When the brain wants to fade an emotion, don't build rituals that keep it alive. The system is trying to do its work. Cooperate.

Update assumptions periodically. Every 2-3 years, sit down with the foundational assumptions about your life. Updating proactively reduces accumulated pressure. People who don't update proactively get forced to update through crisis — much more expensive.

A final question

Biology is not partial. AI agents are not partial. Both provide anti-convergence mechanisms for every state — good and bad.

The same mechanism that fades joy also fades pain. The same mechanism that makes romantic love become background also makes grief become background. The same mechanism that doesn't allow organizations to stay stable forever doesn't allow crises to last forever.

The system provides capability. The choice — what to hold, what to release — stays with the user.

The system is anti-convergence. But it doesn't tell you what to converge on — it has no preferred destination. It only doesn't let you stand still.

The question isn't will I change. You will — biology guarantees this. Your agents will — code guarantees this.

The question is am I participating in this change consciously, or letting it happen to me.

That's the difference between people who mature through decades — and people defeated by decades. Between organizations that adapt through crisis — and organizations destroyed by crisis. Between agents that self-update — and agents stuck in local maxima.

Biology doesn't care which one you choose. AI agents don't care. But your life, your organization, your products — do.

test enPart 2 / 2
  1. 1When Nobody Understands Software Anymore
  2. 2Anti-Convergence Design: Why Biology Doesn't Let You Stand Still — and What That Teaches AI Agentsyou are here