AI Glossary · Letter F

Feedback Loop.

A cycle where a model’s predictions influence the data it subsequently receives, which in turn shapes future model training, potentially amplifying the model’s initial biases or assumptions rather than correcting them. For agencies, feedback loops are why AI systems that optimize on their own outputs can drift toward narrow, self-confirming behavior over time without any single decision causing the problem.

Also known as algorithmic feedback loop, model feedback cycle, self-reinforcing bias

What it is

A working definition of the feedback loop.

A feedback loop forms when a model’s outputs affect the environment it observes, and those environmental changes feed back into future model inputs. The problem is that the data generated after a model is deployed reflects the model’s decisions, not an unbiased sample of what would have happened without the model. A recommendation system that surfaces content the model predicts will be popular generates engagement data only for the content it showed; content it did not show generates no engagement data. The model then trains on engagement data that reflects its own prior choices, reinforcing those choices rather than learning from a neutral comparison.

Feedback loops can be positive, meaning self-amplifying, or negative, meaning self-correcting. Positive feedback loops are the more common concern in AI systems: a model that over-recommends a certain content type generates engagement with that content type, which increases the model’s estimate of that content type’s value, which increases recommendation frequency, in a cycle that can drift the model far from a globally optimal recommendation policy. Negative feedback loops correct themselves: if a model over-recommends a type of content until users become fatigued and engagement drops, the lower engagement signal eventually corrects the over-recommendation tendency, though typically after meaningful performance degradation.

Detecting feedback loops requires comparing model behavior to what would have been observed under counterfactual conditions. This is inherently difficult because the counterfactual data, what would have happened if the model had recommended something different, does not exist by definition. Exploration strategies that intentionally surface non-recommended content to a control group, and offline evaluation methods that use historical logged data with appropriate weighting, are the standard approaches for breaking the loop enough to measure it.

Why ad agencies care

Why feedback loops matter more in agency work than in most industries.

Every automated campaign optimization system a working ad agency deploys creates a feedback loop: the system allocates budget to the placements, audiences, and creative it predicts will perform best, and the performance data it receives reflects only the allocations it made. This is unavoidable in any system that optimizes rather than randomly tests, and it means that all AI optimization systems are, to some degree, learning from their own choices rather than from unbiased signal. Understanding this structure is necessary for managing the long-run behavior of these systems rather than being surprised when they drift.

Audience narrowing is a common feedback loop outcome. A lookalike audience model that optimizes conversions will progressively narrow toward the segment of the target audience most likely to convert given the current creative and media approach, excluding segments that might convert well under different conditions. Over multiple optimization cycles, the reachable audience shrinks, CPMs rise as the model competes with itself for a narrower pool, and performance plateaus without an obvious cause. The cause is the feedback loop gradually excluding everything the model has not observed converting.

Creative fatigue acceleration is a feedback loop problem. An automated creative optimization system that concentrates budget on the best-performing variant will exhaust that variant’s audience faster than a diversified approach would. The engagement data the system receives is dominated by the top variant, making it appear to perform even better than a fresh variant would, which concentrates budget further, accelerating fatigue. Breaking this loop requires explicit policies about how quickly a dominant variant’s share can grow and how much budget must remain allocated to new creative regardless of current performance rankings.

Feedback loops in scoring models entrench historical patterns. A lead scoring model trained on historical conversion data learns which lead characteristics have historically correlated with conversion. If those characteristics reflect historical marketing choices, such as which segments received attention and outreach, the model will score future leads to direct attention toward the same segments, producing conversion patterns that confirm the model’s initial assumptions. The leads the model down-scores never receive outreach, never convert, and their absence from the positive training examples confirms the model’s estimate that they are low-value. The historical pattern becomes self-fulfilling.

In practice

What feedback loop looks like inside a working ad agency.

An agency runs a personalized content recommendation system for a B2B media client. Over six months of continuous optimization, the system progressively concentrates recommendation traffic on a narrow set of topic categories that historically showed high engagement. New content in underrepresented categories receives little traffic because the model has low confidence estimates for new categories, and low traffic produces low engagement data, which keeps the model’s confidence estimates low. The editorial team notices that 73% of recommendation traffic is concentrating on three of the publication’s twelve topic categories. The agency implements an exploration policy that allocates 15% of recommendation traffic to a random sample of non-recommended content, generating unbiased engagement signal for categories the model has been ignoring. After three months of exploration data collection, the model retrains on the broadened signal and the top-three category concentration drops to 51%, engagement per session increases 8% as users encounter more relevant content from categories they had not been shown, and four previously neglected categories emerge as strong performers with sufficient engagement history to be reliably recommended.

Build AI optimization systems with the exploration policies that prevent feedback loops from narrowing performance over time through The Creative Cadence Workshop.

The automations and agents module of the workshop covers how to design and monitor AI-powered optimization systems that remain adaptive over time rather than converging on locally optimal but globally narrow solutions.