AI Glossary · Letter C

Class Imbalance.

A condition in training data where some outcome categories appear far more frequently than others, which causes predictive models to favor the majority class and underperform on the rare one. For agencies, class imbalance explains why AI tools built on real marketing data often struggle with the outcomes that matter most: conversions, churn events, and high-value prospects.

Also known as imbalanced classes, skewed class distribution, unbalanced data

What it is

A working definition of class imbalance.

Most real-world classification problems are imbalanced. Conversions are rare relative to site visits. High-value customers are rare relative to the full customer base. Fraudulent transactions are rare relative to legitimate ones. When a model trains on this data without adjustment, it learns that predicting the majority class is a reliable way to minimize overall error, because being wrong about the rare class costs less in aggregate than being wrong about the common one.

A lead scoring model trained on data where 2% of leads converted might achieve 98% accuracy by classifying everything as non-converting. This accuracy is meaningless for the task. Addressing class imbalance requires techniques like oversampling the minority class, undersampling the majority, generating synthetic minority examples (SMOTE), or adjusting the model’s loss function to penalize minority-class errors more heavily.

The presence of class imbalance also changes which evaluation metrics matter. Overall accuracy is misleading when classes are imbalanced. Precision, recall, the F1 score, and the area under the precision-recall curve are more informative metrics for imbalanced classification tasks.

Why ad agencies care

Why class imbalance might matter more in agency work than in most industries.

Marketing data is almost always imbalanced toward the non-event. Clicks, conversions, and high-value leads are rare. Agencies evaluating AI tools that use classification models need to understand how those models handle imbalanced training data, because the answer determines whether the tool is actually useful for the rarest and most valuable predictions.

Accuracy is not the right metric for marketing models. A lead scoring model that gets 98% of predictions right by always predicting “not qualified” is useless. Agencies should ask vendors for precision and recall breakdowns, not just accuracy. Any vendor unable to provide these metrics for their classification tools is either not measuring the right things or concealing poor performance on the minority class.

Rare event models require special handling. Models for churn prediction, conversion propensity, and high-value customer identification are all rare event models. Vendors who do not acknowledge class imbalance in their documentation for these tools have either solved it without mentioning it or have not solved it at all. The question is worth asking directly.

Data collection strategy affects imbalance severity. Agencies helping clients design data collection programs can influence how severe class imbalance becomes. Enriching training data with targeted examples of rare positive events, such as detailed records of past conversions, gives models better coverage of the outcomes that matter most.

In practice

What class imbalance looks like inside a working ad agency.

An agency deploys a churn prediction model for a subscription client. The vendor reports 96% accuracy. A closer look at the confusion matrix reveals the model correctly identifies 94% of non-churners but only 31% of actual churners, because churners represent 4% of the training data. The model is useless as a retention tool despite its impressive overall accuracy. The agency requires the vendor to rebalance the training data and retrain with recall on churners as the primary optimization objective. After retraining, accuracy drops to 89% but churn recall rises to 74%, making the model genuinely useful for the retention campaign it was supposed to power.

Build the model literacy to evaluate AI tools on the metrics that actually matter through The Creative Cadence Workshop.

The generative AI foundations module of the workshop covers how today’s models work, what they can and can’t do, and how to choose between them.