AI Glossary · Letter U

User Embedding.

A dense vector representation of a user learned from behavioral interaction data, encoding the user’s preferences, interests, and patterns as a point in a continuous multi-dimensional space where similar users are close together and dissimilar users are far apart. User embeddings are the core representation in recommendation systems, personalized ranking, audience look-alike modeling, and dynamic creative personalization, enabling AI systems to reason about user similarity and preference at a scale that handcrafted feature engineering cannot match.

Also known as user representation, user vector, user feature embedding

What it is

A working definition of user embedding.

A user embedding is a learned vector of typically 32 to 512 floating-point numbers that represents a user in a continuous preference space. These numbers are not handcrafted features such as age or category preference scores; they are the output of a neural network trained to predict user behavior, with the embedding layer’s output serving as the compact representation of everything the model has learned about the user from their interaction history. Users with similar behavioral patterns will have similar embeddings, meaning their vectors will be close in the embedding space as measured by cosine similarity or Euclidean distance.

User embeddings are typically learned jointly with item embeddings in a two-tower neural network architecture. The user tower takes the user’s interaction history, profile attributes, or both as input and outputs a user embedding. The item tower takes item features as input and outputs an item embedding. The model is trained to maximize the dot product between user and item embeddings for (user, item) pairs that have been positively interacted with, and to minimize it for negative pairs. After training, the similarity between a user’s embedding and all available item embeddings quantifies the predicted affinity of that user for each item, enabling rapid nearest-neighbor retrieval of the top-k most relevant items.

User embeddings can also be learned from sequence models that process the ordered history of a user’s interactions as a temporal sequence. Transformer-based sequential recommendation models treat a user’s click or purchase history as a sequence of items, apply self-attention over the sequence, and produce a contextual representation of the user that reflects not only which items they have engaged with but in what order and with what temporal recency. These sequential user representations capture dynamic preference shifts, such as a user transitioning from browsing budget options to researching premium products over several sessions, that static user embeddings based on aggregate behavioral summaries would miss.

Why ad agencies care

Why user embeddings enable the personalization depth and similarity-based audience discovery that rule-based systems cannot provide.

A working ad agency building personalization systems, look-alike audiences, or dynamic creative optimization for clients needs user embeddings when rule-based segmentation reaches its ceiling. Segments defined by explicit rules (age range, category preference flag, recency tier) describe users along discrete dimensions and cannot capture the complex multi-dimensional preference structure that actually differentiates how users respond to specific content or product offers. User embeddings represent users in a continuous space that simultaneously encodes hundreds of behavioral dimensions, enabling personalization decisions that are sensitive to the full behavioral context rather than a handful of hand-chosen attributes.

Look-alike audience expansion using user embeddings identifies prospects that resemble high-value customers along multiple behavioral dimensions simultaneously. A look-alike model that computes user embeddings from behavioral data, then identifies prospects whose embeddings are nearest to the embedding centroid of the seed audience in embedding space, selects prospects that resemble the seed audience holistically rather than on any single feature. This multi-dimensional similarity captures behavioral look-alikes that demographic or single-feature targeting misses: a prospect who shares the same content engagement pattern, browsing depth, category sequence, and session recency profile as high-value customers, regardless of whether they match on any individual demographic attribute. Embedding-based look-alike models consistently outperform feature-score-based look-alike approaches on conversion rate lift in controlled tests.

User embedding drift over time reflects genuine preference evolution and requires monitoring to maintain recommendation and targeting quality. A user’s interests and preferences are not static; a user who was heavily engaged with hiking gear content 12 months ago may have shifted their interests toward home fitness over the subsequent year. If a user embedding is computed once from static historical data and not updated, the embedding will reflect the user’s past preferences rather than their current ones, degrading personalization quality over time. Monitoring the average drift between successive user embedding updates, and the correlation between embedding freshness and recommendation click-through rate, provides early warning that static embeddings are becoming stale and triggering recomputation is worthwhile.

Cross-session user embeddings enable consistent personalized experiences even when users browse without logging in. Many users browse across sessions without authenticating, making it difficult to maintain a persistent user representation for personalization. User embeddings computed from session-level behavioral signals such as the sequence of pages viewed, content categories explored, and time spent per content type can be generated in real time from the current session without requiring a persistent user identity. These session-level embeddings enable personalization within and across anonymous sessions by finding items whose embeddings are similar to the session embedding, providing a consistent personalized experience without requiring the user to authenticate.

In practice

What user embedding looks like inside a working ad agency.

An agency builds a dynamic product recommendation system for a specialty beauty retailer client with 6,200 active SKUs and 1.4 million cookied users in its first-party data environment. The prior recommendation system used a collaborative filtering approach that required a minimum of 5 purchases per user to generate recommendations; 68% of users had fewer than 5 purchases and received popularity-based recommendations rather than personalized ones. The agency trains a two-tower neural network model using 18 months of interaction data including product page views, add-to-cart events, purchases, and wishlist additions. The user tower processes each user’s 90-day interaction sequence and outputs a 128-dimensional user embedding. The item tower processes each product’s attribute features (category, ingredient profile, price tier, finish, skin type compatibility) and outputs a 128-dimensional item embedding. Training uses sampled softmax loss with 100 negative samples per positive interaction. After training, the model produces embeddings for all 1.4 million users. Users with fewer than 5 purchases have embeddings learned from their browse and engagement behavior rather than purchase history alone, producing personalized recommendations for 91% of the user base (up from 32% with the prior purchase-history-only system). Offline evaluation on a 60-day held-out period shows the embedding-based recommendation model achieves recall at 20 (fraction of next-purchased items appearing in the top 20 recommendations) of 0.31, versus 0.18 for the prior collaborative filter and 0.09 for popularity-based recommendations. A 4-week A/B test of the embedding-based system against the prior system on 50% of live traffic shows 27% improvement in revenue per recommendation session and 19% improvement in average order value, with the largest gains among users with fewer than 5 purchases where the new system provides genuinely personalized recommendations for the first time.

Build the user representation and personalization expertise that enables recommendation and targeting systems to leverage the full depth of behavioral data through The Creative Cadence Workshop.

The generative AI foundations module covers user embeddings including two-tower networks, sequential recommendation models, embedding similarity retrieval, and the monitoring practices that maintain embedding quality as user preferences evolve over time.