Recs Algorithm Engine

This section explains how the recommendation engine works, including the data inputs it relies on, the minimum data required for reliable output, and how performance scales with increasing data volume.

Data Inputs: Behavioural Events

Data Inputs are the behavioural and catalogue signals (like clicks, views, purchases, and product attributes) that power the recommendation engine. The quality and volume of these inputs directly determine how accurate, relevant, and personalised your recommendations are.

The recommendation engine is powered by behavioural event data captured via the Unbxd clickstream SDK.

Depending on the algorithm, one or more of the following signals are used:

Event TypeDescriptionSignal Strength
ClickA shopper clicks on a productLow
Add to CartA shopper adds a product to their cartMedium
Order / PurchaseA shopper completes a purchaseHigh
Page / PDP ViewA shopper views a product detail pageLow–Medium
Search QueryText queries entered by the shopperMedium

Minimum Data Requirements

While the engine can operate with minimal data, stronger behavioural signals lead to more accurate and personalised recommendations. Below certain thresholds, performance may be limited.

MetricRecommended MinimumNotes
Lookback Window7 days (rolling)Balances recency with sufficient data volume
Total behavioural events5,000 eventsCombined clicks, cart adds, and orders
Unique users1,000 visitorsRequired for user-level personalization
Unique products interacted500+ stock keeping unitsEnables meaningful product associations
Purchase events500+ ordersCritical for sales-based and collaborative filtering

Graceful Degradation

When data is limited, the engine automatically adjusts its strategy as per below:

Data VolumeImpact
< 1,000 eventsFalls back to catalog-based signals (e.g., product attributes, recency)
1,000–5,000 eventsRecommendations available but less personalized
5,000+ eventsFull algorithm performance with reliable personalization
10,000+ eventsHigh-confidence recommendations across all algorithms

Low-Dependency Algorithms

Some algorithms are available from day one and do not depend on large behavioural datasets.

These include:

  • Recently Viewed
  • Recs Based on Recently Viewed
  • Recs Based on Last Viewed
  • More Like This
  • Catalog-driven variants

These rely primarily on session data or product attributes, making them effective even in low-data scenarios.