Advanced28 min readยท Topic 11.2

Recommendation systems

Collaborative & content-based filtering, two-tower architecture, cold start, ranking

๐ŸŽฏKey Takeaways

  • 1
    Collaborative filtering: 'users who bought X also bought Y' โ€” based on user behavior similarity
  • 2
    Content-based filtering: recommend items similar to what the user has liked โ€” based on item features
  • 3
    Two-tower architecture: user tower + item tower โ†’ dot product for fast similarity computation
  • 4
    Cold start problem: new users and new items lack data โ€” use content features and popularity as fallback

How Netflix, YouTube, and Amazon Recommend

Recommendation systems drive engagement at every major platform: YouTube's watch time, Amazon's purchases, Spotify's listening, TikTok's scroll time. The core challenge: from millions of items, find the 10-50 most relevant for each user, in real-time.

Modern systems use a multi-stage pipeline: Candidate Generation (narrow millions to hundreds) โ†’ Ranking (score and sort) โ†’ Re-ranking (diversity, business rules, filters).

Recommendation Approaches

Based on user behavior patterns: users who behaved similarly will like similar things.

User-based: find users similar to you, recommend what they liked.

Item-based: find items similar to what you liked, recommend those.

Limitation: cold start โ€” can't recommend for new users or items with no interaction data.

Advantages

  • โ€ขCollaborative filtering captures non-obvious user preferences
  • โ€ขTwo-tower architecture scales to billions of items
  • โ€ขMulti-stage pipeline balances relevance with performance

Disadvantages

  • โ€ขCold start requires fallback strategies
  • โ€ขFeedback loops can create filter bubbles
  • โ€ขRecommendation quality depends heavily on data quality

๐Ÿงช Test Your Understanding

Knowledge Check1/1

What is the cold start problem in recommendation systems?