MLconf NYC Justin Basilico

Learning a Personalized HomepageJustin BasilicoPage Algorithms Engineering April 11, 2014

NYC 2014

@JustinBasilico

Change of focus

2006 2014

Netflix Scale > 44M members

> 40 countries

> 1000 device types

> 5B hours in Q3 2013

Plays: > 30M/day

Log 100B events/day

31.62% of peak US downstream traffic

Approach to Recommendation“Emmy Winning”

Help members find content to watch and enjoy to maximize member satisfaction and retention

Everything is a Recommendation

Ranking

Over 75% of what people watch comes from our recommendations

Recommendations are driven by Machine Learning

Top Picks

Personalization awareness

Diversity

Personalized genres Genres focused on user interest

Derived from tag combinations

Provide context and evidence

How are they generated?

Implicit: Based on recent plays, ratings & other interactions

Explicit: Taste preferences

Hybrid: combine the above

Similarity Find something similar to

something you’ve liked

Because you watched rows

Also Video display page In response to user actions

(search, list add, …)

Support for Recommendations

Social Support

Learning to Recommend

Machine Learning Approach

Problem

ModelAlgorithm

Metrics

Data Plays

Duration, bookmark, time, device, …

Ratings

Metadata Tags, synopsis, cast, …

Impressions

Interactions Search, list add, scroll, …

Social

Models & Algorithms Regression (Linear, logistic, elastic net)

SVD and other Matrix Factorizations

Factorization Machines

Restricted Boltzmann Machines

Deep Neural Networks

Markov Models and Graph Algorithms

Clustering

Latent Dirichlet Allocation

Gradient Boosted Decision Trees/Random Forests

Gaussian Processes

Offline/Online testing process

Rollout Feature to all users

Offline testing

Online A/Btesting[success] [success]

[fail]

days Weeks to months

Rating Prediction First progress prize

Top 2 algorithms

Matrix Factorization (SVD++)

Restricted Boltzmann Machines (RBM)

Ensemble: Linear blend

Videos

(99% Sparse) d

Videos

Ranking by ratings

4.7 4.6 4.5 4.5 4.5 4.5 4.5 4.5 4.5 4.5

Niche titlesHigh average ratings… by those who would watch it

Learning to Rank Approaches:

Point-wise: Loss over items (Classification, ordinal regression, MF, …)

Pair-wise: Loss over preferences (RankSVM, RankNet, BPR, …)

List-wise: (Smoothed) loss over ranking (LambdaMART, DirectRank, GAPfm, …)

Ranking quality measures: NDCG, MRR, ERR, MAP, FCP,

Precision@N, Recall@N, …0 10 20 30 40 50 60 70 80 90 100

NDCG MRR FCP

Example: Two features, linear model

Popularity

Linear Model:frank(u,v) = w1 p(v) + w2 r(u,v)

Final Ranking

Ranking

Putting it together“Learning to Row”

Page-level algorithmic challenge

10,000s of possible

rows …

10-40 rows

Variable number of possible videos per

row (up to thousands)

1 personalized page

per device

Balancing a Personalized Page

vs.Accurate Diverse

vs.Discovery Continuation

vs.Depth Coverage

vs.Freshness Stability

vs.Recommendations Tasks

2D Navigational Modeling

More likely to see

Less likely

Row Lifecycle

Select Candidates

Select Evidence

Filter

Format

Choose

Building a page algorithmically Approaches

Template: Non-personalized layout Row-independent: Greedy rank rows by f(r | u, c) Stage-wise: Pick next rows by f(r | u, c, p1:n)

Page-wise: Total page fitness f(p | u, c)

Obey constraints per device Certain rows may be required Examples: Continue watching and My List

Row Features Quality of items Features of items Quality of evidence User-row interactions Item/row metadata Recency Item-row affinity Row length Position on page Context Title Diversity Freshness …

Page-level Metrics How do you measure the quality of

the homepage? Ease of discovery Diversity Novelty …

Challenges: Position effects Row-video generalization

2D versions of ranking quality metrics Example: Recall @ row-by-column

0 5 10 15 20 25 30 35

Conclusions

Evolution of Recommendation Approach

Rating Ranking Page Generation

Research Directions

Context awareness

Full-page optimization

Presentation effects

Social recommendation

Personalized learning to rank Cold start

34Thank You Justin Basilico

jbasilico@netflix.com@JustinBasilico

We’re hiring

MLconf NYC Justin Basilico

Technology

Transcript of MLconf NYC Justin Basilico

Ross Goodwin, Technologist, Sunspring, MLconf NYC 2017

Evan Estola, Lead Machine Learning Engineer, Meetup, at MLconf NYC 2017

Furong Huang, Ph.D. Candidate, UC Irvine at MLconf NYC - 4/15/16

Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16

Geetu Ambwani, Principal Data Scientist, Huffington Post at MLconf NYC - 4/15/16

MLconf NYC Samantha Kleinberg

Michal Malohlava, Software Engineer, H2O.ai at MLconf NYC

Jeff Johnson, Research Engineer, Facebook at MLconf NYC

Soumith Chintala, AI research engineer, Facebook, at MLconf NYC 2017

MLconf NYC Xiangrui Meng

Kaheer Suleman, CTO, Maluuba at MLconf NYC - 4/15/16

Ben Hamner, CTO, Kaggle, at MLconf NYC 2017

Irina Rish, Research Staff, IBM T.J. Watson Research Center at MLconf NYC

MLconf NYC Shan Shan Huang

Alexandra Johnson, Software Engineer, SigOpt, at MLconf NYC 2017

Lei Yang, Senior Engineering Manager, Quora at MLconf NYC - 4/15/16

Aaron Roth, Associate Professor, University of Pennsylvania, at MLconf NYC 2017

Session 2 - Akyildiz, Beinecke, Yee at MLconf NYC

Jeremy Stanley, EVP/Data Scientist, Sailthru at MLconf NYC

Ted Willke, Senior Principal Engineer, Intel Labs at MLconf NYC