Time to Shop for Valentine’s Day: Shopping Occasions and...

Post on 18-Jul-2020

3 views 0 download

Transcript of Time to Shop for Valentine’s Day: Shopping Occasions and...

Time to Shop for Valentine’s Day: Shopping Occasions and Sequential

Recommendation in E-commerceJianling Wang, Raphael Louca*, Diane Hu*, Caitlin Cellier*,

James Caverlee and Liangjie Hong*Texas A&M University

* Etsy Inc.

Recommender Systems usually:

● Capture users' intrinsic preference from their long-term behaviorpatterns.

● Infer users’ current needs by emphasizing recent actions.

However, intrinsic user behavior may be shifted by occasions, such asbirthdays, anniversaries, or gifting celebrations (Valentine's Day orMother's Day).

In E-commerce

In E-commerce

Often, these occasion-based purchases deviate from long-termpreferences and are not related to recent actions.

Idea: Let’s incorporate occasion signals!● Recommend more time or season-aware candidates, which may

alleviate the cold-start problem.● Reduce the noise in modeling users' intrinsic preferences.● Recommend relevant items to the user for upcoming reoccurring

occasions.

Global Occasions● Happen at the same time for a large number of users.● Encourage or lead to similar shopping decisions for crowds of users.● Examples: Christmas, Valentine’s Day, …

Personal Occasions● Happen at different times for different users.● Occur in a periodic and repeated pattern for a specific user.● Examples: Birthdays (for you or your friends) and anniversaries.

To Illustrate: Temporal Shopping TrendsUsers' shopping preferences are dynamic and can reflect reoccurring occasions (festivals, holidays, seasonal activities).

Valentine’s Day

Mother’s Day

Father’s Day

Thanksgiving Christmas

(a) (b)

Time-sensitive Most Popular V.S. General Most Popular

To Illustrate: From Personal PerspectiveThere are occasions that may re-occur within a certain period (e.g. annually or monthly) over the course of multiple years and trigger relevant purchase.

⇒ ⇒ traceable patterns in personal occasion signals.

Time Gap between Purchases for Wedding and Anniversary within a year. More than 50% of purchases for anniversary are near the date of wedding purchase within a time window less than 30 days.

Occasion-Aware Recommendation (OAR)Intrinsic Preference +

Personal Occasion Signals +

Global Occasion Signals

Our Proposal:

Occasion-Aware Recommendation (OAR)● Model the repeated personal occasion signals with attention layers.● Model the global occasion signals by memorizing the temporal trends

of shopping behaviors.● With a gating component, we balance the global and local effects of

different occasions. ● Experiments on Etsy and Amazon.

Attention Mechanisms● Query

● Key-Value Pairs

● Scoring Function - Calculating similarity/relationship between a Query & a Key

Keys ValuesQueries… ……

Attention Weights Outputs

q<latexit sha1_base64="BWkoXlOkYejmq6A3DcUbfxqunms=">AAAB8XicbVDLTgJBEOzFF+IL9ehlIjHxRHbRRI9ELx4xkUcEQmaHWZgwO7vO9BrJhr/w4kFjvPo33vwbB9iDgpV0UqnqTneXH0th0HW/ndzK6tr6Rn6zsLW9s7tX3D9omCjRjNdZJCPd8qnhUiheR4GSt2LNaehL3vRH11O/+ci1EZG6w3HMuyEdKBEIRtFK9x3kT+gH6cOkVyy5ZXcGsky8jJQgQ61X/Or0I5aEXCGT1Ji258bYTalGwSSfFDqJ4TFlIzrgbUsVDbnpprOLJ+TEKn0SRNqWQjJTf0+kNDRmHPq2M6Q4NIveVPzPaycYXHZToeIEuWLzRUEiCUZk+j7pC80ZyrEllGlhbyVsSDVlaEMq2BC8xZeXSaNS9s7KldvzUvUqiyMPR3AMp+DBBVThBmpQBwYKnuEV3hzjvDjvzse8NedkM4fwB87nDx2HkTQ=</latexit>

o<latexit sha1_base64="MdMM0CCP6gkS6I9FmEsJNlSQy9E=">AAAB8XicbVBNS8NAEJ34WetX1aOXxSJ4KkkV9Fj04rGC/cA2lM120y7dbMLuRCyh/8KLB0W8+m+8+W/ctjlo64OBx3szzMwLEikMuu63s7K6tr6xWdgqbu/s7u2XDg6bJk414w0Wy1i3A2q4FIo3UKDk7URzGgWSt4LRzdRvPXJtRKzucZxwP6IDJULBKFrpoYv8CYMwiye9UtmtuDOQZeLlpAw56r3SV7cfszTiCpmkxnQ8N0E/oxoFk3xS7KaGJ5SN6IB3LFU04sbPZhdPyKlV+iSMtS2FZKb+nshoZMw4CmxnRHFoFr2p+J/XSTG88jOhkhS5YvNFYSoJxmT6PukLzRnKsSWUaWFvJWxINWVoQyraELzFl5dJs1rxzivVu4ty7TqPowDHcAJn4MEl1OAW6tAABgqe4RXeHOO8OO/Ox7x1xclnjuAPnM8fGn2RMg==</latexit>

k1<latexit sha1_base64="vpeHugvKEkgkR/0j0altVfMTXNo=">AAAB83icbVBNS8NAEJ3Ur1q/qh69LBbBU0mqoMeiF48V7Ac0pWy2m3bpZhN2J2IJ/RtePCji1T/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVjDdZLGPdCajhUijeRIGSdxLNaRRI3g7GtzO//ci1EbF6wEnCexEdKhEKRtFKvo/8CYMwG0/7Xr9ccavuHGSVeDmpQI5Gv/zlD2KWRlwhk9SYrucm2MuoRsEkn5b81PCEsjEd8q6likbc9LL5zVNyZpUBCWNtSyGZq78nMhoZM4kC2xlRHJllbyb+53VTDK97mVBJilyxxaIwlQRjMguADITmDOXEEsq0sLcSNqKaMrQxlWwI3vLLq6RVq3oX1dr9ZaV+k8dRhBM4hXPw4ArqcAcNaAKDBJ7hFd6c1Hlx3p2PRWvByWeO4Q+czx9BnZHS</latexit>

k2<latexit sha1_base64="MYieos1SHUd3AQE445MyTQ0XbWo=">AAAB83icbVBNS8NAEN3Ur1q/qh69LBbBU0mqoMeiF48V7Ac0pWy2k3bpZhN2J2IJ/RtePCji1T/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVHJo8lrHuBMyAFAqaKFBCJ9HAokBCOxjfzvz2I2gjYvWAkwR6ERsqEQrO0Eq+j/CEQZiNp/1av1xxq+4cdJV4OamQHI1++csfxDyNQCGXzJiu5ybYy5hGwSVMS35qIGF8zIbQtVSxCEwvm988pWdWGdAw1rYU0rn6eyJjkTGTKLCdEcORWfZm4n9eN8XwupcJlaQIii8WhamkGNNZAHQgNHCUE0sY18LeSvmIacbRxlSyIXjLL6+SVq3qXVRr95eV+k0eR5GckFNyTjxyRerkjjRIk3CSkGfySt6c1Hlx3p2PRWvByWeOyR84nz9DIZHT</latexit>

k3<latexit sha1_base64="jAR3sPEfMoURVaK7hqHc6/mT8KA=">AAAB83icbVBNS8NAEN3Ur1q/qh69LBbBU0laQY9FLx4r2FZoStlsJ+3SzSbsTsQS+je8eFDEq3/Gm//GbZuDtj4YeLw3w8y8IJHCoOt+O4W19Y3NreJ2aWd3b/+gfHjUNnGqObR4LGP9EDADUihooUAJD4kGFgUSOsH4ZuZ3HkEbEat7nCTQi9hQiVBwhlbyfYQnDMJsPO3X++WKW3XnoKvEy0mF5Gj2y1/+IOZpBAq5ZMZ0PTfBXsY0Ci5hWvJTAwnjYzaErqWKRWB62fzmKT2zyoCGsbalkM7V3xMZi4yZRIHtjBiOzLI3E//zuimGV71MqCRFUHyxKEwlxZjOAqADoYGjnFjCuBb2VspHTDOONqaSDcFbfnmVtGtVr16t3V1UGtd5HEVyQk7JOfHIJWmQW9IkLcJJQp7JK3lzUufFeXc+Fq0FJ585Jn/gfP4ARKWR1A==</latexit>

kL<latexit sha1_base64="XeR3m7IUyoUIUJ0yLNPrDRZNyDQ=">AAAB83icbVA9SwNBEJ3zM8avqKXNYhCswl0UtAzaWFhEMB+QC2Fvs5cs2ds7dufEcORv2FgoYuufsfPfuEmu0MQHA4/3ZpiZFyRSGHTdb2dldW19Y7OwVdze2d3bLx0cNk2casYbLJaxbgfUcCkUb6BAyduJ5jQKJG8Fo5up33rk2ohYPeA44d2IDpQIBaNoJd9H/oRBmI0mvbteqexW3BnIMvFyUoYc9V7py+/HLI24QiapMR3PTbCbUY2CST4p+qnhCWUjOuAdSxWNuOlms5sn5NQqfRLG2pZCMlN/T2Q0MmYcBbYzojg0i95U/M/rpBhedTOhkhS5YvNFYSoJxmQaAOkLzRnKsSWUaWFvJWxINWVoYyraELzFl5dJs1rxzivV+4ty7TqPowDHcAJn4MEl1OAW6tAABgk8wyu8Oanz4rw7H/PWFSefOYI/cD5/AGqJke0=</latexit>

vL<latexit sha1_base64="I0VC8gBrQq8yWNyo+yFpgT4wz1Y=">AAAB83icbVBNS8NAEJ3Ur1q/qh69BIvgqSRV0GPRiwcPFewHNKFstpt26WYTdifFEvo3vHhQxKt/xpv/xm2bg7Y+GHi8N8PMvCARXKPjfFuFtfWNza3idmlnd2//oHx41NJxqihr0ljEqhMQzQSXrIkcBeskipEoEKwdjG5nfnvMlOaxfMRJwvyIDCQPOSVoJM9D9oRBmI2nvfteueJUnTnsVeLmpAI5Gr3yl9ePaRoxiVQQrbuuk6CfEYWcCjYtealmCaEjMmBdQyWJmPaz+c1T+8wofTuMlSmJ9lz9PZGRSOtJFJjOiOBQL3sz8T+vm2J47WdcJikySReLwlTYGNuzAOw+V4yimBhCqOLmVpsOiSIUTUwlE4K7/PIqadWq7kW19nBZqd/kcRThBE7hHFy4gjrcQQOaQCGBZ3iFNyu1Xqx362PRWrDymWP4A+vzB3tWkfg=</latexit>

v1<latexit sha1_base64="LBrO2AG9JVSYbXOTEEaiIcFSFcM=">AAAB83icbVBNS8NAEN34WetX1aOXxSJ4KkkV9Fj04rGC/YCmlM120i7dbMLupFhC/4YXD4p49c9489+4bXPQ1gcDj/dmmJkXJFIYdN1vZ219Y3Nru7BT3N3bPzgsHR03TZxqDg0ey1i3A2ZACgUNFCihnWhgUSChFYzuZn5rDNqIWD3iJIFuxAZKhIIztJLvIzxhEGbjac/rlcpuxZ2DrhIvJ2WSo94rffn9mKcRKOSSGdPx3AS7GdMouIRp0U8NJIyP2AA6lioWgelm85un9NwqfRrG2pZCOld/T2QsMmYSBbYzYjg0y95M/M/rpBjedDOhkhRB8cWiMJUUYzoLgPaFBo5yYgnjWthbKR8yzTjamIo2BG/55VXSrFa8y0r14apcu83jKJBTckYuiEeuSY3ckzppEE4S8kxeyZuTOi/Ou/OxaF1z8pkT8gfO5w9SapHd</latexit>

v2<latexit sha1_base64="aZYsu15SPJ5E+sMT1cDcJ3BeBvA=">AAAB83icbVBNS8NAEJ34WetX1aOXxSJ4KkkV9Fj04rGC/YCmlM120y7dbMLupFhC/4YXD4p49c9489+4bXPQ1gcDj/dmmJkXJFIYdN1vZ219Y3Nru7BT3N3bPzgsHR03TZxqxhsslrFuB9RwKRRvoEDJ24nmNAokbwWju5nfGnNtRKwecZLwbkQHSoSCUbSS7yN/wiDMxtNetVcquxV3DrJKvJyUIUe9V/ry+zFLI66QSWpMx3MT7GZUo2CST4t+anhC2YgOeMdSRSNuutn85ik5t0qfhLG2pZDM1d8TGY2MmUSB7YwoDs2yNxP/8zophjfdTKgkRa7YYlGYSoIxmQVA+kJzhnJiCWVa2FsJG1JNGdqYijYEb/nlVdKsVrzLSvXhqly7zeMowCmcwQV4cA01uIc6NIBBAs/wCm9O6rw4787HonXNyWdO4A+czx9T7pHe</latexit>

v3<latexit sha1_base64="22jywEjQYStxJ5VhI3BqGrGrjow=">AAAB83icbVBNS8NAEJ3Ur1q/qh69BIvgqSStoMeiF48V7Ac0oWy2m3bpZhN2J8US+je8eFDEq3/Gm//GbZuDtj4YeLw3w8y8IBFco+N8W4WNza3tneJuaW//4PCofHzS1nGqKGvRWMSqGxDNBJeshRwF6yaKkSgQrBOM7+Z+Z8KU5rF8xGnC/IgMJQ85JWgkz0P2hEGYTWb9er9ccarOAvY6cXNSgRzNfvnLG8Q0jZhEKojWPddJ0M+IQk4Fm5W8VLOE0DEZsp6hkkRM+9ni5pl9YZSBHcbKlER7of6eyEik9TQKTGdEcKRXvbn4n9dLMbzxMy6TFJmky0VhKmyM7XkA9oArRlFMDSFUcXOrTUdEEYomppIJwV19eZ20a1W3Xq09XFUat3kcRTiDc7gEF66hAffQhBZQSOAZXuHNSq0X6936WLYWrHzmFP7A+vwBVXKR3w==</latexit>

↵q1<latexit sha1_base64="KHLFA9dJh8UXBjlmeG4gSSqYavE=">AAAB8nicbVBNS8NAEN3Ur1q/qh69LBbBU0mqoMeiF48V7Ae0oUy2m3bpZhN3J0IJ/RlePCji1V/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVjDdZLGPdCcBwKRRvokDJO4nmEAWSt4Px7cxvP3FtRKwecJJwP4KhEqFggFbq9kAmI+hnj960X664VXcOukq8nFRIjka//NUbxCyNuEImwZiu5yboZ6BRMMmnpV5qeAJsDEPetVRBxI2fzU+e0jOrDGgYa1sK6Vz9PZFBZMwkCmxnBDgyy95M/M/rphhe+5lQSYpcscWiMJUUYzr7nw6E5gzlxBJgWthbKRuBBoY2pZINwVt+eZW0alXvolq7v6zUb/I4iuSEnJJz4pErUid3pEGahJGYPJNX8uag8+K8Ox+L1oKTzxyTP3A+fwBTE5FH</latexit>

↵q2<latexit sha1_base64="iTEZbT2BAe/Lm+3dV9kMoHumgPI=">AAAB8nicbVBNS8NAEJ3Ur1q/qh69LBbBU0mqoMeiF48V7Ae0oWy2m3bpZhN3J0IJ/RlePCji1V/jzX/jts1BWx8MPN6bYWZekEhh0HW/ncLa+sbmVnG7tLO7t39QPjxqmTjVjDdZLGPdCajhUijeRIGSdxLNaRRI3g7GtzO//cS1EbF6wEnC/YgOlQgFo2ilbo/KZET72WNt2i9X3Ko7B1klXk4qkKPRL3/1BjFLI66QSWpM13MT9DOqUTDJp6VeanhC2ZgOeddSRSNu/Gx+8pScWWVAwljbUkjm6u+JjEbGTKLAdkYUR2bZm4n/ed0Uw2s/EypJkSu2WBSmkmBMZv+TgdCcoZxYQpkW9lbCRlRThjalkg3BW355lbRqVe+iWru/rNRv8jiKcAKncA4eXEEd7qABTWAQwzO8wpuDzovz7nwsWgtOPnMMf+B8/gBUmJFI</latexit>

↵qL<latexit sha1_base64="kkjpS9t58amNksFaPDZ+g+H+llA=">AAAB8nicbVA9SwNBEJ3zM8avqKXNYRCswl0UtAzaWFhEMB9wOcLcZpMs2ds9d/eEcORn2FgoYuuvsfPfuEmu0MQHA4/3ZpiZFyWcaeN5387K6tr6xmZhq7i9s7u3Xzo4bGqZKkIbRHKp2hFqypmgDcMMp+1EUYwjTlvR6Gbqt56o0kyKBzNOaBjjQLA+I2isFHSQJ0PsZo93k26p7FW8Gdxl4uekDDnq3dJXpydJGlNhCEetA99LTJihMoxwOil2Uk0TJCMc0MBSgTHVYTY7eeKeWqXn9qWyJYw7U39PZBhrPY4j2xmjGepFbyr+5wWp6V+FGRNJaqgg80X9lLtGutP/3R5TlBg+tgSJYvZWlwxRITE2paINwV98eZk0qxX/vFK9vyjXrvM4CnAMJ3AGPlxCDW6hDg0gIOEZXuHNMc6L8+58zFtXnHzmCP7A+fwBfBqRYg==</latexit>

↵q3<latexit sha1_base64="HjnYOqDG+9mWsRnkxKGgT7/tGPs=">AAAB8nicbVBNS8NAEN3Ur1q/qh69LBbBU0laQY9FLx4r2FZoQ5lsN+3STTbuToQS+jO8eFDEq7/Gm//GbZuDtj4YeLw3w8y8IJHCoOt+O4W19Y3NreJ2aWd3b/+gfHjUNirVjLeYkko/BGC4FDFvoUDJHxLNIQok7wTjm5nfeeLaCBXf4yThfgTDWISCAVqp2wOZjKCfPdan/XLFrbpz0FXi5aRCcjT75a/eQLE04jEyCcZ0PTdBPwONgkk+LfVSwxNgYxjyrqUxRNz42fzkKT2zyoCGStuKkc7V3xMZRMZMosB2RoAjs+zNxP+8borhlZ+JOEmRx2yxKEwlRUVn/9OB0JyhnFgCTAt7K2Uj0MDQplSyIXjLL6+Sdq3q1au1u4tK4zqPo0hOyCk5Jx65JA1yS5qkRRhR5Jm8kjcHnRfn3flYtBacfOaY/IHz+QNWHZFJ</latexit>

Softm

ax

Intrinsic Preference ModelingModel users' dynamic intrinsic preferences based on the correlation between the most recent purchase and the personal historic purchases.

● Query = embedding of current item

● Keys, Values = sequence of previous items

Ref: Wang-Cheng Kang and Julian McAuley. 2018. Self-attentive sequential recommendation. In ICDM.

+ + +Item Embedding

Positional Embedding

+ … Query

Self-Attention

Intrinsic Preference Modeling

Shopping History

Intrinsic Preference

Personal Occasion ElicitationElicit personal occasion signals by tracing the user's previous shopping behavior in the neighboring days.

● Query = timestamp of interest

● (Key, Value) = timestamp embedding of previous purchase, corresponding item embedding

Item Embedding …

Timestamp Embedding

Query

Personal Occasion Next Shopping

Personal Occasion Elicitation

Shopping History

Global Occasion MemorizationMemorize the shopping behavior of the crowd under different global occasions.

● Query = current timestamp embedding

● (Key, Value) = (timestamp embedding, memory slot embedding)

Time Key Embedding

Memory Slots

Query

Next Shopping

… Global OccasionGlobal Occasion

Memorization

Global Table

Gating LayerUtilize an attention (gating) layer to balance a user's intrinsic preferences with different occasion signals for personalization.

Personal Occasion Next Shopping

Query

x

Predicted Preference

Global Occasion

Gated Output

User EmbeddingShopping History

Gating Layer

Intrinsic Preference

Experiment: Data

Etsy: November 2006 to December 2018

Amazon: May 1996 to July 2014

Dataset # Users # Items # Purchases Density Cutting Time

Etsy 118,668 80,214 5.3M 0.0561% 2018/1/1Amazon 84,191 100,946 1.0M 0.0124% 2013/8/1

Experiment: Metrics

● Normalized Discounted Cumulative Gain (NDCG@K)● Hit Rate (HR@K)● Mean Reciprocal Rank (MRR)

Experiment: Baselines

● Most Popular● MF-BPR: Matrix Factorization with Bayesian Personalized Ranking● Fossil: Fusing Similarity Models with Markov Chains● GRU4Rec+: Recurrent Neural Networks with Top-k Gains● TCN: A Simple Convolutional Generative Network for Next Item

Recommendation● HPMN: Lifelong Sequential Modeling with Hierarchical Periodic

Memory Network● SARec: Self-attentive sequential recommendation

ModelEtsy Amazon

NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195

MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650

GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*

Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.

Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR

Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047

I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108

Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165

Table 3: Ablation Test Results.

methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.

4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.

Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.

Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic

Figure 7: Similarity between di�erent calendar days.

preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.

4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).

The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,

OAR achieves the best performance on both Etsy and Amazon

OAR vs Baselines

6%~8% improvement

ModelEtsy Amazon

NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195

MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650

GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*

Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.

Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR

Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047

I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108

Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165

Table 3: Ablation Test Results.

methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.

4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.

Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.

Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic

Figure 7: Similarity between di�erent calendar days.

preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.

4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).

The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,

Impact of each component?The global occasion memorization component performs better than recommending general most popular items.

Impact of each component?Personal occasion component works better than the intrinsic preference component.

ModelEtsy Amazon

NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195

MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650

GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*

Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.

Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR

Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047

I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108

Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165

Table 3: Ablation Test Results.

methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.

4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.

Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.

Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic

Figure 7: Similarity between di�erent calendar days.

preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.

4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).

The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,

Impact of each component?Replace the Gating layer with simple addition ⇒ The performance drops

ModelEtsy Amazon

NDCG HR MRR NDCG HR MRRK=5 K=10 K=5 K=10 K=5 K=10 K=5 K=10MP 0.1531 0.1919 0.2304 0.3511 0.1673 0.2129 0.2509 0.3020 0.4199 0.2195

MF-BPR 0.4519 0.5001 0.5947 0.7434 0.4376 0.2663 0.3012 0.3619 0.4698 0.2668Fossil 0.4946 0.5354 0.5511 0.7630 0.4746 0.2160 0.2483 0.2967 0.3969 0.2221TCN 0.5199 0.5726 0.6698 0.8059 0.5090 0.2632 0.3029 0.3664 0.4893 0.2650

GRU4Rec+ 0.5346 0.5771 0.6830 0.8136 0.5126 0.2763 0.3169 0.3828 0.5087 0.2770HPMN 0.5480 0.5883 0.6962 0.8201 0.5245 0.2820 0.3216 0.3881 0.5109 0.2819SARec 0.5665 0.6047 0.7102 0.8278 0.5433 0.3009 0.3385 0.4085 0.5251 0.2984OAR 0.6078* 0.6415* 0.7425* 0.8462* 0.5847* 0.3200* 0.3580* 0.4301* 0.5476* 0.3165*

Table 2: Comparison of Di�erent Models. ⇤ indicates that the improvement of the best result is statistically signi�cant com-pared with second best result with p < 0.01.

Model Etsy AmazonNDCG@5 MRR NDCG@5 MRR

Global (G) 0.1816 0.1953 0.2238 0.2294Intrinsic (I) 0.5665 0.5433 0.3009 0.2984Personal (P) 0.5791 0.5582 0.3069 0.3047

I + G 0.5885 0.5642 0.3099 0.3063I + P 0.5916 0.5677 0.3136 0.3108

Remove Gate 0.5859 0.5618 0.3074 0.3039OAR 0.6078 0.5847 0.3200 0.3165

Table 3: Ablation Test Results.

methods is over {Adam, Adagrad, SGD}. We also �ne-tune all themodel-speci�c hyperparameters and report the best performancein the following sections.

4.3 Model ComparisonWe summarize the best performance of all the baseline models andthe proposed model in Table 2. We can see that OAR achieves thebest performance under di�erent metrics in both datasets. It gains7.62% and 6.06% MRR improvement in Etsy and Amazon comparedwith the state-of-the-art.

Compared with the basic general MP, we can see that MF-BPRwhich represents users and items with static latent factors canachieve a 177.43% and 39.67% improvement on average in Etsy andAmazon. Then by introducing the Markov Chains to capture thetransition of users among di�erent items, we �nd that Fossil worksbetter than MF in Etsy but performs worse in Amazon. Presumablyit is because the Amazon data is extremely sparse and results in anunstable factorized Markov Chains component in Fossil.

Comparing the recent neural-based sequential models, we �ndthat GRU4Rec+ works slightly better than TCN, which is based ondilated CNN. And HPMN utilizing hierarchical multi-layer memorynetworks outperforms GRU4Rec+ in both data, which proves thatthere are periodic pattern in users’ shopping behaviors. However,HPMN model assumes that the period of shopping behavior isconstant for all the users along the time and thus lack of �exibilityto handle the real-world scenarios. We �nd that SARec, which isutilizing self-attention to model users’ intrinsic preference, workseven better than HPMN. This shows that attention mechanismsare a good �t for modeling sequential behaviors. And by carefullyeliciting the occasion signals and combining them with the intrinsic

Figure 7: Similarity between di�erent calendar days.

preferences, OAR achieves the best performance in the next-itemprediction via an accurate user model.

4.4 Evaluation of OARTo examine whether each component in OAR achieves its goaland to understand how it contributes to the recommendation, weanalyze their impacts with an ablation test (in Table 3).

The Global occasion component (G), in which we set up a certainnumber of memory slots to record the crowd behavior in di�erentoccasions, does not provide personalized recommendation individ-ually. It can outperform the general Most Popular (MP) model by17.17% and 4.89% in Etsy and Amazon, which demonstrates thatit can capture the temporal global occasion signals hidden in thecrowd behavior. Additionally, we can infer that the users in Etsyare more likely to follow the temporal global trends in shopping.Both Intrinsic and Personal components can provide personalizednext-item recommendation. In Intrinsic (I), it maps the most re-cent purchase to the items purchased before to infer the “relevant”items in the future. While in Personal (P), the main idea is to traceback to the previous behaviors in the related time periods. We cansee that P performs slightly better than I, which means that in e-commerce, it is important to predict the shopping occasion and paymore attention to the items purchased around similar occasionswhile inferring the next purchase. While combining the I and Gor I and P, we can see the joint models can improve each of theindividual components. Thus we �nd that in e-commerce platforms,

Visualization of Occasion-driven ShoppingPreferences for Occasion-related items are dynamics in a year.

Summary● Propose an occasion-aware sequential recommender● Model the repeated personal occasion signals with attention layers,

while modeling the global occasion signals by memorizing the temporal trends of shopping behaviors

● Experiments on Etsy and Amazon show the positive impact of incorporating occasion signals

● Next…

Thank you!