Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff,...

105
Analysis and Forecasting of Trending Topics in Online Media Streams 1 Analysis and Forecasting of Trending Topics in Online Media Streams ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees , Andreas Denge German Research Center for Artificial Intelligence (DFKI

Transcript of Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff,...

Page 1: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 1

Analysis and Forecasting of Trending Topics in Online Media Streams

ACM MM 2013

Tim Althoff, Damian Borth, Jörn Hees, Andreas DengelGerman Research Center for Artificial Intelligence (DFKI)

Page 2: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 4

Motivation

• People use the web for news, information, and communication

• Online activity becomes indicative of the interests of the global population

› „Mirror of Society“[Steven Van Belleghem, 2012]

Page 3: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 5

Use Cases of Online Activity

Page 4: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 6

Tracking Flu Infections

flu remedy

[Ginsberg et al. Detecting influenza epidemics using search engine query data. Nature. 2008.]

Page 5: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 7

Relevance for the MM Community

• Panel: Cross-Media Analysis and Mining• Focus more on the time dimension• How do topics of interest develop over time?

Page 6: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 8

Relevance for the MM Community

• Panel: Cross-Media Analysis and Mining• Focus more on the time dimension• How do topics of interest develop over time?

• Trend aware recommendation systems• Automatic vocabulary selection for classification

Page 7: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 9

Shortcomings of Related Work

1. Focus on very specific use cases (e.g. flu)2. Nowcasting instead of forecasting3. Manual correlation analysis instead of fully

automatic forecasting4. No comparison/study of different online & social

media channels wrt. emerging trends

[Choi and Varian. Predicting initial claims for unemployment benefits. Google, Inc. 2009.][Choi and Varian. Predicting the Present with Google Trends. Economic Record, 2012.][Ginsberg et al. Detecting influenza epidemics using search engine query data. Nature. 2008.][Cooper et al. Cancer internet search activity on a major search engine. Journal of medical Internet research, 2005.][Goel et al. Predicting consumer behavior with web search. National Academy of Sciences, 2010.][Jin et al. The wisdom of social multimedia: using Flickr for prediction and forecast. ACM MM 2010.][Adar et al. Why we search: visualizing and predicting user behavior. WWW, 2007.]

Page 8: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 10

Goals

1. Multi-channel analysis of trending topics

2. Fully automatic forecasting of trending topics

Page 9: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 11

Goals

1. Multi-channel analysis of trending topics

2. Fully automatic forecasting of trending topics

Page 10: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 12

Trending Topics

Page 11: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 13

Trending TopicsA trending topic• is a subject matter of discussion agreed on by a group of people• experiences a sudden spike in user interest or engagement

Page 12: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 14

• Daily crawling of 10 sources• Capturing

• Communication needs (Twitter)• Search patterns (Google)• Information demand (Wikipedia)

Trending Topics Discovery

Trends

Search Germany

Search USA

News Germany

News USA

English Wikipedia

German Wikipedia

Trends Germany

Trends USA

Daily Trends

[Damian Borth, Adrian Ulges, Thomas Breuel. Dynamic Vocabularies for Web-based Concept Detection by Trend Discovery. ACM Multimedia, 2012]

Page 13: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 15

• Daily crawling of 10 sources• Capturing

• Communication needs (Twitter)• Search patterns (Google)• Information demand (Wikipedia)

Trending Topics Discovery

Trends

Search Germany

Search USA

News Germany

News USA

English Wikipedia

German Wikipedia

Trends Germany

Trends USA

Daily Trends

Dataset• Observation period– Sept. 2011 – Sept. 2012

• 40,000 items analyzed• ~ 3,000 trending topics discovered

[Damian Borth, Adrian Ulges, Thomas Breuel. Dynamic Vocabularies for Web-based Concept Detection by Trend Discovery. ACM Multimedia, 2012]

Page 14: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 16

Trending Topics Discovery

Page 15: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 17

Trending Topics Discovery

Page 16: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 18

Goals

1. Multi-channel analysis of trending topics

2. Fully automatic forecasting of trending topics

Page 17: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 19

Multi-Channel Analysis

Page 18: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 20

Example

Neil Armstrong died on August 25, 2012.

Page 19: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 21

Lifetime Analysis

Page 20: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 22

Topic Category Analysis

Page 21: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 23

Topic Category Analysis

Page 22: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 24

Topic Category Analysis

Page 23: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 25

Topic Category Analysis

Page 24: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 26

Forecasting

Page 25: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 27

Forecasting

time

observed values forecast

Page 26: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 28

Basic Forecasting TechniquesNaive

Linear Trend

Average/Median

ARMA

Nearest Neighbor

Page 27: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 29

Basic Forecasting TechniquesNaive

Linear Trend

Average/Median

ARMA

Nearest Neighbor

Page 28: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 30

Basic Forecasting TechniquesNaive

Linear Trend

Average/Median

ARMA

Nearest Neighbor

Page 29: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 31

Basic Forecasting TechniquesNaive

Linear Trend

Average/Median

ARMA

Nearest Neighbor

Page 30: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 32

Basic Forecasting TechniquesNaive

Linear Trend

Average/Median

ARMA

Nearest Neighbor

Page 31: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 33

Example Signal

Page 32: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 34

Forecasting Can Be Very Challenging

Page 33: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 35

Recurring Signal

Page 34: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 36

Recurring Signal

Page 35: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 37

What To Do In This Case?

Page 36: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 38

What To Do In This Case?

Semantics!

Page 37: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 39

Use SemanticallySimilar Topics

Page 38: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 40

Approach

Page 39: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 41

Approach

Page 40: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 42

Approach

Page 41: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 43

Discovering Semantically Similar Topics

Trending Topic:“Olympics 2012”

Page 42: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 44

Nearest Neighbor Sequence Matching

Page 43: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 46

Forecasting

Page 44: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 47

Forecasting

Page 45: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 48

Forecasting

Page 46: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 49

Forecasting

Page 47: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 50

Forecasting

Page 48: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 51

Evaluation

Page 49: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 52

Dataset

• Public dataset of Wikipedia view statistics• Historical data from 2008 until today• Large-scale: 5 million articles attracting

870 million views each day – 2.8 TB compressed

→ Semantic similarity between 5 million articles→ 9 billion sequences to choose from

• Use 200 top trending topics for evaluation

Page 50: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 53

Discovering Semantically Similar Topics

Page 51: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 54

Example: “The Hunger Games”

Page 52: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 55

Example: “The Hunger Games”

Trending TopicTrigger

14 day horizonTime in days

Wikipedia article views

Different models

Page 53: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 56

Example: “The Hunger Games”

Page 54: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 57

Example: “The Hunger Games”

Page 55: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 58

Example: “The Hunger Games”

Page 56: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 59

Example: “The Hunger Games”

Page 57: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 60

Example: “The Hunger Games”

Page 58: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 61

Example: “The Hunger Games”

Page 59: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 62

Example: “The Hunger Games”

Page 60: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 63

Example: “The Hunger Games”

Page 61: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 64

Example: “The Hunger Games”

Page 62: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 65

Example: “The Hunger Games”

Page 63: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 66

Example: “The Hunger Games”

Page 64: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 67

Example: “The Hunger Games”

Page 65: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 68

Example: “The Hunger Games”

Page 66: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 69

Example: “The Hunger Games”

Page 67: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 70

Quantitative Results

Page 68: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 71

Root Mean Square Error

Page 69: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 72

Root Mean Square Error

Page 70: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 73

Root Mean Square Error

Page 71: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 74

Mean Average Percentage Error

Page 72: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 75

Mean Average Percentage Error

Page 73: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 76

Mean Average Percentage Error

Page 74: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 77

Conclusion

Page 75: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 78

Main Contributions

1.Multi-channel analysis of trending topics2.Fully automatic forecasting technique exploiting

semantic relationships between topics3.Empirical evaluation on a large-scale Wikipedia

dataset

Page 76: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 79

Insights

1. Trends on Twitter and Wikipedia are significantly more ephemeral than on Google.

2. All observed media channels tend to specialize in specific topic categories.

3. Semantically similar topics exhibit similar behavior.4. Exploiting this observation can significantly improve

forecasting performance (by 20-90% compared to baselines).

Page 77: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 80

Acknowledgments

Page 78: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 81

Questions?

Thank you for your attention

Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel

Analysis and Forecasting of Trending Topics in Online Media Streams

Page 79: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 82

Backup Slides

Page 80: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 83

Motivation

• Big Data: “High volume, high velocity, and/or high variety information assets that require new forms of processing to enable enhanced decision making, insight discovery and process optimization”

• Large-scale user behavior

› „Mirror of Society“

[Mark A. Beyer and Douglas Laney. The importance of ’big data’: A definition. June 2012.]

Page 81: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 84

Large-scale Record of Human Behavior

• 1.2 million video minutes per second (est. 2016)

• Primary method for communication by college students in the U.S.

• 25 million articles and 1.8 billion page edits in 285 languages created by 100k contributors

• 340 million tweets each day by 500 million users

• One billion search requests and about twenty petabytes of user-generated data each day (2009)

Page 82: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 85

Unemployment Claims

jobs

[Choi and Varian. Predicting initial claims for unemployment benefits. Google, Inc. 2009.]

Page 83: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 86

What is the world talking about?Social Media can offer insights in• what the world is talking about• how people feel about events, brands and productsresulting in an collective awareness for trending topics

A trending topic• is a subject matter of discussion

agreed on by a group of people• experiences a sudden spike in

user interest or engagement

[Damian Borth, Adrian Ulges, Thomas Breuel. Dynamic Vocabularies for Web-based Concept Detection by Trend Discovery. ACM Multimedia, 2012]

Page 84: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 87

Trending Topics Discovery

Page 85: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 88

Trending Topics Discovery

Page 86: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 89

Lifetime Analysis

Page 87: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 90

Lifetime Analysis

Page 88: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 91

Lifetime Analysis

Page 89: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 92

Lifetime Analysis

Page 90: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 93

Example 2: “Battlefield 3”

Page 91: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 94

Example 2: “Battlefield 3”

Trending TopicTrigger

14 day horizonTime in days

Wikipedia article views

Page 92: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 95

Example 2: “Battlefield 3”

Page 93: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 96

Example 2: “Battlefield 3”

Page 94: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 97

Example 2: “Battlefield 3”

Page 95: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 98

Example 2: “Battlefield 3”

Page 96: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 99

Example 2: “Battlefield 3”

Page 97: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 100

Example 2: “Battlefield 3”

Page 98: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 101

Example 2: “Battlefield 3”

Page 99: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 102

Example 2: “Battlefield 3”

Page 100: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 103

Example 2: “Battlefield 3”

Page 101: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 104

Example 2: “Battlefield 3”

Page 102: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 105

Example 2: “Battlefield 3”

Page 103: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 106

Example 2: “Battlefield 3”

Page 104: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 107

Example 2: “Battlefield 3”

Page 105: Analysis and Forecasting of Trending Topics in Online Media Streams 1 ACM MM 2013 Tim Althoff, Damian Borth, Jörn Hees, Andreas Dengel German Research.

Analysis and Forecasting of Trending Topics in Online Media Streams 108

Example 2: “Battlefield 3”