Socable Influence Maximization

Wei Chen

Microsoft Research Asia

In collaboration with Chi Wang University of Illinois at Urbana-Champaign

Yajun Wang Microsoft Research Asia

KDD'10, July 27, 20101

Scalable Influence Maximization for Prevalent Viral Marketing in Large-Scale

Social Networks

Outline

KDD'10, July 27, 20102

Background and problem definition

Maximum Influence Arborescence (MIA) heuristic

Experimental evaluations

Related work and future directions

Ubiquitous Social Networks

KDD'10, July 27, 20103

A Hypothetical Example of Viral Marketing

4 KDD'10, July 27, 2010

Avatar is great

Effectiveness of Viral Marketing

5 KDD'10, July 27, 2010

level of trust on different types of ads *

*source from Forrester Research and Intelliseek

very effective

Social influence graphvertices are individualslinks are social relationshipsnumber p(u,v) on a directed link from u to v is the probability that v is activated by u after u is activated

Independent cascade modelinitially some seed nodes are activatedAt each step, each newly activated node u activates its neighbor v with probability p(u,v)influence spread: expected number of nodes activated

Influence maximization:find k seeds that generate the largest influence spread

The Problem of Influence Maximization

6 KDD'10, July 27, 2010

Influence maximization as a discrete optimization problem proposed by Kempe, Kleinberg, and Tardos, in KDD’2003

Finding optimal solution is provably hard (NP-hard)

Greedy approximation algorithm, 63% approximation of the optimal solution

Repeat k rounds: in the i-th round, select a node v that provides the largest marginal increase in influence spread

require the evaluation of influence spread given a seed set --- hard and slow

Several subsequent studies improved the running time

Serious drawback:

very slow, not scalable: > 3 hrs on a 30k node graph for 50 seeds

Research Background

7 KDD'10, July 27, 2010

Design new heuristics

MIA (maximum influence arborescence) heuristic

for general independent cascade model

103 speedup --- from hours to seconds (or days to minutes)

influence spread close to that of the greedy algorithm of [KKT’03]

We also show that computing exact influence spread given a seed set is #P-hard (counting hardness)

resolve an open problem in [KKT’03]

indicate the intrinsic difficulty of computing influence spread

Our Work

8 KDD'10, July 27, 2010

For any pair of nodes u and v, find the maximum influence path (MIP) from u to v

ignore MIPs with too small probabilities ( < parameter )

Maximum Influence Arborescence (MIA) Heuristic I: Maximum Influence Paths (MIPs)

9 KDD'10, July 27, 2010

MIA Heuristic II: Maximum Influence in-(out-) Arborescences

10 KDD'10, July 27, 2010

Local influence regions

for every node v, all MIPs to v form its maximuminfluence in-arborescence (MIIA )

Local influence regions

for every node v, all MIPs to v form its maximuminfluence in-arborescence (MIIA )

for every node u, all MIPs from u form its maximuminfluence out-arborescence (MIOA )

These MIIAs and MIOAs can be computed efficiently using the Dijkstra shortest path algorithm

MIA Heuristic II: Maximum Influence in-(out-) Arborescences

11 KDD'10, July 27, 2010

Recursive computation of activation probability ap(u) of a node u in its in-arborescence, given a seed set S

Can be used in the greedy algorithm for selecting k seeds,but not efficient enough

MIA Heuristic III: Computing Influence through the MIA structure

12 KDD'10, July 27, 2010

If v is the root of a MIIA, and u is a node in the MIIA, then their activation probabilities have a linear relationship:

All ‘s in a MIIA can be recursively computedtime reduced from quadratic to linear time

If u is selected as a seed, its marginal influence increase to v is

Summing up the above marginal influence over all nodes v, we obtain the marginal influence of u

Select the u with the largest marginal influence

Update for all w’s that are in the same MIIAs as u

MIA Heuristic IV: Efficient Updates on Activation Probabilities

13 KDD'10, July 27, 2010

Iterating the following two steps until finding k seeds

Selecting the node u giving the largest marginal influence

Update MIAs (linear coefficients) after selecting u as the seed

Key features:

updates are local, and linear to the arborescence size

tunable with parameter : tradeoff between running time and influence spread

MIA Heuristic IV: Summary

14 KDD'10, July 27, 2010

Experiment Results on MIA Heuristic

15 KDD'10, July 27, 2010

weighted cascade model:

• influence probability to a node v = 1 / (# of in-neighbors of v)

Influence spread vs. seed set size

NetHEPT dataset:

• collaboration network from physics archive

• 15K nodes, 31K edges

Epinions dataset:

• who-trust-whom network of Epinions.com

16 KDD'10, July 27, 2010

running time

104 times

speed up>103 times

speed up

Running time is for selecting 50 seeds

Scalability of MIA Heuristic

17 KDD'10, July 27, 2010

• synthesized graphs of different sizes generated from power-law graph model

• weighted cascade model

• running time is for selecting 50 seeds

Greedy approximation algorithmsOriginal greedy algorithm [Kempe, Kleinberg, and Tardos, 2003]

Lazy-forward optimization [Leskovec, Krause, Guestrin, Faloutsos, VanBriesen, and Glance, 2007]

Edge sampling and reachable sets [Kimura, Saito and Nakano, 2007; C., Wang, and Yang, 2009]

reduced seed selection from days to hours (with 30K nodes), but still not scalable

Heuristic algorithms

SPM/SP1M based on shortest paths [Kimura and Saito, 2006], not scalable

SPIN based on Shapley values [Narayanam and Narahari, 2008], not scalable

Degree discounts [C., Wang, and Yang, 2009], designed for the uniform IC model

CGA based on community partitions [Wang, Cong, Song, and Xie 2010]complementary

our local MIAs naturally adapt to the community structure, including overlapping communities

Related Work

18 KDD'10, July 27, 2010

Theoretical problem: efficient approximation algorithms:

How to efficiently approximate influence spread given a seed set?

Practical problem: Influence analysis from online social media

How to mine the influence graph?

Future Directions

19 KDD'10, July 27, 2010

Thanks!and questions?

20 KDD'10, July 27, 2010

21 KDD'10, July 27, 2010

weighted cascade model:

• influence probability to a node v = 1 / (# of in-neighbors of v)

Influence spread vs. seed set size running time

NetHEPT dataset:

• collaboration network from physics archive

Epinions dataset:

• who-trust-whom network of Epinions.com

104 times

speed up>103 times

speed up

Running time is for selecting 50 seeds

Socable Influence Maximization

Technology

Transcript of Socable Influence Maximization

CNCC'2019, Classical Algorithms Forum, Oct. 18, …...Research on Influence Maximization CNCC'2019, Classical Algorithms Forum, Oct. 18, 2019 4 Influence Propagation Modeling and Influence

Personalized Influence Maximization on Social Networks

Influence Maximization in Network by Genetic …...Inﬂuence Maximization in Network by Genetic Algorithm on Linear Threshold Model Arthur Rodrigues da Silva 1,2, Rodrigo Ferreira

소셜 네트워크 상에서 역방향 경로 활성화 기법 기반 …networking.khu.ac.kr/layouts/net/publications/data/1811...Abstract Influence Maximization (IM) deals with

Multi-Objective Influence Maximization

Potential Maximization and Coalition Government Formationecon.ucsb.edu/~garratt/faculty/garratt_parco_qin_rapoport.pdf · Running head: Potential Maximization Potential Maximization

Polarity Related Influence Maximization in Signed Social ... maximization... · Polarity Related Influence Maximization in Signed Social Networks Dong Li1, Zhi-Ming Xu1*, Nilanjan

Lecture 4 Influence Maximization Ding-Zhu Du University of Texas at Dallas.

A Data-based approach to Social Influence Maximization

Online Topic-aware Influence Maximization Queries

Scalable Influence Maximization for Prevalent Viral ...

Influence Maximization in Continuous Time Diffusion Networks

Influence Maximization When Negative Opinions May Emerge ... · Existing influence maximization model Social network as a (directed) graph Nodes represent individuals. Edges are social

Graph-Aware Evolutionary Algorithms for Influence Maximization

Influence maximization with viral product design

Influence diffusion dynamics and influence maximization in ... · Our recent work on social network related research 5 Harvard, Oct. 18, 2011 Social influence in social networks scalable

Fast and Accurate Influence Maximization on Large Networks with Pruned Monte-Carlo Simulations (AAAI'14)

On Bharathi-Kempe-Salek Conjecture about Influence Maximization Ding-Zhu Du University of Texas at Dallas.

Influence Maximization in Unknown Social Networks ... · entire social network. The subgraph is used to pick influential set of nodes by applying a influence maximization algorithm

Polarity Related Influence Maximization in Signed …...Influence maximization in signed social networks containing both positive relationships and negative relationships (e.g. foe