04 CLASS 2012 Scale-Free Property

download 04 CLASS 2012 Scale-Free Property

of 60

Transcript of 04 CLASS 2012 Scale-Free Property

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    1/60

    Class 4: Scale-Free Property

    Prof. Albert-Lszl BarabsiDr. Baruch Barzel, Dr. Mauro Martino

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    2/60

    lrand

    =

    logN

    log k

    Empirical data for real networks

    C ~ const

    P(k

    Regular

    network

    Erdos-

    Renyi

    Watts-

    Strogatz

    Pathlength

    Clustering

    Degree

    l " N1/ D

    klog

    Nloglrand

    klog

    Nloglrand

    C ~ const

    C ~ const

    N

    kpCrand

    P(k)=!(

    P(k) = e"

    Expone

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    3/60

    Nodes: WWW documentsLinks: URL links

    Over 3 billion documents

    ROBOT: collects all URLsfound in a document andfollows them recursively

    Exp

    ected

    P(k) ~ k-!Found

    R. Albert, H. Jeong, A-L Barabasi, Nature, 401 130 (1999).

    WORLD WIDE WEB

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    4/60

    Expected

    P(k) ~ k-!

    Found

    R. Albert, H. Jeong, A-L Barabasi, Nature, 40

    Degree distribution of the WWW

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    5/60

    The difference between a power law and an exponential distributio

    20 40 60 80 1000.20.61

    1cx)x(f !=

    xc)x(f !=

    50.cx)x(f !=

    Above a certain x value, the power law is always higher than the exponential.

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    6/60

    semilog

    10

    100 101 102 103

    -410-310-210-1100

    loglog

    This difference is particularly obvious if we plot them on a log vertical scale: for large xthere are orders of magnitude differences between the two functions.

    1cx)x(f !=

    xc)x(f !=

    cx)x(f !=

    xc)x(f !=

    50.cx)x(f !=

    c)x(f =

    The difference between a power law and an exponential distributio

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    7/60

    Over 3 billiondocumentsROBOT:collects all URLsfound in a document and

    follows them recursively

    Nodes: WWW documentsLinks: URL links

    P(k) ~ k-!

    Scale-free

    Network

    Exponential

    Network

    What does the difference mean? Visual representation.

    R. Albert, H. Jeong, A-L Barabasi, Nature, 401 130 (1999).Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    8/60

    WORLD WIDE WEB

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    9/60

    Definition:

    Networks with a power law tail in their degree distribution are calledscale-free networks

    Where does the name come from?

    Critical Phenomena and scale-invariance(a detour)

    Slides after Dante R. Chialvo

    Scale-free networks: Definition

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    10/60

    |m|

    TcT

    m:order

    param

    eter

    order disorder

    interacting elementary (spins) sitting in a lattic

    Neighboring spins like t

    in the same direction

    If the temperature T is

    attraction is not sufficie

    there is no net magneti

    If the temperature is low

    ferromagnetic order seis a phase transition at

    T = 0.99 Tc T = 0.999Tc

    "

    T = Tc T = 1.5 Tc T = 2 Tc

    FERROMAGNETIC MATERIALS

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    11/60

    At T = Tc: correlation length diverges

    Fluctuations emerge at all scales: scale-free behavior

    Scale-free behavior in space

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    12/60

    At T = Tc

    Scale-free behavior in time

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    13/60

    1-D

    2-D

    3-D

    Low T High T Solved

    Ising 1925

    Onsager 194

    Provencomputationallintractable - 20

    INSING MODEL and universality

    Universality:while T

    cand many other paramters depend on the details of the system, the critical

    exponents, like !or ", do not. The exponents are universal, which means that they depend only on the

    dimension of the space and the underlying symmetries of the problem. Hence many different systems

    magnets to liquieds and superconductors, share the same exponents.Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    14/60

    Correlation length diverges at the critical point: thewhole system is correlated!

    Scale invariance: there is no characteristic scale forthe fluctuation (scale-free behavior).

    ! Universality:exponents are independent of thesystems details.

    CRITICAL PHENOMENA

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    15/60

    Now we see where the scale-free name comes from.

    But whats in the name?

    We need to learn a bit more to understand that

    soon

    Before we get there, let us talk about universality.

    SCALE-FREE NETWORKS

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    16/60

    UniversalityHow generic is our finding of a power law degree distribution?

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    17/60

    (Faloutsos, Faloutsos and Faloutsos, 1999)

    Nodes: computers, routersLinks: physical lines

    INTERNET BACKBONE

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    18/60

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    19/60

    (= 3)

    (S. Redner, 1998)

    P(k) ~k-

    1736 PRL papers (1988)

    SCIENCE CITATION INDEX

    Nodes: papersLinks: citations

    578...

    25

    H.E. Stanley,...

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    20/60

    (Bacon game for Brainiacs )

    Number of links required to connect scto Erd#s, via co-authorship of pap

    Erd#s wrote 1500+ papers with 507 coauthors.

    Jerry Grossmans (Oakland Univ.) we

    allows mathematicians to computeErdos numbers:

    http://www.oakland.edu/enp/

    Connecting path lengths, amongmathematicians only:

    $ average is 4.65$ maximum is 13

    Paul Erd"s (1913-1996)

    Erdos has better centrality inhis network than Bacon has inhis.

    ERDOS NUMBER

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    21/60

    SCIENCE COAUTHORSHIP

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    22/60

    SCIENCE COAUTHORSHIP

    M: mathNS: neuro

    Nodes: scientist (authors)Links: joint publication

    (Newman, 2000, Barabasi et al 2001)

    Network Science: Scale-F

    ONLINE COMMUNITIES

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    23/60

    Nodes: online userLinks: email contact

    Ebel, Mielsch, Bornholdtz, PRE 2002.

    Kiel University log files112 days, N=59,912 nodes

    Pussokram.com online community512 days, 25,000 users.

    Holme, Edling, Liljeros, 2002.

    ONLINE COMMUNITIES

    ONLINE COMMUNITIES

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    24/60

    Nodes: online userLinks: email contact

    ONLINE COMMUNITIES

    Twitter:

    Jake HofAlan Mislove, Measurement and Analysis of Online Social Networks

    All distribtions show a fat-tail behavior:there are orders of magnitude spread in the degrees

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    25/60

    protein-geninteraction

    protein-protinteractions

    PROTEOM

    GENOM

    Citrate Cycle

    METABOLI

    Bio-chemreactions

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    26/60

    Citrate Cycle

    METABOLI

    Bio-chemreactions

    BOEHRING MENNHEIN

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    27/60

    BOEHRING-MENNHEIN

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    28/60

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    29/60

    protein-geninteraction

    protein-protinteractions

    PROTEOM

    GENOM

    Citrate Cycle

    METABOLI

    Bio-chemreactions

    METABOLIC NETWORK

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    30/60

    protein-protinteractions

    PROTEOM

    METABOLIC NETWORK

    Network Science: Scale-F

    TOPOLOGY OF THE PROTEIN NETWORK

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    31/60

    )exp()(~)( 00!

    "

    k

    kkkkkP +

    #+ #

    H. Jeong, S.P. Mason, A.-L. Barabasi, Z.N. Oltvai, Nature 411, 41-4

    Nodes: proteinsLinks: physical interactions-binding

    TOPOLOGY OF THE PROTEIN NETWORK

    C. Elegans Drosophila M.

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    32/60

    Li et al.Science 2004 Giot et al.Science 200

    HUMAN INGTERACTION NETWORK

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    33/60

    2,800 Y2H interactions4,100 binary LC interactions(HPRD, MINT, BIND, DIP, MIPS)

    Rual et al.Nature 2005; Stelze et al.Cell 2005

    HUMAN INGTERACTION NETWORK

    Network Science: Scale-F

    ACTOR NETWORK

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    34/60

    Nodes: actorsLinks: cast jointly

    N = 212,250 actors"k#= 28.78

    P(k) ~k-!

    Days of Thunder (1990)Far and Away (1992)Eyes Wide Shut (1999)

    !=2.3

    SWEDISH SE-WEB

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    35/60

    Nodes:people (Females; Males)Links: sexual relationships

    Liljeros et al. Nature 2001

    4781 Swedes; 18-74;

    59% response rate.

    Network Science: Scale-F

    SCALE-FREE NETWORKS

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    36/60

    Many real world networks have a similar architecture:

    Scale-free networks

    WWW, Internet (routers and domains), electronic circuits, computer software, movieactors, coauthorship networks, sexual web, instant messaging, email web, citations,phone calls, metabolic, protein interaction, protein domains, brain function web,linguistic networks, comic book characters, international trade, bank system,

    encryption trust net, energy landscapes, earthquakes, astrophysical network

    Network Science: Scale-F

    UNIVERSALITY AGAIN

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    37/60

    Critical phenomena:Universality means that the expare the same for different system

    they are independent of details.

    Networks:The exponents vary from systesystem.Most are between 2 and 3

    Universality:the emergence of common featacross different networks. Like scale-free property.

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    38/60

    Divergences inscale-free

    networks

    Network Science: Scale-F

    SCALE-FREE DISTRIBUTION: DISCRETE FORMALISM

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    39/60

    pk =Ck"#

    pkk=1

    "

    # =1

    C k"#

    k=1

    $

    % =1

    C= 1

    k"#

    k=1

    $

    %=

    1&(#)

    Riemann Zeta function

    pk =k"

    #

    $(#)

    for k>0 (i.e. we assume that there are no disconnected nodesthe network)

    Network Science: Scale-F

    SCALE-FREE DISTRIBUTION: DISCRETE FORMALISM

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    40/60

    pk =Ck"# k=[Kmin,$)

    pkk=Kmin

    "

    # =1

    C=1

    k"#

    k=Kmin

    $

    %=

    1

    &(#,Kmin )

    Generalized or incomplete Zeta function

    pk =k"#

    $(#,Kmin )

    For some applications we only care about the tail of the degree distribution

    Network Science: Scale-F

    SCALE-FREE DISTRIBUTION: CONTINUUM FORMALISM

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    41/60

    C=1

    k"#dk

    Kmin

    $

    %=(#"1)Kmin

    #"1

    P(k) =Ck"#

    k=[Kmin

    ,$)

    P(k)Kmin

    "

    # dk=1

    P(k) =("#1)Kmin"#1

    k#"

    Network Science: Scale-F

    DIVERGENCE OF THE HIGHER MOMENTS

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    42/60

    k=[Kmin ,")P(k) =("#1)Kmin"#1

    k#"

    m-th moment of the degree distribution: < km>= k

    mP(k)dk

    Kmin

    "

    #

    < km>= ("#1)Kmin

    "#1km#"

    dk

    Kmin

    $

    % =("#1)

    (m # "+1)Kmin

    "#1km#"+1[ ]

    Kmin

    $

    If m-!+1= "

    (#"1)

    (m " #+1)Kmin

    m

    If m-!+1>0, the integral diverges.For a fixed !this means that all moments with m>!-1 diverge.

    Network Science: Scale-F

    DIVERGENCE OF THE HIGHER MOMENTS

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    43/60

    < km>= ("#1)Kmin

    "#1km#$

    dk

    Kmin

    %

    & =("#1)

    (m # "+1)Kmin

    "#1km#"+1[ ]

    Kmin

    %

    For a fixed $this means all moments m>!-1 diverge.

    Most degree exponents are smallethan 3

    " diverges in the N"#limit

    Network Science: Scale-F

    What is the meaning of the observed divergence?

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    44/60

    Most degree exponents are smaller than 3" diverges!!!

    What does it mean?

    WWW: = 7

    Internet: = 3.5

    Metabolic: = 7.4

    Phone call: = 3.16

    "k= (< k

    2> # < k>

    2)

    1 / 2$%

    k=< k> "k

    The average values are not meaningful, as fluctuations are too large!

    Network Science: Scale-F

    FINITE SCALE-FREE NETWORKS

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    45/60

    All real networks are finite " let us explore its consequences."We have an expected maximum degree, Kmax

    Estimating Kmax

    P(k)dk

    Kmax

    "

    # $ 1

    N

    Kmax

    =KminN

    1

    "#1

    Why: the probability to have a node larger than Kmax should nexceed the prob. to have one node, i.e. 1/N fraction of all nod

    P(k)dk

    Kmax

    "

    # = ($%1)Kmin$%1 k%$dkKmax

    "

    # = ($%1)

    (%$+1)Kmin

    $%1k%$+1[ ]

    Kmax

    "=Kmin

    $%1

    Kmax$%1

    & 1N

    Network Science: Scale-F

    DISTANCES IN RANDOM GRAPHS

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    46/60

    Random graphs tend to have a tree-like topology with almost constant node degrees.

    nr. of first neighbors:

    nr. of second neighbors:

    nr. of neighbours at distance d:

    estimate maximum distance:

    klog

    Nloglmax

    maxl

    1l

    i

    Nk1

    kN1

    2 kN

    Nd " k

    Network Science: Scale-F

    Distances in scale-free networks

    DISTANCES IN SCALE-FREE NETWORKS

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    47/60

    Size of the biggest hub is of order O(N). Most nodes can be connected withof it, thus the average path length will be independent of the system size.

    The average path length increases slower than logarithmically. In a randomnodes have comparable degree, thus most paths will have comparable leng

    scale-free network the vast majority of the path go through the few high degreducing the distances between nodes.

    Some key models produce !=3, so the result is of particular importance for was first derived by Bollobas and collaborators for the network diameter in t

    a dynamical model, but it holds for the average path length as well.

    The second moment of the distribution is finite, thus in many ways the netwas a random network. Hence the average path length follows the result that

    for the random network model earlier.

    Cohen, Havlin Phys. Rev. Lett. 90, 58701(2003); Cohen, Havlin and ben-Avraham, in Handbook of Graphs and Networks, Eds. BShuster (Willy-VCH, NY, 2002) Chap. 4; Confirmed also by: Dorogovtsev et al (2002), Chung and Lu (2002); (Bollobas, Riordan, 21985; Newman, 2001

    Ultra

    Small

    World

    Small

    World

    Kmax

    =KminN

    1

    "#1

    SUMMARY OF THE BEHAVIOR OF SCALE-FREE NETWORKS

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    48/60

    !=1 !=2 !=3

    diverges

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    49/60

    diverges

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    50/60

    Kmax

    =KminN

    1

    "#1

    In order to document a scale-free networks, we need 2-3 orders of magnitude scalingThat is, Kmax~ 10

    3

    However, that constrains on the system size we require to document it.

    For example, to measure an exponent #=5,we need to maximum degree a system sizthe order of

    N =K

    max

    Kmin

    "

    #$

    %

    &'

    ()1

    *1012

    Onella et al. PNAS 2007

    N%

    Mobile CallNetwork

    Network Science: Scale-F

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    51/60

    Divergences in sca

    Network Science: Scale-F

    CLEANING UP DEGREE DISTRIBUTIONS

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    52/60

    Often it is difficult to determine the best fit to the degree distribution.

    Methods of data cleanup:

    1. logarithmic binning: bin the k range; use bins of exponentially increasing size(applied to PDF, or probability distribution function)

    2. Display the cumulative degree distribution (CDF)

    Ex. Determine the degree distribution and

    cumulative degree distribution of the graph

    on the right.

    )Kk(P)Kk(P

    )k(P)Kk(PK

    kk min

    !"=>

    =! #=

    1

    or

    Network Science: Scale-F

    CLEANING UP DEGREE DISTRIBUTIONS

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    53/60

    log( )x

    cx!"

    #

    Probability that node

    has degree x.

    cx"

    #

    Probability that a node has adegree bigger than x.

    )x(P

    If the (noncumulative)degree distributiondecays with a slope&>1,

    the cumulative degreedistributionwill decay with slope

    &-1.Does not apply for

    &=1!

    )xX(P >

    Network Science: Scale-F

    HUMAN INTERACTION NETWORK

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    54/60

    2,800 Y2H interactions

    4,100 binary LC interactions

    (HPRD, MINT, BIND, DIP, MIP

    Rual et al. Nature 2005; Stelze et al. Cell 2005Network Science: Scale-F

    HUMAN INTERACTION DATA BY RUAL ET AL.

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    55/60

    (linear scale)

    P(k) ~ k!!!'2

    Network Science: Scale-F

    HUMAN INTERACTION DATA BY RUAL ET AL.

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    56/60

    (linear scale)

    P(k) ~ (k+k0)!!

    k0'1.4, !'2.6.

    Network Science: Scale-F

    COMMON MISCONCEPTIONS

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    57/60

    Common Misconceptions

    -if there is a low-k saturation, it is not scale-fr

    -if there is a high-k cutoff, it is not scale-free

    Most real networks:

    P(k) ~ (k+k0)-"exp(-k/k1)

    low-k saturation

    High-k cutoff

    Network Science: Scale-F

    TOPOLOGY OF THE PROTEIN NETWORK

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    58/60

    )exp()(~)( 00!

    "

    k

    kkkkkP +

    #+ #

    H. Jeong, S.P. Mason, A.-L. Barabasi, Z.N. Oltvai, Nature 411, 41-42 (2001)

    Nodes: proteinsLinks: physical interactions-binding

    Network Science: Scale-F

    SUMMARY OF THE BEHAVIOR OF SCALE-FREE NETWORKS

    collab

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    59/60

    diverges

  • 8/13/2019 04 CLASS 2012 Scale-Free Property

    60/60