Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data:...

25
4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind Big Data Alessandro Di Bucchianico

Transcript of Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data:...

Page 1: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

4TU AMI SRO Big Data MeetingBig Data Mathematics in ActionNovember 24 2017

Data Science Center Eindhoven

The Mathematics Behind Big Data

Alessandro Di Bucchianico

Outline

bull ldquoBig Datardquobull Some real-life examples with ldquohiddenrdquo mathematicsbull Some mathematical developmentsbull Conclusions

What is big data

Term coined by John Mashey chief scientist at Silicon Graphics in the 1990rsquos

hellipI was using one label for a range of issues and I wanted the simplest shortest phrase to convey that the boundaries of computing keep advancinghellip

But chemometrics has a long historyof analyzing ldquolargerdquo data sets

Presenter
Presentation Notes
httpwwwforbescomsitesgilpress20130509a-very-short-history-of-big-data 13httpbitsblogsnytimescom20130201the-origins-of-big-data-an-etymological-detective-story_r=013httpwwwmidsizeinsidercomen-usarticlebrief-history-big-data-everyone-should-readVSorLvmH4sc

Four Vrsquos of Big Data

Presenter
Presentation Notes
Picture taken from eycom13More Vrsquos have been introduced like V for Value (mostly in business)

High-dimensional data ldquo119951119951 ≪ 119953119953rdquo

Presenter
Presentation Notes
book Van de Geer Buumlhlmann13httpwwwstatriceedu~gallenexamples_high_dim_datapdf13DNA 20000 genes of 200 people13fMRI similar voxels (time series) of few people13Netflix ratings of 18000 movies

Topic Google Search

Topic Google Search

bull How can Google search so fast on its over 250 billion pages

bull What are proper ways to rank web pages

bull First idea count words in pages

Topic Google Search

Model the WWW as a graphbull Squares (nodes vertices) denote web pagesbull Arrows (edges) denote links from one page to another

Relevance is translated into number of incoming links weighted by importance of referring pages

Topic Google Search

Topic Google Search

A =

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 2: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Outline

bull ldquoBig Datardquobull Some real-life examples with ldquohiddenrdquo mathematicsbull Some mathematical developmentsbull Conclusions

What is big data

Term coined by John Mashey chief scientist at Silicon Graphics in the 1990rsquos

hellipI was using one label for a range of issues and I wanted the simplest shortest phrase to convey that the boundaries of computing keep advancinghellip

But chemometrics has a long historyof analyzing ldquolargerdquo data sets

Presenter
Presentation Notes
httpwwwforbescomsitesgilpress20130509a-very-short-history-of-big-data 13httpbitsblogsnytimescom20130201the-origins-of-big-data-an-etymological-detective-story_r=013httpwwwmidsizeinsidercomen-usarticlebrief-history-big-data-everyone-should-readVSorLvmH4sc

Four Vrsquos of Big Data

Presenter
Presentation Notes
Picture taken from eycom13More Vrsquos have been introduced like V for Value (mostly in business)

High-dimensional data ldquo119951119951 ≪ 119953119953rdquo

Presenter
Presentation Notes
book Van de Geer Buumlhlmann13httpwwwstatriceedu~gallenexamples_high_dim_datapdf13DNA 20000 genes of 200 people13fMRI similar voxels (time series) of few people13Netflix ratings of 18000 movies

Topic Google Search

Topic Google Search

bull How can Google search so fast on its over 250 billion pages

bull What are proper ways to rank web pages

bull First idea count words in pages

Topic Google Search

Model the WWW as a graphbull Squares (nodes vertices) denote web pagesbull Arrows (edges) denote links from one page to another

Relevance is translated into number of incoming links weighted by importance of referring pages

Topic Google Search

Topic Google Search

A =

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 3: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

What is big data

Term coined by John Mashey chief scientist at Silicon Graphics in the 1990rsquos

hellipI was using one label for a range of issues and I wanted the simplest shortest phrase to convey that the boundaries of computing keep advancinghellip

But chemometrics has a long historyof analyzing ldquolargerdquo data sets

Presenter
Presentation Notes
httpwwwforbescomsitesgilpress20130509a-very-short-history-of-big-data 13httpbitsblogsnytimescom20130201the-origins-of-big-data-an-etymological-detective-story_r=013httpwwwmidsizeinsidercomen-usarticlebrief-history-big-data-everyone-should-readVSorLvmH4sc

Four Vrsquos of Big Data

Presenter
Presentation Notes
Picture taken from eycom13More Vrsquos have been introduced like V for Value (mostly in business)

High-dimensional data ldquo119951119951 ≪ 119953119953rdquo

Presenter
Presentation Notes
book Van de Geer Buumlhlmann13httpwwwstatriceedu~gallenexamples_high_dim_datapdf13DNA 20000 genes of 200 people13fMRI similar voxels (time series) of few people13Netflix ratings of 18000 movies

Topic Google Search

Topic Google Search

bull How can Google search so fast on its over 250 billion pages

bull What are proper ways to rank web pages

bull First idea count words in pages

Topic Google Search

Model the WWW as a graphbull Squares (nodes vertices) denote web pagesbull Arrows (edges) denote links from one page to another

Relevance is translated into number of incoming links weighted by importance of referring pages

Topic Google Search

Topic Google Search

A =

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 4: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Four Vrsquos of Big Data

Presenter
Presentation Notes
Picture taken from eycom13More Vrsquos have been introduced like V for Value (mostly in business)

High-dimensional data ldquo119951119951 ≪ 119953119953rdquo

Presenter
Presentation Notes
book Van de Geer Buumlhlmann13httpwwwstatriceedu~gallenexamples_high_dim_datapdf13DNA 20000 genes of 200 people13fMRI similar voxels (time series) of few people13Netflix ratings of 18000 movies

Topic Google Search

Topic Google Search

bull How can Google search so fast on its over 250 billion pages

bull What are proper ways to rank web pages

bull First idea count words in pages

Topic Google Search

Model the WWW as a graphbull Squares (nodes vertices) denote web pagesbull Arrows (edges) denote links from one page to another

Relevance is translated into number of incoming links weighted by importance of referring pages

Topic Google Search

Topic Google Search

A =

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 5: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

High-dimensional data ldquo119951119951 ≪ 119953119953rdquo

Presenter
Presentation Notes
book Van de Geer Buumlhlmann13httpwwwstatriceedu~gallenexamples_high_dim_datapdf13DNA 20000 genes of 200 people13fMRI similar voxels (time series) of few people13Netflix ratings of 18000 movies

Topic Google Search

Topic Google Search

bull How can Google search so fast on its over 250 billion pages

bull What are proper ways to rank web pages

bull First idea count words in pages

Topic Google Search

Model the WWW as a graphbull Squares (nodes vertices) denote web pagesbull Arrows (edges) denote links from one page to another

Relevance is translated into number of incoming links weighted by importance of referring pages

Topic Google Search

Topic Google Search

A =

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 6: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Google Search

Topic Google Search

bull How can Google search so fast on its over 250 billion pages

bull What are proper ways to rank web pages

bull First idea count words in pages

Topic Google Search

Model the WWW as a graphbull Squares (nodes vertices) denote web pagesbull Arrows (edges) denote links from one page to another

Relevance is translated into number of incoming links weighted by importance of referring pages

Topic Google Search

Topic Google Search

A =

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 7: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Google Search

bull How can Google search so fast on its over 250 billion pages

bull What are proper ways to rank web pages

bull First idea count words in pages

Topic Google Search

Model the WWW as a graphbull Squares (nodes vertices) denote web pagesbull Arrows (edges) denote links from one page to another

Relevance is translated into number of incoming links weighted by importance of referring pages

Topic Google Search

Topic Google Search

A =

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 8: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Google Search

Model the WWW as a graphbull Squares (nodes vertices) denote web pagesbull Arrows (edges) denote links from one page to another

Relevance is translated into number of incoming links weighted by importance of referring pages

Topic Google Search

Topic Google Search

A =

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 9: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Google Search

Topic Google Search

A =

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 10: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Google Search

A =

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 11: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Google Search

Presenter
Presentation Notes
Note that pages 2 and 4 both have 2 incoming links but not the same relevance

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 12: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Equivalent Random Surfer Model

bull Start at any point on the graphbull Assign equal probabilities to all outgoing linksbull Choose new node with probabilities given by the

linksbull Relevance is proportion of visits after surfing for a

very long time (ldquostationary distributionrdquo)

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 13: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Google Matrix

bull Choose a ldquodamping factorrdquo p (0ltplt1) bull Googlersquos p is secret but around 015

Interpretation for random surferbull choose next link according to A with probability 1- pbull ldquoteleportrdquo to random page with probability p

119872119872 = 1 minus 119901119901 119860119860 + 119901119901119901119901

119901119901 =1119899119899

1 ⋯ 1⋮ ⋱ ⋮1 ⋯ 1

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 14: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Choice of Matrix B

Google choose other B to avoid unwanted boosting of pages

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 15: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Practical Issues

bull The real Google matrix has size in the order of 30 billion columns x 30 billion rows

bull Every day around 250000 new pages are added bull Matrix has many zeros (100 links to a page)

which speeds up calculationsbull Doing the matrix calculation

requires fast computers butalso advanced math

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 16: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Streaming algorithms

High volume high velocity data makes exact counting of frequencies or number of different items practically infeasible but approximate answers suffice in several applications (web site counters customer counts in retail transactionshellip)

New algorithms using advanced probabilistic methods (random hash functions)bull Count-Min Sketch bull MinHashbull HyperLogLog

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 17: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Uncertainty Quantification

Virtual prototyping using mathematical models UQ does not deal with1 unknown uncertainty in the initial

conditions of parameters2 parametrisation of design building blocks dimension

reduction

For 1) polynomial chaos (Wiener chaos) inverse statistical models Bayesian analysis (calibration)

For 2) Model Order Reduction

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 18: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Network monitoring

0

0002

0004

0006

0008

001

0012

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

Measu

re V

alu

e

Time

Al-Qaeda Network Measures

Betweeness

Density

Closeness

Challengesbull monitor high number of variablesbull models to capture structural changesbull scalable algorithms for likelihood ratio calculations

Closeness CUSUM Statistic for Al-Qaeda (1994-2004)

0

1

2

3

4

5

6

7

8

1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

C+

Change Point

Signal

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 19: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Chart3

Betweeness
Density
Closeness
Time
Measure Value
Al-Qaeda Network Measures
0
00003
00001
00001
00015
00005
00003
0003
00012
00006
00043
00016
00008
00055
00018
00016
00067
00021
00027
0008
00027
00029
0009
0003
00024
00083
00028
00025
00081
00028
00033
00093
00031
00031
001
0003
00033
00099
00032
00036
00103
00034
0002
00062
00024
00008
00037
00015
0
00007
00004

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1988 1988 1988
1989 1989 1989
1990 1990 1990
1991 1991 1991
1992 1992 1992
1993 1993 1993
1994 1994 1994
1995 1995 1995
1996 1996 1996
1997 1997 1997
1998 1998 1998
1999 1999 1999
2000 2000 2000
2001 2001 2001
2002 2002 2002
2003 2003 2003
2004 2004 2004
Page 20: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
Page 21: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
Page 22: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Chart1

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
Page 23: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Control Charts

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
Page 24: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

raw data

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
Page 25: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Chart4

C+
Closeness CUSUM Statistic for Al-Qaeda (1994-2004)
0
05910894512
0
0
12457431219
18368325731
37372293657
69469334997
36101009266
0
0

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
Page 26: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Std Measures Control Charts

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
httpwwwmarocnlnieuwsforumsshowphppg=timeline
Significant Events Founded in Afghanistan Moves to Sudan WTC 1 and Somalia Leaves Sudan Returns to Afghanistan Saudia Arabia bomb Issues fatwa US embassy bombings USS Cole WTC 2 Afghanistan 3 Man is Captured Dirty bomb foiled 911 plotter captured Sheikh Mohammed captured Saudi bombings Morroco attacks Sectarian violence plot Madrid Europe truce offered
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Density 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Closeness 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Closeness 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Z -08728715609 10910894512 -02182178902 -02182178902 17457431219 10910894512 24003967926 3709704134 -28368325731 -87287156094 -159299059872
C+ 0 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 0 0
C- 03728715609 0 0 0 0 0 0 0 23368325731 105655481825 259954541697
03728715609 05910894512 0 0 12457431219 18368325731 37372293657 69469334997 36101009266 105655481825 259954541697
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 00028333333
Std Dev 00001527525
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Density 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Z -08444407432 11042686642 -0259827921 -06495698025 16888814864 30529780716 28581071308 36375908938 -43521176765 -9223891195 -150700194171
C+ 0 06042686642 0 0 11888814864 3741859558 60999666888 92375575826 43854399061 0 0
C- 03444407432 0 0 01495698025 0 0 0 0 38521176765 125760088714 271460282886
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00084333333
Std Dev 00005131601
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Betweeness 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Z 01324532357 09271726499 -10596258857 -06622661785 25166114784 17218920642 25166114784 37086905998 -26490647141 -74173811996 -105962588565
C+ 0 04271726499 0 0 20166114784 32385035426 5255115021 84638056208 53147409067 0 0
C- 0 0 05596258857 07218920642 0 0 0 0 21490647141 90664459137 191627047702
Signal 0 0 0 0 0 0 1 1 1 1 1
Average 00026666667
Std Dev 00002516611
k 05
Control Limit 4
Page 27: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Std Measures Control Charts

Betweeness
Density
Closeness
Time
Value
Standard Measures

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
Page 28: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Chart1

C+
Closeness CUSUM Chart for Al-Qaeda (1994-2004)

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
Page 29: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Control Charts

Number of AGENT
Edge Count[Agent x Agent Agent x Agent]
Component Members-Weak[Agent x Agent Agent x Agent] Average
Isolate Count[Agent x Agent Agent x Agent]
Component Count-Weak[Agent x Agent Agent x Agent]
Component Count-Strong[Agent x Agent Agent x Agent]
32
38
1728798
350
355
355
94
201
1248005
293
301
315
112
405
101776
269
272
281
137
574
862131
247
250
255
162
733
710437
222
226
232
183
889
559781
198
201
214
221
1075
367623
157
162
177
228
1207
299836
142
146
156
213
1112
366667
159
162
172
219
1082
345
154
157
169
254
1247
232377
126
128
145
250
1330
232213
125
128
144
256
1328
225628
122
126
137
260
1370
211885
119
122
131
190
825
50694
187
191
198
141
496
855628
243
249
256
62
92
1487596
322
329
336

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
1988 1988 1988 1988 1988 1988
1989 1989 1989 1989 1989 1989
1990 1990 1990 1990 1990 1990
1991 1991 1991 1991 1991 1991
1992 1992 1992 1992 1992 1992
1993 1993 1993 1993 1993 1993
1994 1994 1994 1994 1994 1994
1995 1995 1995 1995 1995 1995
1996 1996 1996 1996 1996 1996
1997 1997 1997 1997 1997 1997
1998 1998 1998 1998 1998 1998
1999 1999 1999 1999 1999 1999
2000 2000 2000 2000 2000 2000
2001 2001 2001 2001 2001 2001
2002 2002 2002 2002 2002 2002
2003 2003 2003 2003 2003 2003
2004 2004 2004 2004 2004 2004
Page 30: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

raw data

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Agents 221 228 213 219 254 250 256 260 190 141 62
Z -03746343246 00624390541 -08741467575 -04995124328 16858544608 14360982444 18107325691 20604887855 -23102450019 -53697586531 -103024439274
C+ 0 0 0 0 11858544608 21219527053 34326852743 49931740598 21829290579 0 0
C- 0 0 03741467575 03736591903 0 0 0 0 18102450019 6680003655 164824475824
Signal 0 0 0 0 0 0 0 1 0 1 1
Average 227
Std Dev 16015617378
k 05
Control Limit 4
1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Component Members-Weak Average 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Z 09566024078 -01181147213 09414456731 05979300132 -11876316317 -1190231741 -12946324712 -15125184597 31653794044 86935922814 187130183455
C+ 04566024078 0 04414456731 05393756863 0 0 0 0 26653794044 108589716857 290719900313
C- 0 0 0 0 06876316317 13778633728 21724958439 31850143037 0 0 0
Signal 0 0 0 0 0 0 0 0 0 1 1
Average 307286
Std Dev 63074271515
k 05
Control Limit 4
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Edge Count 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Slope of Edge Count 163 204 169 159 156 186 132 -95 -30 165 83 -2 42 -545 -329 -404
Z -05039753822 15973457028 -01964649795 -07089823173 -08627375186 06748144948 -20927791293 -137269226972 -6393653789 59836399187 20500693512 -16784942813 -06150208054 -316223197419 -114547624997 -100197139539
C+ 0 10973457028 04008807233 0 0 01748144948 0 0 0 54836399187 70337092699 48552149886 37401941833 0 0 0
C- 00039753822 0 0 02089823173 05717198359 0 15927791293 148197018266 207133556155 142297156968 116796463456 128581406269 129731614323 440954811741 550502436738 645699576277
Signal 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1
Average 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 1728333333333 9475 4825 43 3075 54 72 -1055 -2085 -309
Std Dev 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 195115350498 1284143683549 1324547092406 1254033492376 115840047767 882307580533 710539700979 29504858357 278774102097 2505474007049
k 05
Control Limit 4
Page 31: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004
Centrality-Betweenness[Agent x Agent Agent x Agent] Average 0 00001 00003 00006 00008 00016 00027 00029 00024 00025 00033 00031 00033 00036 0002 00008 0
Meta-Matrix Hamming Distance 0 00018 00036 0006 00065 00087 00124 0012 00146 00136 00135 00166 00164 00156 0015 0009 0004
Network Centralization-Betweenness[Agent x Agent Agent x Agent] 0 00091 00292 00492 00608 01105 01557 01586 01242 01187 01295 01338 01316 01414 0104 0042 00014
Edge Count-Sequential[Agent x Agent Agent x Agent] 0 001 00025 00017 0 0 00019 00008 00009 00009 00016 00015 0 00007 0 0002 0
Hierarchy[Agent x Agent Agent x Agent] 0 04827 01679 00696 00796 01496 01339 00727 00775 01273 01492 01711 0075 00717 00786 00882 05587
Exclusivity[Agent x Agent Agent x Agent] Average 00001 00003 00003 00003 00004 00004 00005 00005 00005 00006 00006 00006 00006 00006 00006 00004 00002
Network Centralization-Closeness[Agent x Agent Agent x Agent] 00001 00005 00012 00016 00018 00021 00027 0003 00028 00028 00031 0003 00032 00034 00024 00015 00004
Similarity Correlation[Agent x Agent Agent x Agent] Average 00001 00023 00036 00043 0005 0006 0007 00077 0007 00069 0008 00084 0008 00082 00053 0003 00008
Simmelian Ties[Agent x Agent Agent x Agent] Average 00002 00008 00021 00029 0004 0005 00059 00068 00061 00058 00065 00076 00076 0008 00045 00029 00004
Overall Complexity 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Expertise Correlation[Agent x Agent Agent x Agent] Average 00003 00015 0003 00042 00053 00065 00079 00088 00081 00079 00091 00097 00097 00101 0006 00036 00007
Density[Agent x Agent Agent x Agent] 00003 00015 0003 00043 00055 00067 0008 0009 00083 00081 00093 001 00099 00103 00062 00037 00007
Centrality-Inverse Closeness[Agent x Agent Agent x Agent] Average 00003 00057 00191 00306 00387 0053 00733 00943 0084 00877 01111 01117 01125 01206 00587 00257 00018
Efficiency-Global[Agent x Agent Agent x Agent] 00003 00087 00223 00338 00421 00591 00815 0101 00904 00972 01245 01244 01203 01286 00632 00277 00026
Connectedness[Agent x Agent Agent x Agent] 00003 0021 00589 00932 0119 01953 02921 03446 0301 03223 04222 04187 04049 04402 02202 00899 00059
Distinctiveness Correlation[Agent x Agent Agent x Agent] Average 00006 00029 00059 00083 00106 00129 00156 00175 00162 00157 00181 00193 00193 00199 0012 00073 00014
Centrality-Closeness[Agent x Agent Agent x Agent] Average 00027 00028 00029 00031 00032 00035 00042 00048 00044 00044 00053 00052 00054 00058 00038 00031 00027
Boundary Spanner[Agent x Agent Agent x Agent] Average 00027 00191 00328 00328 00546 00574 00683 00601 00656 0071 00683 0071 00847 00656 00738 00546 00246
Network Centralization-Row Degree[Agent x Agent Agent x Agent] 00134 00314 006 00752 00794 0081 00851 00841 00848 00878 00893 00969 00996 00993 00842 00648 0024
Network Centralization-Column Degree[Agent x Agent Agent x Agent] 00134 00588 00819 00834 00849 00892 00878 00896 00876 00905 00948 00996 01024 00993 00842 0062 00322
Network Centralization-Out Degree[Agent x Agent Agent x Agent] 00135 00315 00601 00754 00797 00812 00853 00843 00851 0088 00895 00972 00999 00996 00845 0065 0024
Network Centralization-Total Degree[Agent x Agent Agent x Agent] 00135 00453 00672 00769 00813 00828 00869 0086 00867 00897 00925 00988 01016 00999 00847 00638 00282
Network Centralization-In Degree[Agent x Agent Agent x Agent] 00135 00589 00821 00836 00852 00895 00881 00898 00878 00908 0095 00999 01027 00996 00845 00622 00323
Clustering Coefficient-Watts-Strogatz[Agent x Agent Agent x Agent] Average 0023 00786 0126 01471 01728 01993 02334 02782 02382 02256 02604 02856 02775 02958 02157 01468 00355
Efficiency-Local[Agent x Agent Agent x Agent] 00238 01032 01587 01916 02253 02598 03034 0362 03167 03054 03644 03843 03678 03861 02663 01705 0041
Interlockers[Agent x Agent Agent x Agent] Average 00246 00546 00656 00628 00738 00902 00902 00902 00847 0071 00738 00874 00874 0082 00792 00683 00546
Breadth-Column[Agent x Agent Agent x Agent] 00273 01011 01803 02596 03224 03716 04399 05082 04672 04672 05355 05328 05519 05738 03825 02404 00601
Breadth-Row[Agent x Agent Agent x Agent] 00273 01339 02158 02923 03388 03934 04754 05301 04863 04973 05792 05765 05765 05984 04153 02514 00601
Clique Count[Agent x Agent Agent x Agent] Average 00301 02896 06421 10929 14699 17213 19781 22486 20601 20492 24071 2571 24918 24727 13689 06913 01148
Constraint-Burt[Agent x Agent Agent x Agent] Average 0038 01331 01404 01481 01869 02077 02643 02719 02506 02623 02827 0295 03034 02956 02548 01967 00948
Interdependence[Agent x Agent Agent x Agent] 00485 00073 00042 0003 00024 0002 00017 00015 00016 00016 00014 00013 00014 00013 00022 00037 00176
Effective Network Size-Burt[Agent x Agent Agent x Agent] Average 0053 03636 06977 10333 13118 15388 18575 21156 19688 19928 22983 2382 23991 2429 14392 08408 01957
Redundancy-Row[Agent x Agent Agent x Agent] 00601 03525 08443 12459 16093 19699 23716 26913 24754 23798 27541 29781 29617 30683 1765 10219 01311
Redundancy-Column[Agent x Agent Agent x Agent] 00601 03743 08634 12568 1623 2 23989 27077 24945 24016 27814 30055 29891 30902 17842 10383 01448
Node Levels[Agent x Agent Agent x Agent] Average 00601 09836 14344 23087 24126 36339 46913 42541 3918 43197 51913 44727 46421 48169 47568 23033 03169
Centrality-Bonacich Power[Agent x Agent Agent x Agent] Average 01038 05492 11066 15683 20027 2429 29372 32978 30383 29563 34071 36339 36284 37432 22541 13552 02514
Triad Count[Agent x Agent Agent x Agent] Average 01967 08279 27104 42432 5918 78251 96503 101475 94454 84508 94645 111421 110164 112732 67131 35574 01995
Efficiency[Agent x Agent Agent x Agent] 03333 09552 0964 09658 09658 09746 09798 09806 09795 09818 09835 09822 0982 09828 09809 09756 09607
Speed-Minimum[Agent x Agent Agent x Agent] 05 0125 0125 01 01111 00909 00909 01 01 01 00909 01111 01 01 00769 01111 01667
Edge Count-Skip[Agent x Agent Agent x Agent] 07895 07761 08988 09408 09304 0946 0933 09519 09487 09409 09455 09534 09465 09584 09188 0881 06196
Transitivity[Agent x Agent Agent x Agent] 08372 03475 03372 03238 03336 03626 0375 03495 03503 03253 0319 03266 03302 0336 03822 0405 0258
Edge Count-Pooled[Agent x Agent Agent x Agent] 08421 08657 09432 09669 09714 09764 09665 09751 09748 09704 09735 09737 09759 09788 09612 09435 08152
Speed-Average[Agent x Agent Agent x Agent] 08519 02994 03068 02802 02799 0233 02156 02387 0245 02484 02448 02516 02464 02429 02259 02373 04139
Edge Count-Lateral[Agent x Agent Agent x Agent] 09474 03731 03926 0392 04216 04061 03944 03828 03768 03651 03633 03872 03795 03985 03842 03891 05761
Resemblance Correlation[Agent x Agent Agent x Agent] Average 09994 09971 09941 09917 09894 09871 09844 09825 09838 09843 09819 09807 09807 09801 0988 09927 09986
Fragmentation[Agent x Agent Agent x Agent] 09997 0979 09411 09068 0881 08047 07079 06554 0699 06777 05778 05813 05951 05598 07798 09101 09941
Communicative Need[Agent x Agent Agent x Agent] 1 0608 07457 07662 0801 08069 08098 08233 0817 08124 0784 08169 08368 08365 08212 09004 08039
Upper Boundedness[Agent x Agent Agent x Agent] 1 09021 09909 0999 09987 0994 09966 09993 09989 09965 0995 09928 09987 09988 09986 09983 09073
Average Distance[Agent x Agent Agent x Agent] 11739 33399 326 35685 35731 42917 46372 41893 40814 40263 40845 39749 40577 41172 44266 42139 24163
Network Levels[Agent x Agent Agent x Agent] 2 8 8 10 9 11 11 10 10 10 11 9 10 10 13 9 6
Span Of Control[Agent x Agent Agent x Agent] 2375 27917 42188 48644 50903 52917 51932 54369 53981 5128 52176 55417 54426 55466 46089 40656 20909
Number of AGENT 32 94 112 137 162 183 221 228 213 219 254 250 256 260 190 141 62
Edge Count[Agent x Agent Agent x Agent] 38 201 405 574 733 889 1075 1207 1112 1082 1247 1330 1328 1370 825 496 92
Component Members-Weak[Agent x Agent Agent x Agent] Average 1728798 1248005 101776 862131 710437 559781 367623 299836 366667 345 232377 232213 225628 211885 50694 855628 1487596
Isolate Count[Agent x Agent Agent x Agent] 350 293 269 247 222 198 157 142 159 154 126 125 122 119 187 243 322
Component Count-Weak[Agent x Agent Agent x Agent] 355 301 272 250 226 201 162 146 162 157 128 128 126 122 191 249 329
Component Count-Strong[Agent x Agent Agent x Agent] 355 315 281 255 232 214 177 156 172 169 145 144 137 131 198 256 336
Page 32: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Precision farming

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
Page 33: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Precision farming

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
Page 34: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Compressed Sensing

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
Page 35: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Network structure

bull dependencies between nodes in networks measured by degrees of direct neighbours

bull assortativity coefficient of Newman is nothing but Pearsonrsquos correlation statistic

bull inconsistent estimator when variances are infinitebull Spearman rank correlation behaves better but

calculation is computationally intensive bull requires heavy asymptotics

Van der Hofstad R and Litvak N (2014) Degree-Degree Dependencies in Random Graphs with Heavy-Tailed Degrees Internet Mathematics 10 (3-4) pp 287-334

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
Page 36: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Topic Topological Data Analysis

Common Big Data problem is to choose relevant ldquofeaturesrdquo from high-dimensional data

Combination of machine learning with topological tools (simplices cohomology) yields new algorithmsfor finding patterns and clustering

Presenter
Presentation Notes
httpsyoutubeXfWibrh6stw Gunnar Carsson

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
Page 37: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Mathematical contributions in general

bull Modellingbull Performance of algorithmsbull Statistical thinking

We need to work hard to makemathematical contributions more explicit

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions
Page 38: Data Science Center Eindhoven - 4TU.Federation · 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Data Science Center Eindhoven The Mathematics Behind

Conclusions

1 Mathematical methods are an important enabler in big data settings

2 Developments in big data settings not only require more computer speed and memory capacity but also new advanced mathematics

  • The Mathematics Behind Big Data
  • Outline
  • What is big data
  • Four Vrsquos of Big Data
  • High-dimensional data ldquo119951≪119953rdquo
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Topic Google Search
  • Equivalent Random Surfer Model
  • Google Matrix
  • Choice of Matrix B
  • Practical Issues
  • Topic Streaming algorithms
  • Topic Uncertainty Quantification
  • Topic Network monitoring
  • Topic Precision farming
  • Topic Precision farming
  • Topic Compressed Sensing
  • Topic Network structure
  • Topic Topological Data Analysis
  • Mathematical contributions in general
  • Conclusions