Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes | MAT WG, RIPE 79 |...

19
Stephen Strowes <[email protected]> | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Transcript of Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes | MAT WG, RIPE 79 |...

Page 1: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE 79 | 2019-10-17

Comparing RIPE Atlasand CAIDA Ark Datausing BigQuery

Page 2: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 2

RIPE Atlas• Collect a lot of measurement data

• We provide no general means to filter or analyse that data

Page 3: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 3

CAIDA Ark• https://www.caida.org/projects/ark/

• Goal: topology discovery

Page 4: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 4

Google BigQuery• General platform for data warehousing and analysis

• Feedback loop: “is this useful?”

Page 5: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 5

Google BigQuery• Can manage this much data without flinching

• Few hardware limitations

• Queries are fast, but not real-time- think: O(seconds — minutes), not O(milliseconds)

- responsive enough to iterate during data exploration

• Allows exploration of the full dataset in new ways- ask for results related to ANY measurement

- and you can combine with any other dataset you have access to

Page 6: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 6

Using BigQuery: Ark + Atlas• Two quick views of the data in this talk

- How long it takes each dataset to reveal responsive ASNs

- How long it takes each dataset to reveal AS adjacencies

Page 7: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 7

Prefix Matching

Page 8: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 8

Prefix Matching

Page 9: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 9

Longest Prefix Matching

Page 10: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 10

Responsive ASNs

Page 11: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 11

Responsive ASNs, combined

Page 12: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 12

Looking at Adjacencies• First: build tables of IPn — IP(n+1) adjacencies

• Then: build tables of ASn — AN(n+1) adjacencies

Page 13: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 13

Looking at Adjacencies• This is one place where SQL doesn’t quite cut it

• But, BigQuery permits arbitrary code

Page 14: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 14

Arbitrary Code

Page 15: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 15

AS Adjacencies

Page 16: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 16

AS Adjacencies, combined

Page 17: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 17

Direction of Travel• BigQuery doesn’t solve all problems but it’s extremely powerful

• Ultimately: open up access to (some of?) our data via BigQuery

• We’d like something that looks like the following:- Storage cost, data availability: the NCC’s responsibility

- Computation cost and further storage: on anybody who wants to query the data

Page 18: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

Stephen Strowes <[email protected]> | MAT WG, RIPE79 | 2019-10-17 18

Upcoming• Articles on:

- the work required in the BigQuery platform to query this data

- the distinctions between the two datasets

Page 19: Comparing RIPE Atlas and CAIDA Ark Data using BigQuery · Stephen Strowes  | MAT WG, RIPE 79 | 2019-10-17 Comparing RIPE Atlas and CAIDA Ark Data using BigQuery

?QuestionsStephen Strowes <[email protected]>@sdstrowes