Scaling spark

36
SCALING SPARK ON AWS THE JOURNEY

Transcript of Scaling spark

Page 1: Scaling spark

SCALING SPARK ON AWS

THE JOURNEY

Page 2: Scaling spark

ABOUT US

Page 3: Scaling spark

Alex Rovner, Director of Data EngineeringMedia Platform

Processing Terabytes Daily

Page 4: Scaling spark

PRIOR STATETWO CLUSTERS

CORE & ANALYTICSBOTH IN COLO

Page 5: Scaling spark

CHALLENGES

Page 6: Scaling spark

CHALLENGESSCALABILITYELASTICITY

AGILITY

Page 7: Scaling spark

SPARK

Page 8: Scaling spark

SPARKSCALABLE

FRIENDLY APIPYTHON SUPPORT

Page 9: Scaling spark

AWS

Page 10: Scaling spark

ON-DEMAND COMPUTEFLEXIBLE TERMS

AWS

Page 11: Scaling spark

INSTANCES

Page 12: Scaling spark

D2.8XLARGE48TB OF EPHEMERAL STORAGE

244 GB RAM38 V-CPU

INSTANCES

Page 13: Scaling spark

INSTANCES

WHY EPHEMERAL?

Page 14: Scaling spark

INSTANCES

RESERVED VS ON-DEMAND?

Page 15: Scaling spark

INSTANCES

SPOT?

Page 16: Scaling spark

SPOT

INSTANCESHDFS

D2

D2

D2

D2

Page 17: Scaling spark

INSTANCESWAIT, WHAT ABOUT DATA

LOCALITY?

Page 18: Scaling spark

HADOOP

Page 19: Scaling spark

WHAT ABOUT EMR?

HADOOP

Page 20: Scaling spark

HADOOPCDH 5.3

SPARK 1.2 ON YARN

Page 21: Scaling spark

HADOOPCDH 5.4

SPARK 1.3 ON YARN

Page 22: Scaling spark

HADOOPCDH 5.4

SPARK 1.5 ON YARN

Page 23: Scaling spark

HADOOPRUN THE LATEST VERSION!

TECH.MAGNETIC.COM

Page 24: Scaling spark

AUTO SCALE

Page 25: Scaling spark

CALCULATE CLUSTER UTILIZATION

QUERY CM APIV-CORES AVAILABLE, USED &

PENDING

AUTO SCALE

Page 26: Scaling spark

CALCULATE TARGET CAPACITYTARGET 80% UTILIZATION

LIMIT DOWNSIZING

AUTO SCALE

Page 27: Scaling spark

ADJUST CAPACITY

AUTO SCALE

Page 28: Scaling spark

SPEED BUMPS

Page 29: Scaling spark

SPEED BUMPSAPPLICATION MASTER ON SPOT

YARN LABELS

Page 30: Scaling spark

SPEED BUMPSUSERS ARE IMPATIENTITS NEVER ENOUGH

Page 31: Scaling spark

SPEED BUMPSI AM LOST!YARN LOGS

SET YARN OVERHEADCHECK GC TIME

INCREASE EXECUTOR MEMORYTRY AGAIN

Page 32: Scaling spark

SPEED BUMPS

BROADCASTING IS EVIL

Page 33: Scaling spark

SPEED BUMPSBROADCASTING “LARGE”

DATASETS IS EVIL

Page 34: Scaling spark

CURRENT STATE

THREE CLUSTERSANALYTICS & STREAMING (AWS)CORE (COLO - MOVING SOON!)

Page 35: Scaling spark

BIG SUCCESS!

Page 36: Scaling spark

QUESTIONS?