Debugging & Tuning in Spark

Shiao-An Yuan@sayuan

2016-08-11

Spark Overview

● Cluster Manager (aka Master)● Worker (aka Slave)

● Driver● Executor

http://spark.apache.org/docs/latest/cluster-overview.html

RDD (Resilient Distributed Dataset)

A fault-tolerant collection of elements that can be operated on in parallel

Word Count

val sc: SparkContext = ...

val result = sc.textFile(file) // RDD[String]

.flatMap(_.split(" ")) // RDD[String]

.map(_ -> 1) // RDD[(String, Int)]

.groupByKey() // RDD[(String, Iterable[Int])]

.map(x => (x._1, x._2.sum)) // RDD[(String, Int)]

.collect() // Array[(String, Int])

Lazy, Transformation, Action, Job

groupByKey mapmapflatMap collect

Partition, Shuffle

Stage, Task

DAG (Directed Acyclic Graph)

● RDD operations○ Transformation○ Action

● Lazy● Job● Shuffle● Stage● Partition● Task

Objective

1. A correct and parallelizable algorithm2. Parallelism3. Reduce the overhead from parallelization

Correctness and Parallelizable

● Use small input● Run locally

○ --master local○ --master local[4]○ --master local[*]

Non-RDD Operations

● Avoid long blocking on driver

Data Skew

● repartition() come to rescue?● Hotspots

○ Choose another partitioned key○ Filter unreasonable data

● Trace to it’s source

https://databricks.gitbooks.io/databricks-spark-knowledge-base/content/best_practices/prefer_reducebykey_over_groupbykey.html

Prefer reduceByKey() over groupByKey()

● reduceByKey() combines output before shuffling the data

● Also consider aggregateByKey()● Use groupByKey() if you really

know what you are doing

Shuffle Spill

● Increase partition count● spark.shuffle.spill=false (default since Spark 1.6)● spark.shuffle.memoryFraction● spark.executor.memory

http://www.slideshare.net/databricks/new-developments-in-spark

● partitionBy()● repartitionAndSortWithinPartitions()● spark.sql.autoBroadcastJoinThreshold (default 10 MB)● Join it manually by mapPartitions()

○ Broadcast small RDD■ http://stackoverflow.com/a/17690254/406803

○ Query data from database■ https://groups.google.com/a/lists.datastax.com/d/topic/spark-connector-user/63ILfPqPRYI/discussion

Broadcast Small RDD

val smallRdd = ...

val largeRdd = ...

val smallBroadcast = sc.broadcast(smallRdd.collectAsMap())

val joined = largeRdd.mapPartitions(iter => {

val m = smallBroadcast.value

(k, v) <- iter

if m.contains(k)

} yield (k, (v, m.get(k).get))

}, preservesPartitioning = true)

Query Data from Cassandra

val conf = new SparkConf()

.set("spark.cassandra.connection.host", "127.0.0.1")

val connector = CassandraConnector(conf)

val joined = rdd.mapPartitions(iter => {

connector.withSessionDo(session => {

val stmt = session.prepare("SELECT value FROM table WHERE key=?")

iter.map {

case (k, v) => (k, (v, session.execute(stmt.bind(k)).one()))

Persist

● Storage level○ MEMORY_ONLY○ MEMORY_AND_DISK○ MEMORY_ONLY_SER○ MEMORY_AND_DISK_SER○ DISK_ONLY○ …

● Kryo serialization○ Much faster○ Registration needed

http://spark.apache.org/docs/latest/programming-guide.html#which-storage-level-to-choose

Common Failures

● Large shuffle blocks○ java.lang.IllegalArgumentException: Size exceeds Integer.MAX_VALUE

■ Increase partition count○ MetadataFetchFailedException, FetchFailedException

■ Increase partition count■ Increase `spark.executor.memory`■ …

○ java.lang.OutOfMemoryError: GC overhead over limit exceeded■ May caused by shuffle spill

java.lang.OutOfMemoryError: Java heap space

● Driver○ Increase `spark.driver.memory`○ collect()

■ take()■ saveAsTextFile()

● Executor○ Increase `spark.executor.memory`○ More nodes

java.io.IOException: No space left on device

● SPARK_WORKER_DIR● SPARK_LOCAL_DIRS, spark.local.dir● Shuffle files

○ Only delete after the RDD object has been GC

Other Tips

● Event logs○ spark.eventLog.enabled=true○ ${SPARK_HOME}/sbin/start-history-server.sh

Partitions

● Rule of thumb: ~128 MB per partition● If #partitions <= 2000, but close, bump to just > 2000

● Increase #partitions by repartition()● Decrease #partitions by coalesce()● spark.sql.shuffle.partitions (default 200)

http://www.slideshare.net/cloudera/top-5-mistakes-to-avoid-when-writing-apache-spark-applications

Executors, Cores, Memory!?

● 32 nodes● 16 cores each● 64 GB of RAM each● If you have an application need 32 cores, what is the

correct setting?

http://www.slideshare.net/cloudera/top-5-mistakes-to-avoid-when-writing-apache-spark-applications

Why Spark Debugging / Tuning is Hard?

● Distributed● Lazy● Hard to do benchmark● Spark is sensitive

Conclusion

● When in doubt, repartition!● Avoid shuffle if you can● Choose a reasonable partition count● Premature optimization is the root of all evil -- Donald Knuth

Reference

● Tuning and Debugging in Apache Spark● Top 5 Mistakes to Avoid When Writing Apache Spark

Applications ● How-to: Tune Your Apache Spark Jobs (Part 1)● How-to: Tune Your Apache Spark Jobs (Part 2)

Debugging & Tuning in Spark

Data & Analytics

Transcript of Debugging & Tuning in Spark

Spark Tuning for Enterprise System Administrators

Tuning Spark Streaming for Throughput _ Virdata

JVM and OS Tuning for accelerating Spark application

1 Hadoop Data Analytics - Multisoft Virtual Academy€¦ · · 2017-09-01Data Sampling and Debugging . ... 2 Apache Spark 1 An Introduction to Spark ... Understand performance tuning

Spark Tuning Guide on 3rd Generation Intel® Xeon® Scalable ......Spark Tuning Guide for 3rd Generation Intel® Xeon® Scalable Processors Based Platforms Revision 1.0 Page 4 | Total

BigDebug:)Debugging)Primitives)for) Interactive)Big)Data ...web.cs.ucla.edu/~gulzar/assets/pdf/icse2016-bigdebug-slides.pdf · BigDebug:)Debugging)Primitives)for) Interactive)Big)Data)Processing)in)Spark

Debugging, benchmarking, tuning i.e. software development ...Debugging, benchmarking, tuning i.e. software development tools. Martin . Č. uma Center for High Performance Computing

Understanding Spark Tuning - O'Reilly

TUNING - custom-chrome-europe.com€¦ · 6 EASY TUNING Manual adjustment only DIAG4 TUNE – EASY MODE Available Functions • Fuel Trim Table Adjustment +/- 20% • Spark Advance

Hortonworks Data Platform - Apache Spark Component …€¦ · · 2018-04-15Hortonworks Data Platform: Apache Spark Component Guide ... Tuning Spark ... and debugging Spark shell

WCT1001A/WCT1003A V4.2 Run-Time Debugging · 2019. 9. 13. · Run-time tuning and debugging WCT1001A/WCT1003A V4.2 Run-Time Debugging, User's Guide, Rev. 1, 07/2019 2 NXP Semiconductors

Debugging and Tuning Mobile Web Sites with Modern Web Browsers

The Fifth Elephant 2016: Self-Serve Performance Tuning for Hadoop and Spark

A Year With Spark - Meetupfiles.meetup.com/13722842/Spark Meetup.pdfSpark at scale: Big Data Example Tuning and Performance Spark at Scale: Big Data Example Yes, we use Spark !! Not

Tuning tips for Apache Spark Jobs

Debugging/Tuning Queries via iSeries Navigator Tom McKinley Mac2@us.ibm.com.

Contents€¦ · SmartCarb Operation & Tuning Tuning Top End (continued) C. Examine your spark plug for lean or rich indicators. 1. Lean condition indicators on the spark plugs when

Spark Tuning for Enterprise System Administrators By Anya Bida

Debugging PySpark: Spark Summit East talk by Holden Karau

Towards(aBig(DataDebugger(in( Apache(Spark( - HPTS - … · · 2015-10-02Tuning(Spark(Applicaons ... (“debugging(toolkits”(on(Apache(Spark(where(features(operate(at scale(and(impose(minimal(overheads(on((normal)(program(execu>on(•