In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite...

15
© 2018 GridGain Systems, Inc. In-Memory Performance Durability of Disk

Transcript of In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite...

Page 1: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

In-Memory Performance Durability of Disk

Page 2: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

Apache Ignite and Apache Spark

Where Fast Data Meets the IoT

Akmal ChaudhriGridGain Systems

Page 3: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

• IoT Demands to Software• IoT Software Stack• Device OS/RTOS• Data Collection and Enrichment• NewSQL Database• Application APIs

• Demo

Agenda

Page 4: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

IoT Demands to Software

Real-time Processing

SQL, Geo-Spatial

Analytics (BI, ML)

High-Availability

Simple Scalability

Page 5: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

IoT Software Stack

Device OS/Real-Time OS

Data Collection and Enrichment

NewSQL Database

Application APIs

Page 6: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

Apache IoT Software Stack

Device OS/Real-Time OS

Data Collection and Enrichment

NewSQL Database

Application APIs

Page 7: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

Apache MyNewt

Open Source RTOS Cortex M, MIPS Bluetooth, Wifi,

TCP/IP

Secured Bootloader

Remote Firmware Upgrade

Page 8: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

Data Collection and Enrichment

DURABLE MEMORY

DURABLE MEMORY

Ignite Cluster

Page 9: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

Apache Ignite Database, Caching and Processing Platform

Memory-Centric Storage

Ignite Native Persistence(Flash, SSD, Intel 3D XPoint)

Third-Party Persistence(RDBMS, HDFS, NoSQL)

SQL Transactions Compute Services MLStreamingKey/Value

IoTFinancialServices

Pharma &Healthcare

E-CommerceTravel & Logistics

Telco

Page 10: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

Ignite and Spark Integration

Spark Application

Spark Worker

S park

Job

S park

Job

Yarn Mesos Docker HDFS

Spark Worker

S park Job

S park Job

Spark Worker

S park Job

S park Job

In-Memory Shared RDD or DataFrame

GridGain Node GridGain Node GridGain Node

Share state and data among

Spark jobs

No data movement

Boost DataFrame and SQL Performance

SQL on top of RDDs

In-place query execution

Page 11: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

1. Initial Query

2. Query execution over local data3. Reduce multiple results in one

Ignite Node

CanadaToronto

OttawaMontreal

Calgary

Ignite Node

India Mumbai

New Delhi

1

2

23

SQL Queries Execution Flow

Page 12: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

• Distributed memory-centric database • Ingests data from HDFS or another storage

• Fully fledged compute platform: SQL, transactions, key-value, collocated processing, ML/DL

• Streaming and compute engine

• OLAP and OLTP • Inclined towards OLAP and focused on MR payloads

Comparing Ignite and Spark

Page 13: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

Ignite is a memory-centric store for Spark

• No data movement from Ignite to Spark

• In-place query execution

• Boost DataFrame and SQL performance

• Share state and data among Spark jobs

• Faster data and streaming analytics

Ignite and Spark Together

+

Page 14: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

DEMO

Page 15: In-Memory Performance Durability of Diskapachecon.com/euroadshow18/ignite-spark-iot.pdf · Ignite is a memory-centric store for Spark • No data movement from Ignite to Spark •

© 2018 GridGain Systems, Inc.

Any Questions?

Thank you for joining us. Follow the conversation.http://ignite.apache.org

#apacheignite