Introduction to Apache Cassandra

An Introduction to Cassandra

Sydney Tech Day

instaclustr.com @Instaclustr

Who am I and what do I do?• Adam Zegelin

• Co-founder and Chief Architect of Instaclustr -> www.instaclustr.com

• Instaclustr provides Cassandra-as-a-Service in the cloud.

• Currently in AWS, Azure and Google Cloud in private beta with more to come.

• We currently manage 50+ nodes for various customers, who do various things with it.

Objectives

• A quick history of Databases

• Introducing Cassandra

1980 - Stand Alone and Mainframes

1990 - 2005 Networked Computing

2005+ Real Time Web and Big Data

1980 - Stand Alone and Mainframes

1990 - 2005 Networked Computing

Early 2000’s

• What happens when you have more data than could fit on a single server?

Throw money away at the problem

Lets try a little computer science instead

• BigTable (2006) - 1 Key: Lots of values, Fast sequential access

• Dynamo (2007) - Reliable, Performant, Always On,

• Cassandra (2008) - Dynamo Architecture, BigTable data model and storage

One database, many servers• All servers (nodes) participate

in the cluster

• Shared nothing

• Need more capacity add more servers

• Multiple servers == built in redundancy

What are the benefits to this approach• Linear scalability

Linear scalability

• High Availability (No single point of failure)

What are the benefits to this approach

“During Hurricane Sandy, we lost an entire data center. Completely. Lost. It. Our

application fail-over resulted in us losing just a few moments of serving requests for a

particular region of the country, but our data in Cassandra never went offline.”

Nathan Milford, Outbrain’s head of U.S. IT operations management

• High Availability

• Use commodity hardware

How does it work ?0

PartitioningName Age Postcode Gender

Alice 34 2000 F

Bob 26 2000 M

Eve 25 2004 F

Frank 41 2902 M

How does it work ?

client

consistentHash(“Alice”)

Replication Factor = 3

How do we keep data consistent ?

client

CL.ONE

client

CL.ALL

AckAck

client

CL.QUORUM

Also supports multi-dc replication

client

Add capacity1

client

Thank you!

Introduction to Apache Cassandra

Technology

Transcript of Introduction to Apache Cassandra

Cassandra Day Denver 2014: Introduction to Apache Cassandra

Apache Cassandra on AWS · PDF fileAmazon Web Services – Apache Cassandra on AWS January 2016 Page 3 of 52 Notices 2 Abstract 4 Introduction 4 NoSQL on AWS 5 Cassandra: A Brief Introduction

Apache Cassandra - Einführung

Cassandra Community Webinar - Introduction To Apache Cassandra 1.2

Introduction to CQL and Data Modeling with Apache Cassandra

An Introduction to Apache Cassandra

Taller Apache Cassandra - eventos.citius.usc.eseventos.citius.usc.es/bigdata/workshops/Cassandra.pdf · Introducción Que es Apache Cassandra 3 Apache Cassandra es un motor de bases

Introduction to Apache Cassandra and support within WSO2 Platform

Introduction to data modeling with apache cassandra

Using Apache Spark, Apache Kafka and Apache Cassandra...USING APACHE SPARK, APACHE KAFKA AND APACHE CASSANDRA TO POWER INTELLIGENT APPLICATIONS | 02 Apache Cassandra is well known

About "Apache Cassandra"

Apache Cassandra

Introduction to Cassandra • Why Spark - Apache Cassandra | Apache Kafka | Apache Spark · 2017. 12. 20. · • Introduction to Cassandra • Why Spark + Cassandra • Problem background

Apache Cassandra From The Ground Up - Leanpubsamples.leanpub.com/apachecassandrafromthegroundup-sample.pdfAnIntroductionToNoSQL&Apache Cassandra WelcometoApacheCassandrafromTheGroupUp.Theprimarygoalofthisbooktohelpdevelopers

Cassandra Community Webinar: Apache Cassandra Internals

Diaposotivas apache-cassandra

Apache cassandra an introduction

Cassandra Day Atlanta 2015: Introduction to Apache Cassandra & DataStax Enterprise

Conhecendo Apache Cassandra Meetup - Conhecendo … · Eiti Kimura Coordenador de TI na Movile - Apache Cassandra MVP 2015 - Apache Cassandra MVP 2014 - Contribuidor Apache Cassandra

NoSQL Apache Cassandra