Apache Cassandra
Jonathan Ellis, CTO DataStax
Dec 7, 2011
Real NoSQL Applications in the Enterprise Today.
Today’s Database Challenge
Navigating the NoSQL waters
Distributed Horizontally scalable Eventually consistent Non-relational
Column store Document stores Key-value Graph … and more
Cassandra: the best for “big data”
Elegant architecture Operational flexibility Industry-leading performance
You should be using Cassandra for applications requiring:
high-performance, real-time queries
scalability past one machine
bulletproof reliability
Bigtable, 2006 Dynamo, 2007
OSS, 2008
Incubator, 2009 TLP, 2010 1.0, October 2011
Cassandra Highlights
Multi-master, multi-DC
Linearly scalable
Larger-than-memory datasets
High performance
Full durability
Integrated caching
Tuneable consistency
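To make "tuneable consistency" concrete, here is a minimal sketch of the usual replica-overlap arithmetic: with N replicas, a read that contacts R replicas is guaranteed to overlap a write acknowledged by W replicas whenever R + W > N. The function names are hypothetical and chosen for illustration; this is not Cassandra's own code, only the rule of thumb behind choosing levels such as ONE, QUORUM, and ALL.

```python
def quorum(replication_factor: int) -> int:
    """Smallest number of replicas that forms a majority of the replica set."""
    return replication_factor // 2 + 1


def reads_overlap_writes(n: int, w: int, r: int) -> bool:
    """True when any r read replicas must share at least one replica with
    any w write replicas, out of n replicas in total (r + w > n)."""
    if not (1 <= w <= n and 1 <= r <= n):
        raise ValueError("w and r must each be between 1 and n")
    return r + w > n


if __name__ == "__main__":
    n = 3                      # assumed replication factor
    q = quorum(n)              # majority of three replicas
    # QUORUM writes with QUORUM reads
    print("quorum/quorum overlaps:", reads_overlap_writes(n, w=q, r=q))
    # ONE writes with ONE reads
    print("one/one overlaps:", reads_overlap_writes(n, w=1, r=1))
```

Lower values of R and W trade that overlap guarantee for lower latency and higher availability, which is the knob the highlight refers to.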
A single four-core machine; one million inserts + one million updates
Performance
The Cassandra Difference
Scalable Performance
Oracle Exadata ✖ ✔ ✔
MySQL ✖ ✔ ✔
Sharding ✔ ✔ ✖
MongoDB ✔
Operational Ease
Cost Effective
Cassandra ✔ ✔ ✔
HBase ✔ ✖ ✔
And when it comes to Performance, we’re unmatched.
✖ ✔
“With Cassandra, we get better business agility, and we don’t have to plan capacity in advance, we don’t need to ask permission of other people to build things for us, and we don’t worry about running out of space or power.”
Adrian Cockcroft, Cloud Architect
Netflix’s problems
Could not build datacenters fast enough
Made decision to go to cloud (AWS)
Cassandra on AWS is a key infrastructure component of its globally distributed streaming product.
Applications include Netflix’s subscriber system, AB testing, and viewing history service (including positions at which members stopped watching a streaming program).
Netflix on Cassandra TODO Fast Cheap Scalable Flexible No SPOF
“Without Cassandra, our engineers would’ve had to create something that could scale to our needs, that would’ve prevented us from focusing on building product and solving problems for Backupify’s users, which are far more important tasks.”
Matt Conway, VP Engineering
Backupify’s problem
Cloud-based utility that enables businesses and consumers to back up, search and restore the content of popular online applications such as Google Apps, Gmail, Facebook, Twitter, and Blogger
Needs:
Horizontal scaling
Ability to handle high write loads
Elasticity with no manual sharding
Backupify on Cassandra
Ease of scale enabled engineers to focus on building great applications
DataStax OpsCenter made it easy to monitor the health and performance of their cluster
Reliable, redundant and scalable low-balance data storage helped eliminate downtime
Ability to offer not only backup and storage but, eventually, analysis of data
“You can seamlessly add new nodes and expand your total capacity without deteriorating the performance of the data store. Cassandra has allowed us to scale very effectively.”
Harry Robertson, Tech Lead
Ooyala’s problem
Ooyala provides a suite of technologies and services that support content owners in managing, analyzing and monetizing the digital video they publish online
Needs:
Elasticity, to respond to spikes in data scale
Ability to respond to increasingly sophisticated analytic needs of customers
Ooyala on Cassandra
Classic “Big Data” problem did not require re-architecting
Application agility was enabled: developers spend time building cool apps, not figuring out how to scale
Enabled more powerful and granular analytics for their customers
“Cassandra has allowed us to build bigger features faster and more reliably, while using less money and without needing to expand our staff.”
Kyle Ambroff, Sr. Engineer
Formspring’s problem
Users of Formspring engage with and learn more about each other by asking and responding to questions. With close to 4B responses in the system and 30M unique users, they needed:
To support explosive growth
To seamlessly syndicate user content
To avoid sharding
Application flexibility
Formspring on Cassandra
No sharding needed: just add nodes to scale
Performance: the popular users with many followers saw no speed reduction. No more memcached!
Flexibility of a schema-optional architecture is very developer friendly
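The "just add nodes" point can be illustrated with a simplified token ring. This is a sketch under stated assumptions, not Cassandra's actual partitioner: the `Ring` class, the 32-bit token derived from MD5, and the node names are all hypothetical. It shows the property that matters here: when a node joins, the only keys that change owner are the ones the new node takes over, so no application-level resharding is needed.

```python
import bisect
import hashlib


def token(value: str) -> int:
    """Map a string to a position on the ring (first 4 bytes of its MD5)."""
    digest = hashlib.md5(value.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big")


class Ring:
    """Toy ring: each node owns the keys whose token falls at or before
    its own token, going clockwise from the previous node's token."""

    def __init__(self) -> None:
        self._tokens = []   # sorted node tokens
        self._owners = {}   # node token -> node name

    def add_node(self, name: str) -> None:
        t = token(name)
        if t in self._owners:
            raise ValueError("token collision for node " + name)
        bisect.insort(self._tokens, t)
        self._owners[t] = name

    def owner(self, key: str) -> str:
        if not self._tokens:
            raise LookupError("ring is empty")
        # First node token >= key token; wrap to the start past the end.
        i = bisect.bisect_left(self._tokens, token(key))
        return self._owners[self._tokens[i % len(self._tokens)]]


if __name__ == "__main__":
    ring = Ring()
    for name in ("node-a", "node-b", "node-c"):
        ring.add_node(name)
    keys = ["user:%d" % i for i in range(1000)]
    before = {k: ring.owner(k) for k in keys}
    ring.add_node("node-d")
    moved = [k for k in keys if ring.owner(k) != before[k]]
    print(len(moved), "of", len(keys), "keys moved, all to the new node")
```

Real clusters also replicate each range to several nodes and choose tokens so that load stays balanced; the sketch leaves both out to keep the ownership rule visible.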
Why DataStax? DataStax delivers database products and services based on Apache Cassandra from experts who are at the forefront of today's data revolution.
Database Software & Tools: DataStax Enterprise, DataStax Community, DataStax OpsCenter, Drivers & Connectors
Support & Services: Production Support, Consultative Help, Professional Training, Online Documentation
DataStax Overview
Founded in April 2010
Commercial leader in Apache Cassandra™, the popular open-source “big data” database
Headquartered in San Francisco Bay area
100+ customers
35+ employees (split between San Francisco and Austin)
Home to Apache Cassandra Chair & most committers
Secured $11M in Series B funding in Sep 2011
DataStax Value
The simplest way to get started with Apache Cassandra: DataStax Community Edition
A smart, integrated platform that provides Analytics and Real-Time capabilities in the same database, without any resource contention: DataStax Enterprise
The backing of the Cassandra Experts
100+ customers
DataStax Enterprise
1. DataStax Enterprise Database Server
2. OpsCenter Enterprise Management solution
3. Expert production support & consultative services
Enterprise Database Server
Leverages resources on-premise or in the cloud
Guarantees uptime with a master-less distributed architecture
Allows for fast application changes via flexible schemas
Handles structured, semi-structured, and unstructured data
Provides advanced security
Eliminates the need for separate analytics system
[Diagram: nodes 1 to 6; labels: Real-Time, Analytics, Replication]
Enterprise-class database built to handle today’s big-data needs in a cost-effective, easy, and reliable way.
OpsCenter Enterprise
Visual, browser-based user interface
Administration tasks carried out in point-and-click fashion
Allows for visual rebalance of data across a cluster when new nodes are added
Proactive alerts that warn of impending issues
Built-in external notification abilities
OpsCenter Enterprise supplies management, monitoring, and control over DataStax Enterprise
Expert Production Support
DataStax Enterprise includes production support and consultative services from the Cassandra experts.
Support service level agreements that range from business hours to 24x7x365
Consultative support for assistance on architecture, design, and tuning
Certified quarterly service packs
Hot-fix support
DataStax Enterprise Compared
Scalable Performance
Oracle Exadata ✖ ✔ ✔
MySQL ✖ ✔ ✔
Sharding ✔ ✔ ✖
MongoDB ✔
Operational Ease
Cost Effective
DataStax Enterprise ✔ ✔ ✔
HBase ✔ ✖ ✔
✖ ✔
Real-Time + Analytics
✔
✖
✔
✖
✖
✖
Oracle NoSQL DB ✔ ✔ ? ✖
DataStax: Your One-Stop Shop
DataStax Enterprise and Community Editions
Professional Training, Expert Consulting
Documentation and Dev Center
http://www.datastax.com/docs
http://www.datastax.com/dev
Whitepapers, Case Studies, FAQs and more
http://www.datastax.com/resources/whitepapers
http://www.datastax.com/resources/casestudies
Thank you!