Ceph Day Santa Clara: Ceph at DreamHost

Ceph at DreamHost: A Storage Journey

At Ceph Day Santa Clara, Dallas Kashuba, co-founder of DreamHost, walks through why the company went with Ceph.

Transcript of Ceph Day Santa Clara: Ceph at DreamHost

Page 1: Ceph Day Santa Clara: Ceph at DreamHost

Ceph at DreamHost

A Storage Journey

Page 2: Ceph Day Santa Clara: Ceph at DreamHost

About Me

• One of the original four of DreamHost

• Still active daily at DreamHost

• Have spent a lot of time working on the Ops side.

Page 3: Ceph Day Santa Clara: Ceph at DreamHost

• Hosting company founded in 1997

• Sage’s other company

• shared hosting, virtual servers, dedicated servers, cloud storage, cloud computing

• 375k customers, 1.3MM websites

Page 4: Ceph Day Santa Clara: Ceph at DreamHost

Storage Journey

A long strange trip

Page 5: Ceph Day Santa Clara: Ceph at DreamHost

His name was Destro

Page 6: Ceph Day Santa Clara: Ceph at DreamHost

... and then there were more.

Page 7: Ceph Day Santa Clara: Ceph at DreamHost

The First NetApp

Page 8: Ceph Day Santa Clara: Ceph at DreamHost

Remote Failover

Page 9: Ceph Day Santa Clara: Ceph at DreamHost

Remote Failover

Page 10: Ceph Day Santa Clara: Ceph at DreamHost

Meanwhile...

Page 11: Ceph Day Santa Clara: Ceph at DreamHost

... and still more.

Page 12: Ceph Day Santa Clara: Ceph at DreamHost

Lots of NetApps

• Peak of around 125 individual NetApps

• Smallish capacity on each (8TB)

• Internal software continuously moving data between NetApps

• Lots of time spent managing nearly full filers
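
The balancing act on this slide can be sketched roughly as follows. This is a hypothetical illustration, not DreamHost's internal software: the `Filer` type, the filer names, the 90% threshold, and the "move to the emptiest filer" rule are all assumptions. Only the 8 TB per-filer capacity and the count of around 125 filers come from the slide.

```python
from dataclasses import dataclass

FILER_CAPACITY_TB = 8.0   # per-filer capacity, from the slide
PEAK_FILER_COUNT = 125    # "around 125", from the slide


@dataclass
class Filer:
    name: str        # hypothetical identifier
    used_tb: float   # space currently in use

    @property
    def fullness(self) -> float:
        return self.used_tb / FILER_CAPACITY_TB


def plan_moves(filers, threshold=0.9):
    """Return (source, destination) name pairs for nearly full filers.

    Assumption: every filer at or above `threshold` sheds data to the
    emptiest filer that is itself below the threshold.
    """
    targets = sorted(
        (f for f in filers if f.fullness < threshold),
        key=lambda f: f.fullness,
    )
    if not targets:
        return []
    return [(f.name, targets[0].name) for f in filers if f.fullness >= threshold]


filers = [Filer("filer-a", 7.6), Filer("filer-b", 3.1), Filer("filer-c", 5.0)]
print(plan_moves(filers))                    # [('filer-a', 'filer-b')]
print(PEAK_FILER_COUNT * FILER_CAPACITY_TB)  # 1000.0 TB across the fleet
```

At the stated peak, that is roughly 1 PB spread over about 125 separately managed pools, which is why so much time went into watching individual filers fill up.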

Page 13: Ceph Day Santa Clara: Ceph at DreamHost

Ideal

Page 14: Ceph Day Santa Clara: Ceph at DreamHost

Reality

Page 15: Ceph Day Santa Clara: Ceph at DreamHost

Hosting Landscape

• Included storage had grown from 50MB to gigabytes, then terabytes.

• Prices stayed the same.

• Eventually went to unlimited storage

• Usage per customer skyrocketed.

Page 16: Ceph Day Santa Clara: Ceph at DreamHost

Failed Experiments

Page 17: Ceph Day Santa Clara: Ceph at DreamHost

Failed Experiments

• ATAoE and XFS-based systems

• Performance & Stability issues

• 2006 era gear

Page 18: Ceph Day Santa Clara: Ceph at DreamHost

Failed Experiments

• High capacity

• Nice features

• Expensive

• 85% full and it failed

Page 19: Ceph Day Santa Clara: Ceph at DreamHost

Some Success

• First on Sun hardware then Supermicro

• Great stability

• Not enough IO for front-line network storage

Page 20: Ceph Day Santa Clara: Ceph at DreamHost

Back to Basics

Page 21: Ceph Day Santa Clara: Ceph at DreamHost

Local RAID

• SATA drives had grown in capacity and were very cheap

• 4-6TB per hosting server

• Less dependence on congested network

• Smaller failure domains

The Good

Page 22: Ceph Day Santa Clara: Ceph at DreamHost

Local RAID

• No more quotas; scanning the filesystem was too slow

• No more fast failovers

• Multi-hour filesystem checks with ext3

• More failure domains

The Bad

Page 23: Ceph Day Santa Clara: Ceph at DreamHost

Local RAID

• Complete RAID loss more common than anticipated

• Multiple days to fully restore from backup

The Ugly

Page 24: Ceph Day Santa Clara: Ceph at DreamHost

Storage Today

Light at the end of the tunnel

Page 25: Ceph Day Santa Clara: Ceph at DreamHost

Hybrid Mix

• We learned something from every step of the way

• No one size fits all when it comes to storage

• Use whatever is best for the job

• Be ready to change

Best Tool For The Job

Page 26: Ceph Day Santa Clara: Ceph at DreamHost

A Bit of Everything

• Clustered NetApps and NFS for email

• Local RAID in hosting servers

• ZFS and OpenSolaris backup servers

• Ceph for DreamObjects and DreamCompute

Best Tool For The Job

Page 27: Ceph Day Santa Clara: Ceph at DreamHost

• Object Storage, S3/Swift compatible

• 2+ Petabytes raw storage

• 3x replication, 900+ OSDs

• RGW behind HAProxy

• Row, rack, node and disk fault tolerant
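
As a rough reading of these figures, the sketch below works out what 3x replication implies for usable space and shows one simple way to check the "row fault tolerant" property. It is not the CRUSH algorithm and not DreamHost's configuration: the raw capacity is taken at the 2 PB lower bound (with 1 PB = 1000 TB), and the placement tuples are made-up examples.

```python
def usable_tb(raw_tb, replicas):
    """Usable capacity when every object is stored `replicas` times."""
    return raw_tb / replicas


def distinct_rows(placement):
    """True if no two replicas of an object share a row.

    `placement` is a list of (row, rack, node, disk) tuples, one per
    replica. Replicas in different rows also sit in different racks,
    nodes, and disks, so this single check covers all four levels.
    """
    rows = [row for row, _rack, _node, _disk in placement]
    return len(set(rows)) == len(rows)


RAW_TB = 2000.0   # "2+ Petabytes" taken at its lower bound (assumption)
REPLICAS = 3      # from the slide

print(round(usable_tb(RAW_TB, REPLICAS)))   # 667

# Hypothetical placement of one object's three replicas.
good = [("row1", "rack3", "node12", "disk4"),
        ("row2", "rack1", "node7", "disk0"),
        ("row3", "rack5", "node2", "disk9")]
bad = [("row1", "rack3", "node12", "disk4"),
       ("row1", "rack4", "node15", "disk1"),
       ("row2", "rack1", "node7", "disk0")]

print(distinct_rows(good))   # True
print(distinct_rows(bad))    # False
```

With three copies of everything, roughly a third of the raw space holds unique data, and losing any single row, rack, node, or disk still leaves two copies of each object when replicas are spread this way.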

Page 28: Ceph Day Santa Clara: Ceph at DreamHost

• OpenStack-based Public Cloud

• 3+ Petabytes raw storage

• All storage is on Ceph RBD

• Boot and Attachable Volumes

• Nicira SDN + Ceph, Live Migration

Page 29: Ceph Day Santa Clara: Ceph at DreamHost
Page 30: Ceph Day Santa Clara: Ceph at DreamHost

CephFS & The Future

• The return of failovers

• No more backup servers

• No more major disk-related outages

• Fault-tolerant, low-cost hosting

Storage Panacea?

Page 31: Ceph Day Santa Clara: Ceph at DreamHost

Thanks!

@dallas

[email protected]