Salvatore Sanfilippo – How Redis Cluster works, and why - NoSQL matters Barcelona 2014


Redis Cluster: design tradeoffs

@antirez - Pivotal

What is performance?

• Low latency.

• IOPS.

• Operations quality and data model.

Go Cluster

• Redis Cluster must cover the same use cases as Redis.

• Tradeoffs are inherent in distributed systems.

• CAP? Merge values? Strong consistency and consensus? How to replicate values?

CP systems

[Diagram: a client sends a write to S1, which replicates it to S2, S3 and S4.]

CAP: the consistency price is added latency.

CP systems

[Diagram: the same write path; S1 replies to the client only after a majority of S2, S3, S4 ACKs.]

Reply to client after majority ACKs.
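A minimal sketch of this majority-ACK write path, in Python; the Node class, the ack() transport and the error string are stand-ins for illustration, not Redis APIs:

    # Quorum write sketch: reply to the client only once a majority of the
    # replicas has acknowledged the write. Blocking on the quorum is exactly
    # the added latency the slide attributes to consistency.
    def replicate_and_reply(write, nodes):
        majority = len(nodes) // 2 + 1
        acks = 0
        for node in nodes:
            if node.ack(write):          # synchronous round trip per node
                acks += 1
            if acks >= majority:         # quorum reached: safe to answer
                return "+OK"
        return "-NOQUORUM not acknowledged by a majority"

    class Node:
        def __init__(self, up=True):
            self.up = up
        def ack(self, write):
            return self.up               # a down node never ACKs

    # Five replicas, one down: 4 ACKs >= majority of 3, the write succeeds.
    print(replicate_and_reply("SET A 1",
                              [Node(), Node(), Node(up=False), Node(), Node()]))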

And… there is the disk

[Diagram: S1, S2 and S3, each with its own disk.]

CP algorithms may require fsync-before-ack. Durability and consistency are not always orthogonal.

AP systems

[Diagram: during a partition, a client writes to S1 on one side and S2 on the other.]

Eventual consistency with merges? (Note: merge is not strictly part of EC.)

[Diagram: the two sides end up with divergent values of the same set:
A = {1,2,3,8,12,13,14} on one side, A = {2,3,8,11,12,1} on the other.]
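For a set like A, a natural merge is the union (the classic G-Set CRDT merge); a minimal sketch using the two divergent values above. Note that a pure union merge cannot represent deletions:

    # Union-merge of the two divergent replicas of set A shown above.
    # Union is commutative, associative and idempotent, so replicas
    # converge to the same value no matter the merge order.
    def merge(replica_a: set, replica_b: set) -> set:
        return replica_a | replica_b

    side_1 = {1, 2, 3, 8, 12, 13, 14}
    side_2 = {2, 3, 8, 11, 12, 1}
    print(merge(side_1, side_2))   # {1, 2, 3, 8, 11, 12, 13, 14}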

Many kinds of consistency

• The "C" of CAP is strong consistency.

• It is not the only available tradeoff, of course.

• Consistency is the set of liveness and safety properties a given system provides.

• Saying "eventually consistent" alone says almost nothing: what liveness/safety properties does the system provide, if not "C"?

Redis Cluster

[Diagram: the client talks to two shards. A master and two replicas serve hash slots A,B,C; another master and two replicas serve slots D,E,F.]

Sharding and replication (asynchronous).
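As a side note on how keys map to the slot sets above: Redis Cluster hashes every key into one of 16384 slots as CRC16(key) mod 16384. A minimal, self-contained Python sketch of that mapping (CRC16-XMODEM is the variant Redis Cluster uses):

    # Redis Cluster key -> hash slot mapping: slot = CRC16(key) mod 16384.
    # Each master serves a subset of the 16384 slots.
    def crc16(data: bytes) -> int:
        """CRC16-CCITT (XMODEM): poly 0x1021, initial value 0."""
        crc = 0
        for byte in data:
            crc ^= byte << 8
            for _ in range(8):
                if crc & 0x8000:
                    crc = ((crc << 1) ^ 0x1021) & 0xFFFF
                else:
                    crc = (crc << 1) & 0xFFFF
        return crc

    def key_hash_slot(key: bytes) -> int:
        return crc16(key) % 16384

    print(key_hash_slot(b"foo"))   # 12182, as CLUSTER KEYSLOT foo reports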

Asynchronous replication

[Diagram: the client writes to the master serving A,B,C; the master replies to the client right away and propagates the write to its replicas, which ACK asynchronously.]

Full Mesh

[Diagram: all nodes of both shards are connected to each other in a full mesh, exchanging:]

• Heartbeats.

• Nodes gossip.

• Failover auth.

• Config update.

No proxy, but redirections

[Diagram: six masters serve slots A,B,C / D,E,F / G,H,I / L,M,N / O,P,Q / R,S,T. Clients asking "A?" or "D?" are redirected to the node serving that slot.]
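A sketch of what "redirections instead of a proxy" means on the client side. The -MOVED error carries the slot and the address of the right node; send_command() is a hypothetical transport function, not a real client API:

    # Client-side redirection handling: a node that does not serve the
    # key's slot replies "-MOVED <slot> <host>:<port>", and the client
    # retries against the indicated node (a smart client also caches
    # the slot -> node mapping it learns this way).
    def execute(send_command, node, *args, max_redirects=5):
        for _ in range(max_redirects):
            reply = send_command(node, *args)
            if isinstance(reply, str) and reply.startswith("MOVED"):
                _, _slot, addr = reply.split()  # e.g. "MOVED 3999 127.0.0.1:6381"
                node = addr                     # follow the redirection
                continue
            return reply
        raise RuntimeError("too many -MOVED redirections")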

Failure detection

• Failure reports are collected within a window of time (via gossip).

• They are the trigger for the actual failover.

• Two main states: PFAIL -> FAIL.

Failure detection

[Diagram: S1 stops responding; S2, S3 and S4 each independently flag S1 = PFAIL.]

Failure detection

[Diagram: the PFAIL state propagates via gossip; S3 now sees S1 = PFAIL reported by S2 and S4 as well.]

Failure detection

[Diagram: having collected PFAIL reports from a majority of masters, S3 promotes S1 = PFAIL to S1 = FAIL.]

Failure detection

[Diagram: S3 forces the FAIL state cluster-wide: every node now has S1 = FAIL.]
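A sketch of the promotion rule illustrated in the four steps above, with invented data structures: PFAIL is a purely local suspicion, and FAIL requires PFAIL reports from a majority of masters within a window of time (the window constant here is a stand-in, not the real cluster parameter):

    import time

    FAIL_REPORT_WINDOW = 10.0   # seconds; stand-in for the real validity window

    def evaluate_fail_state(reports, n_masters, now=None):
        """reports: {reporting_master: timestamp of its PFAIL report}."""
        now = now if now is not None else time.time()
        fresh = [t for t in reports.values() if now - t <= FAIL_REPORT_WINDOW]
        majority = n_masters // 2 + 1
        return "FAIL" if len(fresh) >= majority else "PFAIL"

    # S3's own PFAIL flag plus the gossip reports from S2 and S4 reach the
    # majority (3 of 4 masters), so S1 is promoted to FAIL.
    now = time.time()
    print(evaluate_fail_state({"S2": now, "S3": now, "S4": now}, n_masters=4))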

Global slots config

• A master FAIL state triggers a failover.

• Cluster needs a coherent view of configuration.

• Who is serving this slot currently?

• Slots config must eventually converge.

Raft and failover

• Config propagation is solved using ideas from the Raft algorithm (just a subset).

• Raft is a consensus algorithm built on top of different "layers".

• The Raft paper is already a classic (highly recommended).

• Full Raft is not needed for Redis Cluster slots config.

Failover and config

[Diagram: one master has failed. Its slave increments the epoch (Epoch = Epoch+1, a logical clock) and asks the remaining masters: "Vote for me!"]
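A toy model of the election above. The class and method names are invented for illustration, but the rules follow the slide: the slave bumps the epoch, each master grants at most one vote per epoch, and a majority of masters elects the slave:

    # Raft-inspired failover election sketch.
    def request_failover(slave_epoch, masters):
        epoch = slave_epoch + 1                    # Epoch = Epoch + 1
        votes = sum(1 for m in masters if m.grant_vote(epoch))
        elected = votes >= len(masters) // 2 + 1   # majority of masters
        return epoch, elected

    class Master:
        def __init__(self):
            self.last_voted_epoch = 0
        def grant_vote(self, epoch):
            if epoch > self.last_voted_epoch:      # at most one vote per epoch
                self.last_voted_epoch = epoch
                return True
            return False

    masters = [Master(), Master(), Master()]
    print(request_failover(5, masters))            # (6, True): slave elected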

Too easy?

• Why don't we need full Raft?

• Because our config is idempotent: when the partition heals, stale slots configs can simply be thrown away in favor of newer versions.

• The same algorithm is used in Sentinel v2 and works well.

Config propagation

• After a successful failover, the new slot config is broadcast.

• If there are partitions, the config gets updated when they heal (it is broadcast from time to time, plus stale-config detection and UPDATE messages).

• The config with the greater Epoch always wins.
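The last rule is easy to express in code. A sketch with invented data structures: on receiving a slot claim (via broadcast or an UPDATE message), a node keeps, per slot, the claim carried by the highest epoch:

    # "Greater epoch always wins" slot config update rule.
    slots = {}   # slot -> (epoch, master_id)

    def update_slot(slot, epoch, master_id):
        current = slots.get(slot)
        if current is None or epoch > current[0]:   # newer config wins
            slots[slot] = (epoch, master_id)

    update_slot(100, epoch=3, master_id="M1")
    update_slot(100, epoch=5, master_id="M2")   # failover: new config wins
    update_slot(100, epoch=4, master_id="M1")   # stale config: ignored
    print(slots[100])                           # (5, 'M2')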

Redis Cluster consistency?

• Eventually consistent: the last failover wins.

• In the "vanilla" design, the amount of lost writes is unbounded.

• There are mechanisms to avoid unbounded data loss.

Failure mode… #1

[Diagram: the client writes to the master serving A,B,C; the master fails before the write reaches its replicas; a replica is promoted, and the write is lost.]

Failure mode #2

[Diagram: a partition leaves the client and the master serving A,B,C on the minority side, while the masters serving D,E,F and G,H,I are on the majority side; writes accepted on the minority side can be lost.]

Bound divergences

[Diagram: after node-timeout, the minority-side master stops accepting writes, so its divergence from the majority side is bounded.]

More data safety?

• OP logging until the async ACK is received.

• Re-played to the master when the node turns into a slave.

• "Safe" connections, on demand.

• Example: SADD (idempotent + commutative).

• SET-LWW foo bar <wall-clock>.
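SET-LWW in the last bullet is a proposed command on the slide, not a shipped one; a sketch of the last-write-wins merge it implies, using the wall clock as the tie-breaker:

    # Last-write-wins register merge: each write carries a wall-clock
    # timestamp, and on conflict the newer write is kept.
    def lww_merge(a, b):
        """a, b: (value, wall_clock_timestamp); keep the newer write."""
        return a if a[1] >= b[1] else b

    side_1 = ("bar", 1415112000.0)    # SET-LWW foo bar <wall-clock>
    side_2 = ("baz", 1415112007.5)
    print(lww_merge(side_1, side_2))  # ('baz', ...): last write wins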

Multi key ops

• Hash tags!

• {user:1000}.following {user:1000}.followers.

• Unavailable for small windows, but no data exchange between nodes is needed.
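A sketch of the hash tag rule: when a key contains a non-empty {...} section, only the text between the first '{' and the following '}' is hashed, which is why both keys above land in the same slot and can be used together in multi-key ops:

    # Hash tag extraction, as applied before the CRC16 slot hashing
    # sketched earlier.
    def hash_tag(key: str) -> str:
        start = key.find("{")
        if start != -1:
            end = key.find("}", start + 1)
            if end > start + 1:        # only a non-empty tag counts
                return key[start + 1:end]
        return key                     # no tag: hash the whole key

    print(hash_tag("{user:1000}.following"))   # user:1000
    print(hash_tag("{user:1000}.followers"))   # user:1000 -> same slot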

Multi key ops (availability)

• Single key ops: always available during resharding.

• Multi key ops, available if:

• No manual resharding of this hash slot in progress.

• Resharding in progress, but the source or destination node has all the keys.

• Otherwise we get a -TRYAGAIN error.

[Diagram:

{User:1}.key_A {User:2}.Key_B → SUNION key_A key_B → -TRYAGAIN

{User:1}.key_A {User:1}.Key_B → SUNION key_A key_B → … output …

{User:1}.key_A {User:1}.Key_B → SUNION key_A key_B → … output …]
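A sketch of how a client might handle -TRYAGAIN, assuming the same hypothetical send_command() transport as before: since the keys are only split between source and destination node while a slot migration is in flight, backing off briefly and retrying usually succeeds:

    import time

    # Retry loop for multi-key ops hit by -TRYAGAIN during a resharding.
    def execute_multikey(send_command, node, *args, retries=10, delay=0.1):
        for _ in range(retries):
            reply = send_command(node, *args)
            if isinstance(reply, str) and reply.startswith("TRYAGAIN"):
                time.sleep(delay)      # the slot migration should finish soon
                continue
            return reply
        raise RuntimeError("keys still split after resharding retries")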

Redis Cluster ETA

• Release Candidate available.

• We’ll go stable in Q1 2015.

• Ask me anything.