Best Practices: Managing a Healthy Couchbase Server Deployment: Couchbase Connect 2015

26
MANAGING A HEALTHY COUCHBASE SERVER DEPLOYMENT Alex Ma | Principal Solutions Engineer | [email protected]

Transcript of Best Practices: Managing a Healthy Couchbase Server Deployment: Couchbase Connect 2015

MANAGING A HEALTHY COUCHBASE SERVER DEPLOYMENT

Alex Ma | Principal Solutions Engineer | [email protected]

©2015 Couchbase Inc. 2

Whats Available?

What does all of that mean??????

External Monitoring systems

Agenda

Whats Available?

©2015 Couchbase Inc. 4

High Level Command line statistics

©2015 Couchbase Inc. 5

High Level Command line statistics

CBStats command – ALL 281 Statistics available EP-engine specific metrics Key Value format Cumulative and Snapshot metrics Per bucket – node level or cluster wide. Integrates with traditional Monitoring systems

©2015 Couchbase Inc. 6

Lower Level Statistics

©2015 Couchbase Inc. 7

Lower Level Statistics

CBStats command - TIMINGS Histogram format Better for real time analysis

©2015 Couchbase Inc. 8

Stats through REST

©2015 Couchbase Inc. 9

Stats through REST

Metrics for all components available. JSON Format Last 60 Measurements returned by default Snapshot metrics

©2015 Couchbase Inc. 10

Web Console Stats

©2015 Couchbase Inc. 11

Web Console Stats

Best for real time troubleshooting Cluster or node specific statistics Summarized Logging Some alerting Details on Statistics

©2015 Couchbase Inc. 12

System Level Statistics

What does it all mean?

©2015 Couchbase Inc. 14

Application Performance

1 2 3

4 5

©2015 Couchbase Inc. 15

Application Performance

General workload Resident Percentage High and Low

watermarks Cache Miss Ratio Disk reads per second Temp OOM

©2015 Couchbase Inc. 16

DISK

Disk Space Compaction Rebalance

Disk IO Sustained vs Peak load Disk write queue and

Backoffs Drain vs Fill rates

©2015 Couchbase Inc. 17

DISK

Disk Space Compaction Rebalance

Disk IO Sustained vs Peak load Disk write queue and

Backoffs Drain vs Fill rates

©2015 Couchbase Inc. 18

DISK

Disk Space Compaction Rebalance

Disk IO Sustained vs Peak load Disk write queue and

Backoffs Drain vs Fill rates

©2015 Couchbase Inc. 19

DISK

Disk Space Compaction Rebalance

Disk IO Sustained vs Peak load Disk write queue and

Backoffs Drain vs Fill rates

Safety

‘Active vBuckets’ count across all the servers should be

equal to “1024”

‘Replica vBuckets’ count across all the servers should

be equal to “1024 * <num of replica’s configured>”

©2015 Couchbase Inc. 21

Network

Connections Current Usage/Limits Connections in

CLOSE_WAIT

Network utilization

©2015 Couchbase Inc. 22

Network

Connections Current Usage/Limits Connections in

CLOSE_WAIT

Network utilization

External Monitoring Systems

©2015 Couchbase Inc. 24

External Monitoring Systems

Questions?

Thank you.