Fine-grained Monitoring

Fine-grained Monitoring at HP. Mark Seger, Hewlett Packard Cloud Services, 4/19/2013

Description

One of the great challenges of monitoring any large cluster is deciding how much data to collect and how often to collect it. Those responsible for managing the cloud infrastructure want to see everything collected centrally, which places limits on how much and how often. Developers, on the other hand, want to see as much detail as they can, at as high a frequency as is reasonable, without impacting overall cloud performance. To address these seemingly conflicting requirements, we've chosen a hybrid model at HP. Like many others, we have a centralized monitoring system that records a set of key system metrics for all servers at a granularity of one minute, but at the same time we do fine-grained local monitoring of hundreds of metrics every second on each server, so when a problem needs more detail than is available centrally, one can go to the servers in question and see exactly what was going on at any specific time. The tool of choice for this fine-grained monitoring is the open source tool collectl, which additionally has an extensible API. It is through this API that we've developed a Swift monitoring capability that not only captures the number of GETs, PUTs, etc. every second but, using collectl's colmux utility, can also display them in a top-like format to show exactly what all the object and/or proxy servers are doing in real time. We've also developed a second capability that lets one see what the virtual machines on each compute node are doing in terms of CPU, disk and network traffic. This data can also be displayed in real time with colmux. This talk will briefly introduce the audience to collectl's capabilities but, more importantly, show how it's used to augment any existing centralized monitoring infrastructure. Speaker: Mark Seger

Transcript of Fine-grained Monitoring

Page 1: Fine grained monitoring

Fine-grained Monitoring at HP

Mark Seger

Hewlett Packard

Cloud Services

4/19/2013

Page 2: Fine grained monitoring

Agenda

• What is the problem we’re trying to solve

• Introduction to collectl

• Monitoring Swift & Glance

• Monitoring VMs


Page 3: Fine grained monitoring

Conflicting Problem Statements

• (Ks of nodes) X (hundreds of metrics); see the back-of-envelope sketch after this list

– And you want to centrally monitor them how often?

– And store them in a database for future mining?

– Politely pause for laughter…

• Reality

– Choose a frequency on the order of a minute or more

– Don’t collect all data at the same time

– Don’t collect everything

• BUT when problems arise

– Granularity measured in minutes is too coarse

– If samples aren’t taken at the same time, how do you correlate?

– There’s almost never enough detailed data to provide answers
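
To make the scale concrete, here is a rough back-of-envelope calculation; the node and metric counts below are illustrative round numbers, not figures from the talk:

nodes = 2000                     # "Ks of nodes" (assumed round number)
metrics = 200                    # "hundreds of metrics" (assumed round number)
per_second = nodes * metrics     # 400,000 samples/sec if everything arrived every second
per_day_1s = per_second * 86400  # 34,560,000,000 samples/day at 1-second resolution
per_day_60s = per_day_1s // 60   # 576,000,000 samples/day at 1-minute resolution
print(f"{per_second:,}/s  {per_day_1s:,}/day @1s  {per_day_60s:,}/day @60s")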


Page 4: Fine grained monitoring

Isn’t the solution obvious?

• Use one central tool and another local tool

• But this has its own problems as well

– Will the data cross-correlate? Hopefully…

– What about customizations?

– Do you really want to collect the same data twice?

• At HP we’ve chosen a hybrid model

– Use a lightweight local data collection tool, accepting some redundancy (sketched below)

• Collect cpu, disk, net & mem every second; key processes every 5 seconds

• Extend it to add OpenStack monitoring capabilities

• Send subset/updates to centralized monitor every minute or more

– Central tool: collectd; local tool: collectl (no relation!)
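
As a loose illustration of the hybrid model (not the internals of collectl or collectd), the sketch below keeps per-second samples on the node and forwards only a one-minute rollup; read_local_metrics and send_to_central are placeholders:

import time

def read_local_metrics():
    # Placeholder for a real per-second sample (cpu, disk, net, mem, ...).
    return {"cpu_busy_pct": 12.5}

def send_to_central(summary):
    # Placeholder for whatever ships the per-minute subset upstream.
    print("central <-", summary)

window = []                                # fine-grained samples kept on the node
while True:
    window.append(read_local_metrics())    # every second: full detail, stored locally
    if len(window) >= 60:                  # once a minute: forward only a small summary
        avg = sum(s["cpu_busy_pct"] for s in window) / len(window)
        send_to_central({"cpu_busy_pct_avg": round(avg, 1)})
        window.clear()
    time.sleep(1)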


Page 5: Fine grained monitoring

Introduction to collectl

• Open source tool, in use for many years

• Roots in HPC, so knows how to be efficient

– Think of it as SAR on steroids

– Can monitor at sub-second intervals when needed

– Synchronizes samples across the cluster to microseconds for correlation

– Can generate data in plottable formats

– Can record data locally and send over a socket

• And do it at different frequencies!

• Can also write to a snapshot file which is what we’re doing

– Has an API for extensibility

– Several utilities for plotting and real-time cluster monitoring


Page 6: Fine grained monitoring

Summary Mode Formats

Brief: collectl

#<-------CPU--------><-----------Disks-----------><-----------Network--------->

#cpu sys inter ctxsw KBRead Reads KBWrit Writes netKBi pkt-in netKBo pkt-out

10 9 206 94 0 0 0 0 0 1 0 0

26 26 183 80 0 0 1279 27 18 78 12 37

27 27 396 70 0 0 31597 275 0 6 0 5

9 9 341 71 0 0 32629 274 4 43 0 2

Verbose: collectl --verbose

### RECORD 3 >>> cag-dl380-01 <<< (1176471932.010) (Fri Apr 13 09:45:32 2007) ###

# CPU SUMMARY (INTR, CTXSW & PROC /sec)

# User Nice Sys Wait IRQ Soft Steal Idle Intr Ctxsw Proc RunQ Run Avg1 Avg5 Avg15

0 0 0 0 0 0 0 99 1070 206 4 246 0 0.10 0.03 0.01

# DISK SUMMARY (/sec)

#KBRead RMerged Reads SizeKB KBWrite WMerged Writes SizeKB

0 0 0 0 0 0 0 0

# NETWORK SUMMARY (/sec)

# KBIn PktIn SizeIn MultI CmpI ErrIn KBOut PktOut SizeO CmpO ErrOut

4 32 133 0 0 0 28 31 926 0 0


Page 7: Fine grained monitoring

Detail Mode Format

# SINGLE CPU STATISTICS

# Cpu User Nice Sys Wait IRQ Soft Steal Idle

0 2 0 0 0 0 0 0 98

1 0 0 0 0 0 0 0 100

2 0 0 0 0 0 0 0 100

3 2 0 0 0 0 0 0 98

# DISK STATISTICS (/sec)

# <---------reads---------><---------writes---------><--------averages--------> Pct

#Name KBytes Merged IOs Size KBytes Merged IOs Size RWSize QLen Wait SvcTim Util

sda 0 0 0 0 0 0 0 0 0 0 0 0 0

sdb 0 0 0 0 0 0 0 0 0 0 0 0 0

sdc 0 0 0 0 0 0 0 0 0 0 0 0 0

sdd 0 0 0 0 0 0 0 0 0 0 0 0 0

hda 0 0 0 0 0 0 0 0 0 0 0 0 0

# NETWORK STATISTICS (/sec)

#Num Name KBIn PktIn SizeIn MultI CmpI ErrIn KBOut PktOut SizeO CmpO ErrOut

0 lo: 0 0 0 0 0 0 0 0 0 0 0

1 eth0: 1 11 144 0 0 0 1 7 257 0 0

2 eth1: 0 1 64 0 0 0 0 1 64 0 0

3 sit0: 0 0 0 0 0 0 0 0 0 0 0

collectl -sCDN

Page 8: Fine grained monitoring

Let’s talk about Swift/Glance monitoring

• The real question is: what metrics do they expose?

– Track GETs, PUTs, etc.

– Include object sizes and timings

– Also provides error codes/text

• Tail the Swift logs and write rolling counters every second; a sketch of this idea follows the sample output below

– Operation types

– Object size and network bandwidth histograms, though bandwidth can be misleading

• Also generate hourly/daily summaries, retaining 1 week’s worth

• Same utility also knows how to parse Glance logs

– Separates tracking of metadata operations

cat /var/log/perf/ops/opscount.txt

get: 29452211 put: 84775192 del: 12433208 post: 65666 pat: 0 head: 28489510 e4xx: 174473 e500: 4774

get 25679667 547036 58147 28824 234 8839362141 23961566 717292 124028 95240 69479 35804 12355 5796 6814 8794

put 49048442 489078 123922 41085 715 12633635902 36213403 120714 10620 3815 4293 3666 6570 4030 188 0
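
A minimal sketch of the log-tailing idea, assuming a simplified log line containing the HTTP method, a status code and a byte count (the real Swift proxy log format is richer) and omitting the size histograms; the paths mirror the file shown above:

import re, time

LOG = "/var/log/swift/proxy.log"        # assumed log location
OUT = "/var/log/perf/ops/opscount.txt"  # rolling totals, as in the example above
counts = {"get": 0, "put": 0, "del": 0, "head": 0, "e4xx": 0, "e500": 0}
verb = {"GET": "get", "PUT": "put", "DELETE": "del", "HEAD": "head"}

# Assumed simplified pattern: "... GET /v1/AUTH_x/c/obj ... 200 ..."
line_re = re.compile(r"\s(GET|PUT|DELETE|HEAD)\s\S+\s.*?\s(\d{3})\s")

with open(LOG) as f:
    f.seek(0, 2)                        # start at the end of the file, like tail -f
    next_flush = time.time() + 1
    while True:
        line = f.readline()
        if line:
            m = line_re.search(line)
            if m:
                method, status = m.groups()
                counts[verb[method]] += 1
                if status.startswith("4"): counts["e4xx"] += 1
                if status.startswith("5"): counts["e500"] += 1   # loosely: any 5xx
        else:
            time.sleep(0.05)            # nothing new yet
        if time.time() >= next_flush:   # rewrite the rolling totals once a second
            with open(OUT, "w") as out:
                out.write(" ".join(f"{k}: {v}" for k, v in counts.items()) + "\n")
            next_flush += 1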


Page 9: Fine grained monitoring

Collectl plugins

• Use collectl’s import API (sketched below)

– Read the opscount file every monitoring interval, logging to disk

– Can also be used interactively

– Most importantly, supported by collectl utilities

• Use collectl’s export API to write to a local file every minute

– Aligned to the top of the minute to avoid RRD messing with the data

• Use collectd’s putval capability to upload when the file changes
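
Collectl’s import plugins are actually written in Perl against its API; purely to illustrate the logic, here is a Python sketch that re-reads the counters file each interval and turns the monotonically increasing totals into per-second rates:

import time

OPSFILE = "/var/log/perf/ops/opscount.txt"  # file shown on the previous slide
INTERVAL = 1.0                              # seconds, like the fine-grained interval

def read_totals():
    # First line looks like: "get: 29452211 put: 84775192 del: 12433208 ..."
    fields = open(OPSFILE).readline().split()
    return {fields[i].rstrip(":"): int(fields[i + 1]) for i in range(0, len(fields), 2)}

prev = read_totals()
while True:
    time.sleep(INTERVAL)
    cur = read_totals()
    rates = {op: (cur[op] - prev[op]) / INTERVAL for op in cur}
    print(" ".join(f"{op}/s={r:.0f}" for op, r in rates.items()))
    prev = cur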


Page 10: Fine grained monitoring

Here’s what we can see locally

# <----CPU[HYPER]-----><----------Network----------><-------Ops-------->

#Time cpu sys inter ctxsw KBIn PktIn KBOut PktOut GKBs PKBs Ops Errs

15:18:16 0 0 214 131 28 53 77 90 0 64 1 0

15:18:17 0 0 556 505 57 171 39 175 0 22 2 0

15:18:18 0 0 390 13776 143 223 614 522 0 0 0 0

15:18:19 0 0 292 12881 3 31 3 27 0 86 7 0

Note: you can mix and match with any standard collectl data in brief mode

# <--------------------operations-------------------->

#Time GetKB PutKB Gets Puts Dels Post Pats Head E4xx E500

15:20:36 0 0 1 2 1 0 0 0 0 0

15:20:37 0 64 1 3 1 0 0 0 0 0

15:20:38 0 0 0 0 0 0 0 0 0 0

15:20:39 0 0 2 0 1 0 0 0 0 0

Lots more detail in verbose mode

# <----------------- network gets up to-----------------><----------------- network puts up to---

# 0MB 10MB 20MB 30MB 40MB 50MB 60MB 70MB 80MB 90MB 100M 0MB 10MB 20MB 30MB 40MB 50MB 60MB 70MB

0 0 0 0 0 0 0 0 0 0 0 64 2 0 0 0 0 0 0

0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0

0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

# OPS SUMMARY (/sec)

# <----------gets---------><----------puts--------->

# 0MB 1MB 10MB 100M 1GB 0MB 1MB 10MB 100M 1GB

0 0 0 0 0 1 0 0 0 0

0 0 0 0 0 2 0 0 0 0

0 0 0 0 0 1 0 0 0 0

0 0 0 0 0 0 0 0 0 0


Page 11: Fine grained monitoring

What about KVM?

• Uses several collectl plugins; a sketch of the command-line parsing idea follows the example below

– Collectl tells us the command line used to start each process

• Parses the line for the instance ID and MAC address

• Another collectl plugin tells us the MAC -> vnet name mapping

– Collectl also tracks I/O for each process

– Another to monitor our block storage service

– Can use nova-manage to look up user info by instance

• This data also sent to collectd once/minute

segerm@nv-aw2az1-compute0004:~$ sudo collectl --import vnet:bockc --export kvmsum

# PROCESS SUMMARY (counters are /sec)

# PID THRD S SysT UsrT Pct N AccumTim BckI BckO DskI DskO NetI NetO Instance UserID BockServer(s)

13273 6 S 0.01 0.05 1 4 15:39:28 0 0 0 12 0 0 000d5cc3 31689020408812

18093 14 S 0.02 0.26 7 4 41:15:00 0 0 0 0 0 0 000e33d7 29387151913164

19517 1 S 0.00 0.01 1 1 01:39:54 0 0 0 0 0 0 000e23e1 84575604886783

30287 1 S 0.00 0.00 0 1 05:30:11 0 0 0 0 0 0 000bf753 22420103441357 10.8.14.129

30739 2 S 0.00 0.00 0 2 12:18:42 0 0 0 0 0 0 00061147 30248174159870
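
As a rough sketch of the command-line parsing step only (the -name flag and mac= substring are typical of libvirt-launched qemu/kvm processes, but the exact arguments vary by deployment), one could walk /proc like this:

import os, re

def kvm_guests():
    # Yield (pid, instance name, MACs) for qemu/kvm processes found in /proc.
    for pid in filter(str.isdigit, os.listdir("/proc")):
        try:
            raw = open(f"/proc/{pid}/cmdline", "rb").read()
        except OSError:
            continue                              # process went away or unreadable
        argv = [a.decode(errors="replace") for a in raw.split(b"\0") if a]
        if not argv or ("qemu" not in argv[0] and "kvm" not in argv[0]):
            continue
        name = argv[argv.index("-name") + 1] if "-name" in argv[:-1] else ""
        macs = re.findall(r"mac=([0-9a-fA-F:]{17})", " ".join(argv))
        yield int(pid), name, macs

for pid, name, macs in kvm_guests():
    print(pid, name, macs)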


Page 12: Fine grained monitoring

Using the cloud to monitor the cloud

• Each morning slightly after midnight

– Ask each node to generate a summary of yesterday

– Do a parallel copy to pull the summaries back to a central node (sketched below)

– Also do a parallel copy of the plottable data

– Generate a set of 24-hour plots in batch; slow but worth it

• Investigating parallelizing some of this too
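
A small sketch of the parallel pull, using scp in a thread pool; the node names and paths here are hypothetical placeholders:

import subprocess
from concurrent.futures import ThreadPoolExecutor
from datetime import date, timedelta

yesterday = (date.today() - timedelta(days=1)).strftime("%Y%m%d")
nodes = [f"sw-aw2az1-proxy{n:03d}" for n in range(1, 15)]   # hypothetical node list

def pull(node):
    src = f"{node}:/var/log/perf/summary-{yesterday}.txt"   # hypothetical remote path
    dst = f"/data/summaries/{node}-{yesterday}.txt"
    return node, subprocess.run(["scp", "-q", src, dst]).returncode

with ThreadPoolExecutor(max_workers=16) as pool:
    for node, rc in pool.map(pull, nodes):
        if rc != 0:
            print(f"copy from {node} failed (rc={rc})")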


Page 13: Fine grained monitoring

Currently, a very crude prototype

Daily numbers


Page 14: Fine grained monitoring

Error Counts and Bandwidth Too


Page 15: Fine grained monitoring

Hyperlinks to exact error text


Page 16: Fine grained monitoring

Can even get operations by node by hour


Page 17: Fine grained monitoring

colplot

• Web-based plotting tool

• If collectl can collect it, colplot can plot it


Page 18: Fine grained monitoring

Links to plots…


Page 19: Fine grained monitoring

Operations and PUT histograms

Note: the samples are at 1-second intervals and the plots are only 10KB each


Page 20: Fine grained monitoring

Collectl Multiplexor: colmux

• Think of it as collectl top-anything; a sketch of the underlying idea follows the examples below

– Including any plugins

– Runs collectl in real-time against set of nodes

– Sorts by any column and can dynamically change with arrow keys

– 2 different output formats

• Can also play back historical data for diagnostic analysis

# OPS SUMMARY (/sec) Fri Mar 22 15:45:05 2013 Connected: 14 of 14

# <--------------------operations-------------------->

#Host GetKB PutKB Gets Puts Dels Post Pats Head E4xx E500

sw-aw2az1-proxy014 0 224 0 4 0 0 0 0 0 0

sw-aw2az1-proxy010 0 166 0 3 0 0 0 0 0 0

sw-aw2az1-proxy009 0 100 0 2 0 0 0 0 0 0

sw-aw2az1-proxy005 0 77 0 2 0 0 0 0 0 0

Time 001 003 004 005 007 008 | 001 003 004 005 007 008 | Get Put

16:02:45 0 0 0 1 0 0 | 0 23 0 0 0 0 | 1 23

16:02:46 0 0 0 0 0 0 | 0 18 0 0 0 0 | 0 18

16:02:47 0 0 0 0 0 0 | 0 23 3 0 3 1 | 0 30

16:02:48 0 0 0 0 0 0 | 1 19 15 2 0 0 | 0 37

16:02:49 0 0 0 0 0 0 | 1 10 19 0 2 1 | 0 33
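
colmux itself ships with the collectl-utils package; purely to illustrate the merge-and-sort idea behind its top mode, the sketch below runs one sampling command per host over ssh each interval and prints the rows sorted on a chosen column. The host list, command and sort column are placeholders:

import subprocess, time
from concurrent.futures import ThreadPoolExecutor

HOSTS = ["node001", "node002", "node003"]      # placeholder host list
CMD = "cat /var/log/perf/ops/opscount.txt"     # placeholder per-host sampling command
SORT_COL = 1                                   # whitespace-separated field to sort on

def sample(host):
    out = subprocess.run(["ssh", host, CMD], capture_output=True, text=True).stdout
    return host, out.split()                   # flatten the snapshot into fields

with ThreadPoolExecutor(len(HOSTS)) as pool:
    while True:
        rows = list(pool.map(sample, HOSTS))
        # Sort descending on the chosen numeric column, like colmux's top mode.
        rows.sort(key=lambda r: float(r[1][SORT_COL]) if len(r[1]) > SORT_COL else 0.0,
                  reverse=True)
        print("\033[2J\033[H", end="")         # clear the screen for a top-like refresh
        for host, fields in rows:
            print(host, " ".join(fields[:10]))
        time.sleep(1)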


Page 21: Fine grained monitoring

Monitoring 192 nodes

(Figure annotations: Idle Nodes, CPU Burst, Very Busy, Erratic)

You don’t even have to be able to read the output to see what’s happening


Page 22: Fine grained monitoring

Questions?
