ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella
description
Transcript of ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella
![Page 1: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/1.jpg)
ch 10 Monitoring
NCNU CSIE 97321012 林似真 Stella
![Page 2: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/2.jpg)
NCNU CSIE Stella2010/6/8 2
ganglia
![Page 3: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/3.jpg)
NCNU CSIE Stella2010/6/8 3
[ gæ gl ] Kˈ ŋ ɪəK
![Page 4: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/4.jpg)
NCNU CSIE Stella2010/6/8 4
![Page 5: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/5.jpg)
NCNU CSIE Stella2010/6/8 5
outline● WHY?● HOW?● WHAT?
● Ganglia● Distributed Monitorin
g System● Ganglia Architecture
– gmond– gmetad– web front-end
● Monitor Intersection● Ganglia Data Flow● Distribution and
Install● Example
● Reference● Q&A
![Page 6: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/6.jpg)
NCNU CSIE Stella2010/6/8 6
WHY?
![Page 7: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/7.jpg)
NCNU CSIE Stella2010/6/8 7
Why to Monitor?
● 系統狀態の監控● 解決系統の問題● 系統或應用程式の調校● 系統升級時の依據
![Page 8: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/8.jpg)
NCNU CSIE Stella2010/6/8 8
HOW?
![Page 9: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/9.jpg)
NCNU CSIE Stella2010/6/8 9
How to Monitor?
● 單一主機の監控● 分散式監控系統
![Page 10: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/10.jpg)
NCNU CSIE Stella2010/6/8 10
How to Monitor?
● 單一主機の監控● commands : top, free, uptime, ps● Log file● Shell scripts● MRTG● ...etc. 分散式監控系統
![Page 11: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/11.jpg)
NCNU CSIE Stella2010/6/8 11
How to Monitor?
● 分散式監控系統● Network utilization● CPU utilization● Hardware● System load● Disk utilization● ...etc.
![Page 12: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/12.jpg)
NCNU CSIE Stella2010/6/8 12
Ganglia?
● Nagios ←→ Ganglia● graph display → easy to see
● show host metrics● visualize trends
![Page 13: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/13.jpg)
NCNU CSIE Stella2010/6/8 13
WHAT?
![Page 14: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/14.jpg)
NCNU CSIE Stella2010/6/8 14
WHAT
IS
GANGLIA?
![Page 16: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/16.jpg)
NCNU CSIE Stella2010/6/8 16
Ganglia
● distributed monitoring system● monitor tool for clusters and grids● view live or historical statistics
● CPU load averages● network utilization
● long-term trends● graph display
![Page 17: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/17.jpg)
NCNU CSIE Stella2010/6/8 17
WHAT
IS
Distributed System
?
![Page 18: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/18.jpg)
NCNU CSIE Stella2010/6/8 18
Distributed Monitoring System
● Components : 1.daemons that reside on cluster nodes
2.server that collects clusters state information from nodes
3.GUI-based front-end, which provides system activity visulization
→ gmond
→ gmetad
→ web front-end
as for Ganglia components :
![Page 19: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/19.jpg)
NCNU CSIE Stella2010/6/8 19
Ganglia Architecture
![Page 20: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/20.jpg)
NCNU CSIE Stella2010/6/8 20
gmond
● Ganglia monitoring daemon● Multithreaded ● Responsibilities :
1.Monitor changes in host states
2.Announce relevant changes
3.Listen to the state of all the other ganglia nodes via a unicast or multicast channel
4.Answer requests for an XML description of the cluster state.
![Page 21: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/21.jpg)
NCNU CSIE Stella2010/6/8 21
gmetad
● Ganglia meta daemon● Aggregate data, information● steps :
1.Poll gmond, clusters, or other gmetad
2.Parse the collected XML
3.Save numeric metrics in RRD(round-robin database)
4.Export aggregated XML over TCP to clients
![Page 22: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/22.jpg)
NCNU CSIE Stella2010/6/8 22
web front-end
● web interface written in PHP● Graph generated by gmetad displays histor
ical information● highly dynamic
![Page 23: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/23.jpg)
NCNU CSIE Stella2010/6/8 23
monitor interaction
● Gmond runs on each cluster node; gmetad can fail over between nodes.
![Page 24: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/24.jpg)
NCNU CSIE Stella2010/6/8 24
![Page 25: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/25.jpg)
NCNU CSIE Stella2010/6/8 25
● Distribution and Install
![Page 26: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/26.jpg)
NCNU CSIE Stella2010/6/8 26
example
● official website demo ● wikimedia demo
![Page 27: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/27.jpg)
NCNU CSIE Stella2010/6/8 27
Refernce
![Page 28: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/28.jpg)
NCNU CSIE Stella2010/6/8 28
Reference
● Ganglia HowTo ● Ganglia official website● wikipedia/Ganglia
![Page 29: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/29.jpg)
NCNU CSIE Stella2010/6/8 29
Q & A
![Page 30: ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella](https://reader033.fdocuments.net/reader033/viewer/2022061614/568148c2550346895db5e083/html5/thumbnails/30.jpg)
NCNU CSIE Stella2010/6/8 30
Fin.Now You Can Rest Easy (In Peace ?).