INFSO-RI-508833 Enabling Grids for E-sciencE An introduction to EGEE Guy Warner NeSC Edinburgh.
INFSO-RI-508833 Enabling Grids for E-sciencE EGEE 1 st EU Review – 9 th to 11 th February 2005...
Transcript of INFSO-RI-508833 Enabling Grids for E-sciencE EGEE 1 st EU Review – 9 th to 11 th February 2005...
INFSO-RI-508833
Enabling Grids for E-sciencE
www.eu-egee.org
<LCG-EGEE Operations>
EGEE 1st EU Review – 9th to 11th February 2005CERN<Piotr Nyczyk, Hélène Cordier><CIC-ON-DUTY, SA1 >
LCG-EGEE Operations 2
Enabling Grids for E-sciencE
INFSO-RI-508833
Content
• SA1 – Core Infrastructure Centers – Definition– Operational tool
• Cic Operations– Cic-on-duty
Definition Procedure Operations
– Monitoring tools
• Scenarios of escalation– Severity– Deadline Expiration
• Next steps
LCG-EGEE Operations 3
Enabling Grids for E-sciencE
INFSO-RI-508833
– Operate essential grid services and act as Grid Operation Center
– Objectives Transparency Information sharing - 24x7 Troubleshooting in conjonction with ROCs
– Current state of operations Procedures defined and used Monitoring tools and in-depth testing Communication tool Problem Tracking Tool
CIC
[This sketch has been provided by Pierre Girard]
[This map has been provided by Matt Thorpe]
CIC
CICCIC
CICCIC
CICCIC
CICCIC
CICCIC
RCRC
RCRC RCRC
RCRC
RCRC
ROCROC
RCRC
RCRC
RCRCRCRC
RCRCRCRC
ROCROC
RCRC
RCRC RCRC
RCRC
RCRC
ROCROC
RCRC
RCRC
RCRC
RCRC
ROCROC
RCOMCOMC
LCG-EGEE Operations 4
Enabling Grids for E-sciencE
INFSO-RI-508833
CIC Web Portalhttp://cic.in2p3.fr/
• Objectives of CIC portal
– Centralized tool for Egee actors to use Functional needs
– Communication tool for inter-operability Needs and feedback CICs pro-active resolutions
– Provide a repository Knowledge: configurations,
published sites data, Faqs Procedures and processes Existing tools
LCG-EGEE Operations 5
Enabling Grids for E-sciencE
INFSO-RI-508833
Cic-on-duty
Cic–on-duty agenda
weekly shifts – 8/5
GGUS, ROC User-support, mail
Monitor, diagnose troubles
Contact site administrators, ROC
Problem Tracking Tool
Follow-up
GDA meetings
COD meetingsLog file
LCG-EGEE Operations 6
Enabling Grids for E-sciencE
INFSO-RI-508833
Cic operational procedure
[This procedure is provided by OMC]
High level abstraction
of core tools results
Link all existing tools monitoring diagnosis communication
• follow up log
CIC Operations https://cic.in2p3.fr/index.php?id=cic&subid=cic_dash2
Ops Procedure
LCG-EGEE Operations 7
Enabling Grids for E-sciencE
INFSO-RI-508833
Cic-on-duty Dashboardhttps://cic.in2p3.fr/pages/cic/framedashboard.html
All-in-one dashboard
LCG-EGEE Operations 8
Enabling Grids for E-sciencE
INFSO-RI-508833
Selection of Monitoring tools
GIIS Monitor GIIS Monitor graphs Sites Functional Tests
GOC Data BaseScheduled Downtimes Live Job Monitor
GridIce – VO view GridIce – fabric view Certificate Lifetime Monitor
Note: Those thumbnails are links and are clickable.
LCG-EGEE Operations 9
Enabling Grids for E-sciencE
INFSO-RI-508833
Monitoringtool
In DepthTesting
GIISMonitor
DiagnosishelpWickipage
ReportSavannah
Follow upCic
mailingtool
RCIncidentclosure
(1) (2) (3) (4)
ROC
CIC
OMC
(5.1)
(5.2)
(5.1)
(6)
SEVERITYESCALATIONPROCEDURE
Scenarios of escalation : 1/2
Monitoringtool
In DepthTesting
GIISMonitor
DiagnosishelpWickipage
ReportSavannah
Follow upCic
mailingtool
RCIncidentclosure
(1) (2) (3) (4)
ROC
CIC
OMC
(5.1)
(5.2)
(5.1)
(6)
SEVERITYESCALATIONPROCEDURE
TZR
GIIS
Goc Wiki
Ticket status
LCG-EGEE Operations 10
Enabling Grids for E-sciencE
INFSO-RI-508833
TaskDeadlineExpirationSavannah
Follow upCic
mailingtool
Escalation procedure2nd Mail
Incidentclosure
People in charge of or tool used
Action
(1) (2)
(3)
(4)
Monitoring tool
Scenarios of escalation : 2/2
1st mail
Blacklist
DeadlineProblem still there
Next deadline
Action takenAction taken
LCG-EGEE Operations 11
Enabling Grids for E-sciencE
INFSO-RI-508833
Next steps
Unification of information coming from various test tools
Move to GGUS
[This display based on RGMA is provided by the russian federation]
http://www.ggus.org