ADCOS Summary
ADCOS Summary Wahid Bhimji
Overview
All senior shifts covered – thanks to shifters
No trainee shifters
CENTRAL SERVICES:
Wed-Thu FTS3 (ggus: 106095 closed)
Saturday ~4:00-11:30 GGUS (elog:49792 ok now)
Today – Site services (elog:49841, 49836)
Jobs
Transfers
Daily issues (a few of importance or interest)
Wednesday June 11th
Elog:49728 BNL “ddm: Too many attempts”, ADCSUPPORT-3724: mover changed – closed
Elog:49739 RAL disk server, ggus:106090: server recovered – ggus closed
Thursday June 12th
Elog:49751 UKI-SOUTHGRID-OX-HEP disk server, ggus:106114 – fixed
Taiwan-LCG2 stage-out errors, ggus:106153 – permissions on directory – fixed (also ggus:106190 (Friday) – possibly related to SRM load)
Friday June 13th
BNL transfer issues, elog:49778 (namespace server hardware issue, resolved quickly)
Saturday June 14th
Elog:49797 UKI-NORTHGRID-LIV-HEP disk server firewall ggus:106196 – fixed quickly
Sunday June 15th
UKI-LT2-IC-HEP – cvmfs failures – died down but ticket open
NDGF-T1 – out of disk space for data staging – prod job failures: ggus:106027
Daily issues – today Tuesday June 17th
Very few transfers in DDM
Assigned jobs increasing in many clouds (elog:49841) and jobs failing in NL and DE with “Could not add files to DDM:” (elog:49842)
Various site services not available – e.g. https://sls.cern.ch/sls/service.php?id=atlas-SS07;
https://sls.cern.ch/sls/service.php?id=atlas-SS09
Some (like the FZK one) are now green…
ggus current
Us; Site – no action; Site – action; Site – action; Site – action; Site – no action; Us – help?; Site – action; Site or us?; Site or us?; Site – no action
Who it's sitting with. And if site – is there any action being taken?
Thanks to shifters and experts
ggus – closed this week
Jira created v. resolved
Comment: Shifters quite often open duplicates (maybe not as obvious as it could be)