ADCOS Summary

12
ADCOS Summary Wahid Bhimji

description

ADCOS Summary. Wahid Bhimji. Overview. All senior shifts covered – thanks to shifters No trainee shifters CENTRAL SERVICES: Wed- T hu FTS3 ( ggus: 106095 closed) Saturday ~4:00-11:30 GGUS ( elog:49792 ok now) Today – Site services (elog:49841, 49836). Jobs. Transfers. - PowerPoint PPT Presentation

Transcript of ADCOS Summary

Page 1: ADCOS Summary

ADCOS Summary Wahid Bhimji

Page 2: ADCOS Summary

OverviewAll senior shifts covered – thanks to shifters

No trainee shifters

CENTRAL SERVICES:

Wed-Thu FTS3 (ggus: 106095 closed)

Saturday ~4:00-11:30 GGUS (elog:49792 ok now)

Today – Site services (elog:49841, 49836)

Page 3: ADCOS Summary

Jobs

Page 4: ADCOS Summary

Transfers

Page 5: ADCOS Summary

Daily issues (a few of importance or interest)

Wednesday June 11th

Elog:49728 BNL “ddm: Too many attempts”ADCSUPPORT-3724 Mover changed – closed

Elog: 49739 RAL Disk serverGgus:106090 server recovered – ggus closed

Thursday June 12th

Elog:49751 UKI-SOUTHGRID-OX-HEP Disk serverGgus: 106114 – fixed

Page 6: ADCOS Summary

Daily issues cont.

Thursday June 12th

Taiwan-LCG2 Stage-out errors Ggus:106153 – permissions on directory – fixed (also

ggus:106190 (Friday) – poss. related srm load

Friday June 13th

BNL transfer issues elog:49778 (namespace server hardware resolved quickly)

Page 7: ADCOS Summary

Daily issues

Saturday June 14th

Elog:49797 UKI-NORTHGRID-LIV-HEP disk server firewall ggus:106196 – fixed quickly

Sunday June 15th

UKI-LT2-IC-HEP – cvmfs failures – died down but ticket open

NDGF-T1 - out of diskspace for data staging - prod job failures:ggus:106027

Page 8: ADCOS Summary

Daily issues – today Tuesday June 17th

Very little transfers in DDM

Assigned jobs increasing in many clouds (elog:49841) and jobs failing in NL and DE with “Could not add files to DDM:” (elog:49842)

Various site services not available – e.g. https://sls.cern.ch/sls/service.php?id=atlas-SS07;

https://sls.cern.ch/sls/service.php?id=atlas-SS09

Some (like FZK one) – now green…

Page 9: ADCOS Summary

ggus current

UsSite-no actionSite-actionSite-actionSite-actionSite-no actionUs –help?Site-actionSite or us?Site or us?Site – no action

Who its sitting with.And if site – is there any action being taken

Page 10: ADCOS Summary

Thanks to shifters and experts

Page 11: ADCOS Summary

ggus – closed this week

Page 12: ADCOS Summary

Jira created v. resolved

Comment: Shifters quiteoften open duplicate(maybe not as obviousas it could be)