Notes Regarding to the Computing Run Coordinator

Remember: CRC = Computing Run Coordinator

Communication

  • how do people know who is CRC?

  • how do people contact the CRC?
    • we will have a cell phone
  • is there any standard report which the CRC gives?

Task

The CRC has the best overview of the Offline Computing plan and status, and he/she is the main operational link to other projects (Online, Offline). He/she participates in daily run meetings and keeps track of open computing issues/problems

Expertise

A computing expert, possibly a core-computing expert with a high experience in some areas and a good knowledge of the CMS Computing systems.

Location

The CRC must be located at CERN. The Computing Shift Person (CSP) and the DataOps Expert On Call are located either in the CMS Center or in the FNAL ROC.

Shift Duration

CRC: 1 week (from Wed 00:00 to Tue 24:00)

The Computing Shift Person (CSP) makes 8 hours shifts from 8AM-4PM (EU zone) and from 16:00-00:00 (US zone). Once we will have enough manpower, we also plan to cover the ASIA zone for CSP, as the DataOps Experts On Call are currently doing.

Computing Plan of the Day

The CRC should have the Computing Plan of the Day ready latest by 08:00 CERN time for the CSP and the DataOps Expert On Call. The plan is accessible from the Main CMS-Centre page to browse "Today plan".

To browse old daily plans or fill-in a new "Computing plan"

Further useful instructions to create and/or modify the Computing Plan of the Day.

I found it convenient to fill the plan the night before and then have some parts updated in the morning, in particular after the daily 11AM run meeting. The DataOps part of the plan should be updated by the ASIA or EU DataOps Expert On Call.

E-logs

Currently there are a lot of different E-logs to look at:

  1. CSP E-log
  2. DataOps Expert On Call E-log
  3. CMS Online Shift E-log
  4. LHC shift E-log

The CSP e-log consist of a main shift-report page where the CSP and the DataOps Expert On Call write their shift summary. This is a main source of information for the CRC, before going to the detailed sub-sections of the E-logs. The plan is to merge A) and B) asap, as presented by Daniele Bonacorsi. The format of the combined E-log is yet to be determined.

  • the CMS Online Shift E-log is also a very useful source of information for the CRC (specially Darin's or Tiziano summaries
  • the CRC should subscribe to at least the hn-cms-global-runs and hn-cms-commissioning HN lists

Communication between CRC and Computing/DataOps shifters/experts

The CRC is not supposed to always sit at the CMS Center, in fact he/she cannot, since he/she needs to attend meetings e.g. to report about the offline computing status. Since the CRC's location is not well determined, any communication channel depending on the needs should be used to contact shifters: e-logs, TV, instant messaging, email, telephone. The CRC should also regularly scan through the intermediate Computing and DataOps e-logs reports, in order to make sure the main issues are covered. Preferably the CRC should subscribe to the e-log and receive an e-mail notification upon each new entry by shifters. he/she should also help trouble-shooting various issues raised by shifters. If he/she doesn’t have time to cover all issues, he/she should at least make sure the relevant expert has been put on most urgent cases

  • DataOps Expert On Call contact details

Communication with other Coordinators

This is an essential part of the CRC role. Actually, since we are setting up these coordination roles in various projects and since the projects are closely interlinked, the ranges of actions of each coordinator are not yet always clearly defined. However these boundaries will be defined better and better as time evolves.

Shift List

An electronic shift list has been setup. Currently (18.09.2008) this tool is not available from outside CERN, but a solution for this issue should be found soon. This shift list contains 3 sections: CRC, CSP, DataSP.

The first two are already extensively used, for any question about the shift management based on that tool please contact Frank.Glege@cern.ch. The CRC is responsible of making sure the shift list is always covered during his/he CRC-week. In absence of the electronic shift tool, twikis containing shift lists have been maintained so far here (CSP), here (DataOps)

Useful Documentation

The CSP shift instructions are still under construction.

The DataOps Expert On Call [https://twiki.cern.ch/twiki/bin/view/CMS/DataOpsShiftInstructions][instructions]] are reasonably complete. There is some overlap between the two. The mid-term goal is to have a Single Offline Computing shift instructions document, with procedures covering all aspects of Offline Computing Monitoring.

Topic revision: r3 - 2008-09-18 - 12:57:19 - ChristophPaus
 
This site is powered by the TWiki collaboration platformCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback