Notes Regarding to the Computing Run Coordinator
Remember: CRC = Computing Run Coordinator
Communication
- how do people know who is CRC?
- how do people contact the CRC?
- we will have a cell phone
- is there any standard report which the CRC gives?
Task
The CRC has the best overview of the Offline Computing plan and status, and he/she is the main
operational link to other projects (Online, Offline). He/she participates in daily run meetings
and keeps track of open computing issues/problems
Expertise
A computing expert, possibly a core-computing expert with a high experience in some areas and a
good knowledge of the CMS Computing systems.
Location
The CRC must be located at CERN. The Computing Shift Person (CSP) and the DataOps Expert On Call
are located either in the CMS Center or in the FNAL ROC.
Shift Duration
CRC: 1 week (from Wed 00:00 to Tue 24:00)
The Computing Shift Person (CSP) makes 8 hours shifts from 8AM-4PM (EU zone) and from
16:00-00:00 (US zone). Once we will have enough manpower, we also plan to cover the ASIA zone for
CSP, as the DataOps Experts On Call are currently doing.
Computing Plan of the Day
The CRC should have the
Computing Plan of the Day ready
latest by 08:00 CERN time for the CSP and the DataOps Expert On Call. The plan is accessible from
the
Main CMS-Centre page to browse
"Today plan".
To browse old daily plans or fill-in a
new "Computing plan"
Further useful
instructions to create and/or
modify the Computing Plan of the Day.
I found it convenient to fill the plan the night before and then have some parts updated in the
morning, in particular after the daily 11AM run meeting. The DataOps part of the plan should be
updated by the ASIA or EU DataOps Expert On Call.
E-logs
Currently there are a lot of different E-logs to look at:
- CSP E-log
- DataOps Expert On Call E-log
- CMS Online Shift E-log
- LHC shift E-log
The CSP e-log consist of a
main shift-report page
where the CSP and the DataOps Expert On Call write their shift summary. This is a main source of
information for the CRC, before going to the detailed sub-sections of the E-logs. The plan is to
merge A) and B) asap, as presented by
Daniele Bonacorsi. The format of the
combined E-log is yet to be determined.
- the CMS Online Shift E-log is also a very useful source of information for the CRC (specially Darin's or Tiziano summaries
- the CRC should subscribe to at least the hn-cms-global-runs and hn-cms-commissioning HN lists
Communication between CRC and Computing/DataOps shifters/experts
The CRC is not supposed to always sit at the CMS Center, in fact he/she cannot, since he/she
needs to attend meetings
e.g. to report about the offline computing status. Since the CRC's
location is not well determined, any communication channel depending on the needs should be
used to contact shifters: e-logs, TV, instant messaging, email, telephone. The CRC should also
regularly scan through the intermediate Computing and DataOps e-logs reports, in order to make
sure the main issues are covered. Preferably the CRC should subscribe to the e-log and receive
an e-mail notification upon each new entry by shifters. he/she should also help
trouble-shooting various issues raised by shifters. If he/she doesn’t have time to cover
all issues, he/she should at least make sure the relevant expert has been put on most urgent
cases
- DataOps Expert On Call contact details
Communication with other Coordinators
This is an essential part of the CRC role. Actually, since we are setting up these
coordination roles in various projects and since the projects are closely interlinked, the
ranges of actions of each coordinator are not yet always clearly defined. However these
boundaries will be defined better and better as time evolves.
Shift List
An electronic shift list has been
setup.
Currently (18.09.2008) this tool is not available from outside CERN, but a solution for this
issue should be found soon. This shift list contains 3 sections: CRC, CSP, DataSP.
The first two are already extensively used, for any question about the shift management based
on that tool please contact
Frank.Glege@cern.ch. The CRC is responsible of making sure the
shift list is always covered during his/he CRC-week. In absence of the electronic shift tool,
twikis containing shift lists have been maintained so far
here (CSP),
here (DataOps)
Useful Documentation
The
CSP shift instructions
are still under construction.
The DataOps Expert On Call
[https://twiki.cern.ch/twiki/bin/view/CMS/DataOpsShiftInstructions][instructions]]
are reasonably complete. There is some overlap between the two. The mid-term goal is to have
a Single Offline Computing shift instructions document, with procedures covering all aspects
of Offline Computing Monitoring.