Speaker
Description
The ATLAS EventIndex is the complete catalogue of all ATLAS real and simulated events, keeping the references to all permanent files that contain a given event in any processing stage. The Event Picking Service (EPS) is a part of the EventIndex project. It automates the procedure of finding the locations of large numbers of events, extracting and collecting them into separate files. It supports different formats of events and has an elastic workflow for different input data. From time to time some errors occur while performing different workflows and threads. Some of these errors are situational, such as losses of connections to external services, and do not require any action from the user to fix them. The service detects such errors and restarts the necessary job chains automatically. Fixing other type of errors requires specific actions from the administrators of the services. The EPS finds such type of errors and reports them. Finally, the third type of errors that can occur are those that require a lot of time and effort from the developers to correct, but as a result, adding new methods allow them to be avoided automatically in the future.