HTCondor¶
The IGWN Computing Grid is based on top of HTCondor, a specialized workload management system for compute-intensive jobs. HTCondor is used to specify discrete work units (jobs) you want completed that are then distributed across the available resources with sophisticated scheduling, prioritisation, monitoring, and reporting capabilities.
Basic HTCondor Usage¶
For an excellent introduction to HTCondor usage, please refer to the official HTCondor Users' Manual:
https://htcondor.readthedocs.io/en/lts/users-manual/
In addition, the Center for High Throughput Computing in Madison maintain a youtube channel with a variety of HTCondor tutorials for both users and administrators:
https://www.youtube.com/channel/UCd1UBXmZIgB4p85t2tu-gLw
For introductory user tutorials to HTCondor, see e.g. the HTCondor User Tutorials playlist:
https://www.youtube.com/watch?v=oMAvxsFJaw4&list=PLO7gMRGDPNumCuo3pCdRk23GDLNKFVjHn
The remainder of this guide describes the non-standard features of the IGWN Computing Grid.
Access to the IGWN Computing Grid¶
See Access point for details on how to access the IGWN Computing Grid.
IGWN Computing Grid features¶
The following pages describe those features specific to the IGWN Computing Grid.
Mandatory extra features¶
The following extra considerations are MANDATORY:
Optional extra features¶
The following extra considerations are OPTIONAL, depending on the requirements of your workflow:
- Accessing software
- Allow/deny-list sites
- Checkpointing
- Credentials
- Data management
- Environment variables
- GPUs & HTCondor
- Monitoring job status
Condor-LIGO Mailing list¶
There is a mailing-list shared between Condor developers and IGWN to discuss IGWN's use of Condor and any problems. To subscribe, go to
https://lists.aei.mpg.de/cgi-bin/mailman/listinfo/condorligo