    Implemented a proper 'dead worker on LSF' checking system · ba5578d5
    Jessica Severin authored
    Workers are created registered to the LSF beekeeper, and the 'dead' check
    is done only where the beekeeper is LSF and the worker is 15 minutes
    overdue for its check-in.
    The check is done with an ssh to the worker's registered host machine and
    a 'ps' command to see if the registered process_id of the worker is still
    running. This allows jobs to be submitted via LSF arrays (which give only
    a single LSF job id for the entire array), while still allowing each worker
    to be checked separately.
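
    The check described above can be sketched roughly as follows. This is a
    minimal illustration in Python, not the code from this commit: the Worker
    record, its field names, and the functions build_ps_command and
    worker_is_dead are hypothetical. It assumes that the remote 'ps -p PID'
    exits with status 1 when the process is not running, and that ssh itself
    exits with 255 when it cannot connect.

```python
import subprocess
from dataclasses import dataclass
from datetime import datetime, timedelta

# Overdue threshold taken from the commit message.
CHECKIN_GRACE = timedelta(minutes=15)


@dataclass
class Worker:
    # Hypothetical record; field names are illustrative only.
    beekeeper: str          # e.g. "LSF"
    host: str               # host machine the worker registered from
    process_id: int         # process id the worker registered
    check_in_due: datetime  # when the next check-in was expected


def build_ps_command(worker):
    # ssh to the registered host and ask ps about the registered pid.
    return ["ssh", worker.host, "ps", "-p", str(worker.process_id)]


def worker_is_dead(worker, now, run=subprocess.run):
    # Only LSF workers are checked this way.
    if worker.beekeeper != "LSF":
        return False
    # Only check once the worker is at least 15 minutes overdue.
    if now - worker.check_in_due < CHECKIN_GRACE:
        return False
    result = run(build_ps_command(worker), capture_output=True)
    # Assumption: exit status 1 means ps found no such process.
    # Anything else (0 = still running, 255 = ssh failure) is not
    # treated as proof that the worker is dead.
    return result.returncode == 1
```

    Because the check keys on host and process id rather than on the LSF job
    id, it works the same way for every worker in a job array. The run
    parameter is only there so the ssh call can be replaced in a test.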