-
Notifications
You must be signed in to change notification settings - Fork 41
defer PostJob for a longer time when waiting on Rucio #9079
Copy link
Copy link
Closed
Description
We may be waiting for Rucio transfer to many days (failing the PostJob will not help since we can not resubmit jobs which failed in transfer, Rucio will not accept an old LFN with a different checksum).
In this case it is better not to run many PostJobs uselessly every 30min, since we may end up overloading the AccessPoint resources. Better to increase defer time.
The best (only?) way seems to exit immediately from dag_bootstrap.sh with exit code 4, i.e. deferral stays at 30min (hardcoded in DAG description file), but PJ duration is minimal and Python is not started, saving a lot on memory.
See
#8429 (comment)
Reactions are currently unavailable