Skip to content

defer PostJob for a longer time when waiting on Rucio #9079

@belforte

Description

@belforte

We may be waiting for Rucio transfer to many days (failing the PostJob will not help since we can not resubmit jobs which failed in transfer, Rucio will not accept an old LFN with a different checksum).

In this case it is better not to run many PostJobs uselessly every 30min, since we may end up overloading the AccessPoint resources. Better to increase defer time.

The best (only?) way seems to exit immediately from dag_bootstrap.sh with exit code 4, i.e. deferral stays at 30min (hardcoded in DAG description file), but PJ duration is minimal and Python is not started, saving a lot on memory.

See
#8429 (comment)

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions