Restart job in Condor after certain amount time -
i running jobs on condor , have noticed reason subset of jobs run never complete. there setting in submit file kills , resubmits job if takes on amount of time complete? similar question condor timeout idle jobs except want condor not kill jobs, resubmit them well.
thanks!
you can use kill transition expression in machine class add file (condor user manual). like:
start = true ... +maxjobexecutiontime = xxx #seconds kill = $(activitytimer) > maxjobexecutiontime
like machine kill jobs take more maxexecutiontime. condor retry job.
Comments
Post a Comment