WatchdogLinux

The Watchdog class is used by the Job Wrapper to resolve and monitor the system CPU and memory consumed. The Watchdog can determine if a running job is stalled and indicate this to the Job Wrapper.

This is the Unix / Linux compatible Watchdog subclass.

class DIRAC.WorkloadManagementSystem.JobWrapper.WatchdogLinux.WatchdogLinux(pid, exeThread, spObject, jobCPUTime, memoryLimit=0, processors=1, systemFlag='linux', jobArgs={})

Bases: DIRAC.WorkloadManagementSystem.JobWrapper.Watchdog.Watchdog

__init__(pid, exeThread, spObject, jobCPUTime, memoryLimit=0, processors=1, systemFlag='linux', jobArgs={})

Constructor, takes system flag as argument.

calibrate()

The calibrate method obtains the initial values for system memory and load and calculates the margin for error for the rest of the Watchdog cycle.

execute()

The main agent execution method of the Watchdog.

getDiskSpace()

Obtains the disk space used.

getLoadAverage()

Obtains the load average.

getMemoryUsed()

Obtains the memory used.

getNodeInformation()

Try to obtain system HostName, CPU, Model, cache and memory. This information is not essential to the running of the jobs but will be reported if available.

initialize(loops=0)

Watchdog initialization.

run()

The main watchdog execution method