On a Unix remote host, jobs may fail with error 129 when performing sudo |
Legacy ID:KA389222 The exit code from a batch job is a standard UNIX termination status, which is the same sort of number you get in a shell script when checking the "$?" or "$status" variable after executing a command. Typically, exit code 0 (zero) means successful completion. Codes 1-127 are typically generated by your job itself calling exit() with a non-zero value to terminate itself and indicate an error In this scenario - the job exits with 129 - which means 128 + 1 Signal 1 means SIGHUP and we can see that it happens 5 minutes after the job started. As a general note, whenever the Control-M/Agent communicates with a remote host it opens an SSH session to this machine, and another SFTP connection (in order to transfer the wrapper temp files and then, to clean these logs and to pull out the SYSOUT file from the remote-host back to the Agent). By default the connection closes after 300 seconds / 5 minutes. In order to prevent this error and allow you to initiate a connection to a remote host over SSH with the parameter "ssh -t" for a pseudo tty, we need to increase the timeout value. To do this, please, open the ctm/data/OS.dat file on the Control-M/Agent that communicates with this remote host and add the following parameter: RJX_SSH_CLOSE_TOUT Please set the value to the maximum expected time-frame that the job should execute. For example: Stop and restart the Control-M/Agent to make the change effective, and test the job again. Related Products:
|