Name | hadam3p_saf_2964_1980_1_007247658_0 |
Workunit | 7445898 |
Created | 5 May 2011, 11:55:40 UTC |
Sent | 6 May 2011, 21:12:20 UTC |
Report deadline | 18 Apr 2012, 2:32:20 UTC |
Received | 4 Jun 2011, 23:40:20 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1025620 |
Run time | 3 days 18 hours 56 min 46 sec |
CPU time | 2 days 2 hours 41 min 5 sec |
Validate state | Invalid |
Credit | 1,309.70 |
Device peak FLOPS | 2.58 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.17</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5360, selfPID=3032, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4952, selfPID=2552, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4424, selfPID=1964, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4708, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2072, selfPID=4756, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4660, selfPID=4624, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3928, selfPID=3916, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5112, selfPID=3152, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4924, selfPID=2236, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4576, selfPID=4372, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=316, selfPID=4332, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2364, selfPID=4136, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5088, selfPID=3388, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3540, selfPID=4732, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5752, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4500, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4156, selfPID=4472, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4552, selfPID=5048, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4768, selfPID=2944, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4672, selfPID=4472, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048 Leaving CPDN_Main::Monitor... 
Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_2964_1980_1_007247658_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2964_1980_1_007247658_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2964_1980_1_007247658_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2964_1980_1_007247658_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_2964_1980_1_007247658_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
02 Jun 2011 23:44:18 | 1025620 | 12867385 | hadam3p_saf_2964_1980_1_007247658_0 | 80,736 | 162,690 | 2.0151 |
01 Jun 2011 00:56:33 | 1025620 | 12867385 | hadam3p_saf_2964_1980_1_007247658_0 | 69,216 | 139,689 | 2.0182 |
30 May 2011 01:11:36 | 1025620 | 12867385 | hadam3p_saf_2964_1980_1_007247658_0 | 57,696 | 116,672 | 2.0222 |
17 May 2011 01:31:14 | 1025620 | 12867385 | hadam3p_saf_2964_1980_1_007247658_0 | 46,185 | 92,822 | 2.0098 |
17 May 2011 01:06:01 | 1025620 | 12867385 | hadam3p_saf_2964_1980_1_007247658_0 | 46,176 | 92,061 | 1.9937 |
13 May 2011 03:34:33 | 1025620 | 12867385 | hadam3p_saf_2964_1980_1_007247658_0 | 34,656 | 69,306 | 1.9998 |
08 May 2011 21:52:59 | 1025620 | 12867385 | hadam3p_saf_2964_1980_1_007247658_0 | 23,136 | 46,587 | 2.0136 |
07 May 2011 18:32:39 | 1025620 | 12867385 | hadam3p_saf_2964_1980_1_007247658_0 | 11,616 | 23,778 | 2.0470 |
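
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is inferred from the figures in the table, not stated by the page, so treat it as an assumption. The sketch below recomputes the ratio for each trickle row; the names `TRICKLES`, `average_sec_per_timestep`, and `check_rows` are hypothetical and exist only for this illustration.

```python
# Rows copied from the trickle table above:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
TRICKLES = [
    (80736, 162690, 2.0151),
    (69216, 139689, 2.0182),
    (57696, 116672, 2.0222),
    (46185, 92822, 2.0098),
    (46176, 92061, 1.9937),
    (34656, 69306, 1.9998),
    (23136, 46587, 2.0136),
    (11616, 23778, 2.0470),
]


def average_sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Assumed formula: cumulative CPU seconds per cumulative timestep."""
    return cpu_time_sec / timestep


def check_rows(rows, tolerance=5e-5):
    """Return one boolean per row: does the recomputed ratio match the
    reported value to within rounding at four decimal places?"""
    return [
        abs(average_sec_per_timestep(ts, cpu) - reported) <= tolerance
        for ts, cpu, reported in rows
    ]


if __name__ == "__main__":
    for (ts, cpu, reported), ok in zip(TRICKLES, check_rows(TRICKLES)):
        computed = average_sec_per_timestep(ts, cpu)
        print(f"{ts:>6} steps  {cpu:>7} s  computed={computed:.4f}  "
              f"reported={reported:.4f}  match={ok}")
```

The tolerance of 5e-5 corresponds to the largest error that rounding to four decimal places can introduce, so a row that fails the check would suggest the assumed formula does not hold for it.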