Name | hadam3p_anz_n9lt_2012_1_008600065_2 |
Workunit | 8746577 |
Created | 27 Mar 2014, 16:55:48 UTC |
Sent | 27 Mar 2014, 17:03:49 UTC |
Report deadline | 9 Mar 2015, 22:23:49 UTC |
Received | 6 Jul 2014, 5:37:26 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1295575 |
Run time | 3 days 19 hours 34 min 57 sec |
CPU time | 3 days 18 hours 24 min 29 sec |
Validate state | Invalid |
Credit | 3,490.64 |
Device peak FLOPS | 3.50 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4176, selfPID=4632, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=836, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2468, selfPID=4640, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2484, selfPID=4616, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4256, selfPID=4564, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5920, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3196, selfPID=4564, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5824, selfPID=4548, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4680, selfPID=4536, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1488, selfPID=4556, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5184, selfPID=4540, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5584, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4308, selfPID=4524, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4608, selfPID=4580, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5144, selfPID=5144, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2100, selfPID=4528, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6292, selfPID=4524, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6416, selfPID=4532, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6692, selfPID=6692, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3672, iMonCtr=2 Model crash detected, will try to restart... CSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5664, selfPID=4704, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=5268, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2544, selfPID=4692, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_n9lt_2012_1_008600065_2_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9lt_2012_1_008600065_2_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9lt_2012_1_008600065_2_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9lt_2012_1_008600065_2_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n9lt_2012_1_008600065_2_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
05 Jul 2014 11:31:46 | 1295575 | 16422493 | hadam3p_anz_n9lt_2012_1_008600065_2 | 80,939 | 286,992 | 3.5458 |
02 Jul 2014 08:30:41 | 1295575 | 16422493 | hadam3p_anz_n9lt_2012_1_008600065_2 | 69,419 | 250,621 | 3.6103 |
22 Apr 2014 17:41:49 | 1295575 | 16422493 | hadam3p_anz_n9lt_2012_1_008600065_2 | 57,899 | 218,160 | 3.7679 |
12 Apr 2014 09:10:43 | 1295575 | 16422493 | hadam3p_anz_n9lt_2012_1_008600065_2 | 46,379 | 175,351 | 3.7808 |
06 Apr 2014 14:37:56 | 1295575 | 16422493 | hadam3p_anz_n9lt_2012_1_008600065_2 | 34,859 | 130,597 | 3.7464 |
04 Apr 2014 06:04:23 | 1295575 | 16422493 | hadam3p_anz_n9lt_2012_1_008600065_2 | 23,339 | 86,593 | 3.7102 |
02 Apr 2014 05:56:28 | 1295575 | 16422493 | hadam3p_anz_n9lt_2012_1_008600065_2 | 11,819 | 43,742 | 3.7010 |
©2024 climateprediction.net