Name | hadam3p_anz_m13z_2012_1_009268611_0 |
Workunit | 9361527 |
Created | 1 Dec 2014, 16:19:00 UTC |
Sent | 1 Dec 2014, 16:22:51 UTC |
Report deadline | 13 Nov 2015, 21:42:51 UTC |
Received | 28 Jan 2015, 5:20:32 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1229644 |
Run time | 1 day 11 hours 18 min 35 sec |
CPU time | 1 day 3 hours 54 min 1 sec |
Validate state | Invalid |
Credit | 1,006.54 |
Device peak FLOPS | 2.97 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.27</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2612, selfPID=10120, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2672, selfPID=10472, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:00:30 (10956): No heartbeat from core client for 30 sec - exiting
16:00:31 (10956): No heartbeat from core client for 30 sec - exiting
16:00:32 (10956): No heartbeat from core client for 30 sec - exiting
16:00:33 (10956): No heartbeat from core client for 30 sec - exiting
16:00:34 (10956): No heartbeat from core client for 30 sec - exiting
16:00:35 (10956): No heartbeat from core client for 30 sec - exiting
16:00:36 (10956): No heartbeat from core client for 30 sec - exiting
16:00:37 (10956): No heartbeat from core client for 30 sec - exiting
16:00:38 (10956): No heartbeat from core client for 30 sec - exiting
16:00:39 (10956): No heartbeat from core client for 30 sec - exiting
16:00:40 (10956): No heartbeat from core client for 30 sec - exiting
16:00:41 (10956): No heartbeat from core client for 30 sec - exiting
16:00:42 (10956): No heartbeat from core client for 30 sec - exiting
16:00:43 (10956): No heartbeat from core client for 30 sec - exiting
16:00:44 (10956): No heartbeat from core client for 30 sec - exiting
16:00:45 (10956): No heartbeat from core client for 30 sec - exiting
16:00:46 (10956): No heartbeat from core client for 30 sec - exiting
16:00:47 (10956): No heartbeat from core client for 30 sec - exiting
16:00:48 (10956): No heartbeat from core client for 30 sec - exiting
16:00:49 (10956): No heartbeat from core client for 30 sec - exiting
16:00:50 (10956): No heartbeat from core client for 30 sec - exiting
16:00:51 (10956): No heartbeat from core client for 30 sec - exiting
16:00:52 (10956): No heartbeat from core client for 30 sec - exiting
16:00:53 (10956): No heartbeat from core client for 30 sec - exiting
16:00:54 (10956): No heartbeat from core client for 30 sec - exiting
16:00:55 (10956): No heartbeat from core client for 30 sec - exiting
16:00:56 (10956): No heartbeat from core client for 30 sec - exiting
16:00:57 (10956): No heartbeat from core client for 30 sec - exiting
16:00:58 (10956): No heartbeat from core client for 30 sec - exiting
16:00:59 (10956): No heartbeat from core client for 30 sec - exiting
16:01:00 (10956): No heartbeat from core client for 30 sec - exiting
16:01:01 (10956): No heartbeat from core client for 30 sec - exiting
16:01:02 (10956): No heartbeat from core client for 30 sec - exiting
16:01:03 (10956): No heartbeat from core client for 30 sec - exiting
16:01:04 (10956): No heartbeat from core client for 30 sec - exiting
16:01:05 (10956): No heartbeat from core client for 30 sec - exiting
16:01:06 (10956): No heartbeat from core client for 30 sec - exiting
16:01:07 (10956): No heartbeat from core client for 30 sec - exiting
16:01:08 (10956): No heartbeat from core client for 30 sec - exiting
16:01:09 (10956): No heartbeat from core client for 30 sec - exiting
16:01:10 (10956): No heartbeat from core client for 30 sec - exiting
16:01:11 (10956): No heartbeat from core client for 30 sec - exiting
16:01:12 (10956): No heartbeat from core client for 30 sec - exiting
16:01:13 (10956): No heartbeat from core client for 30 sec - exiting
16:01:14 (10956): No heartbeat from core client for 30 sec - exiting
16:01:15 (10956): No heartbeat from core client for 30 sec - exiting
16:01:16 (10956): No heartbeat from core client for 30 sec - exiting
16:01:17 (10956): No heartbeat from core client for 30 sec - exiting
16:01:18 (10956): No heartbeat from core client for 30 sec - exiting
16:01:19 (10956): No heartbeat from core client for 30 sec - exiting
16:01:20 (10956): No heartbeat from core client for 30 sec - exiting
16:01:21 (10956): No heartbeat from core client for 30 sec - exiting
16:01:22 (10956): No heartbeat from core client for 30 sec - exiting
16:01:23 (10956): No heartbeat from core client for 30 sec - exiting
16:01:24 (10956): No heartbeat from core client for 30 sec - exiting
16:01:25 (10956): No heartbeat from core client for 30 sec - exiting
16:01:26 (10956): No heartbeat from core client for 30 sec - exiting
16:01:27 (10956): No heartbeat from core client for 30 sec - exiting
16:01:28 (10956): No heartbeat from core client for 30 sec - exiting
16:01:29 (10956): No heartbeat from core client for 30 sec - exiting
16:01:30 (10956): No heartbeat from core client for 30 sec - exiting
16:01:31 (10956): No heartbeat from core client for 30 sec - exiting
16:01:32 (10956): No heartbeat from core client for 30 sec - exiting
16:01:33 (10956): No heartbeat from core client for 30 sec - exiting
16:01:34 (10956): No heartbeat from core client for 30 sec - exiting
16:01:35 (10956): No heartbeat from core client for 30 sec - exiting
16:01:36 (10956): No heartbeat from core client for 30 sec - exiting
16:01:37 (10956): No heartbeat from core client for 30 sec - exiting
16:01:38 (10956): No heartbeat from core client for 30 sec - exiting
16:01:39 (10956): No heartbeat from core client for 30 sec - exiting
16:01:40 (10956): No heartbeat from core client for 30 sec - exiting
16:01:41 (10956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:01:42 (10956): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
16:49:30 (10424): No heartbeat from core client for 30 sec - exiting
16:49:31 (10424): No heartbeat from core client for 30 sec - exiting
16:49:32 (10424): No heartbeat from core client for 30 sec - exiting
16:49:33 (10424): No heartbeat from core client for 30 sec - exiting
16:49:34 (10424): No heartbeat from core client for 30 sec - exiting
16:49:35 (10424): No heartbeat from core client for 30 sec - exiting
16:49:36 (10424): No heartbeat from core client for 30 sec - exiting
16:49:37 (10424): No heartbeat from core client for 30 sec - exiting
16:49:38 (10424): No heartbeat from core client for 30 sec - exiting
16:49:39 (10424): No heartbeat from core client for 30 sec - exiting
16:49:40 (10424): No heartbeat from core client for 30 sec - exiting
16:49:41 (10424): No heartbeat from core client for 30 sec - exiting
16:49:42 (10424): No heartbeat from core client for 30 sec - exiting
16:49:43 (10424): No heartbeat from core client for 30 sec - exiting
16:49:44 (10424): No heartbeat from core client for 30 sec - exiting
16:49:45 (10424): No heartbeat from core client for 30 sec - exiting
16:49:46 (10424): No heartbeat from core client for 30 sec - exiting
16:49:47 (10424): No heartbeat from core client for 30 sec - exiting
16:49:48 (10424): No heartbeat from core client for 30 sec - exiting
16:49:49 (10424): No heartbeat from core client for 30 sec - exiting
16:49:50 (10424): No heartbeat from core client for 30 sec - exiting
16:49:51 (10424): No heartbeat from core client for 30 sec - exiting
16:49:52 (10424): No heartbeat from core client for 30 sec - exiting
16:49:53 (10424): No heartbeat from core client for 30 sec - exiting
16:49:54 (10424): No heartbeat from core client for 30 sec - exiting
16:49:55 (10424): No heartbeat from core client for 30 sec - exiting
16:49:56 (10424): No heartbeat from core client for 30 sec - exiting
16:49:57 (10424): No heartbeat from core client for 30 sec - exiting
16:49:58 (10424): No heartbeat from core client for 30 sec - exiting
16:49:59 (10424): No heartbeat from core client for 30 sec - exiting
16:50:00 (10424): No heartbeat from core client for 30 sec - exiting
16:50:01 (10424): No heartbeat from core client for 30 sec - exiting
16:50:02 (10424): No heartbeat from core client for 30 sec - exiting
16:50:03 (10424): No heartbeat from core client for 30 sec - exiting
16:50:04 (10424): No heartbeat from core client for 30 sec - exiting
16:50:05 (10424): No heartbeat from core client for 30 sec - exiting
16:50:06 (10424): No heartbeat from core client for 30 sec - exiting
16:50:07 (10424): No heartbeat from core client for 30 sec - exiting
16:50:08 (10424): No heartbeat from core client for 30 sec - exiting
16:50:09 (10424): No heartbeat from core client for 30 sec - exiting
16:50:10 (10424): No heartbeat from core client for 30 sec - exiting
16:50:11 (10424): No heartbeat from core client for 30 sec - exiting
16:50:12 (10424): No heartbeat from core client for 30 sec - exiting
16:50:13 (10424): No heartbeat from core client for 30 sec - exiting
16:50:14 (10424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3828, selfPID=3828, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9500, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9720, selfPID=3528, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
</stderr_txt>
<message>
upload failure:
<file_xfer_error>
  <file_name>hadam3p_anz_m13z_2012_1_009268611_0_3.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m13z_2012_1_009268611_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m13z_2012_1_009268611_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m13z_2012_1_009268611_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m13z_2012_1_009268611_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m13z_2012_1_009268611_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m13z_2012_1_009268611_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m13z_2012_1_009268611_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m13z_2012_1_009268611_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m13z_2012_1_009268611_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
</message>
]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
25 Jan 2015 08:50:05 | 1229644 | 17533084 | hadam3p_anz_m13z_2012_1_009268611_0 | 23,339 | 77,828 | 3.3347 |
24 Jan 2015 06:07:16 | 1229644 | 17533084 | hadam3p_anz_m13z_2012_1_009268611_0 | 11,819 | 39,637 | 3.3537 |
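
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That reading is inferred from the two rows above rather than from any CPDN documentation, and the function name below is hypothetical. A minimal Python sketch that checks it against the reported values:

```python
# Trickle rows copied from the table above:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
trickles = [
    (23339, 77828, 3.3347),
    (11819, 39637, 3.3537),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per cumulative timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_sec, reported in trickles:
    computed = sec_per_timestep(cpu_sec, timestep)
    print(f"{timestep:>6} TS  computed={computed:.4f}  reported={reported:.4f}")
```

Both computed values agree with the reported column to four decimal places, which supports the assumed definition for this task.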