Name | hadam3p_eu_9efk_1963_1_007738760_0 |
Workunit | 7893868 |
Created | 26 Jan 2012, 15:42:48 UTC |
Sent | 29 Jan 2012, 15:15:20 UTC |
Report deadline | 10 Jan 2013, 20:35:20 UTC |
Received | 29 Feb 2012, 17:50:08 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 962683 |
Run time | 18 hours 41 min 10 sec |
CPU time | 17 hours 13 min 5 sec |
Validate state | Invalid |
Credit | 399.11 |
Device peak FLOPS | 3.31 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1496, iMonCtr=2 Model crash detected, will try to restart... 18:02:02 (2492): No heartbeat from core client for 30 sec - exiting 18:02:03 (2492): No heartbeat from core client for 30 sec - exiting 18:02:04 (2492): No heartbeat from core client for 30 sec - exiting 18:02:05 (2492): No heartbeat from core client for 30 sec - exiting 18:02:06 (2492): No heartbeat from core client for 30 sec - exiting 18:02:07 (2492): No heartbeat from core client for 30 sec - exiting 18:02:08 (2492): No heartbeat from core client for 30 sec - exiting 18:02:09 (2492): No heartbeat from core client for 30 sec - exiting 18:02:10 (2492): No heartbeat from core client for 30 sec - exiting 18:02:11 (2492): No heartbeat from core client for 30 sec - exiting 18:02:12 (2492): No heartbeat from core client for 30 sec - exiting 18:02:13 (2492): No heartbeat from core client for 30 sec - exiting 18:02:14 (2492): No heartbeat from core client for 30 sec - exiting 18:02:15 (2492): No heartbeat from core client for 30 sec - exiting 18:02:16 (2492): No heartbeat from core client for 30 sec - exiting 18:02:17 (2492): No heartbeat from core client for 30 sec - exiting 18:02:18 (2492): No heartbeat from core client for 30 sec - exiting 18:02:19 (2492): No heartbeat from core client for 30 sec - exiting 18:02:20 (2492): No heartbeat from core client for 30 sec - exiting 18:02:21 (2492): No heartbeat from core client for 30 sec - exiting 18:02:22 (2492): No heartbeat from core client for 30 sec - exiting 18:02:23 (2492): No heartbeat from core client for 30 sec - exiting 18:02:24 (2492): No heartbeat from core client for 30 sec - exiting 18:02:25 (2492): No heartbeat from core 
client for 30 sec - exiting 18:02:26 (2492): No heartbeat from core client for 30 sec - exiting 18:02:27 (2492): No heartbeat from core client for 30 sec - exiting 18:02:28 (2492): No heartbeat from core client for 30 sec - exiting 18:02:29 (2492): No heartbeat from core client for 30 sec - exiting 18:02:30 (2492): No heartbeat from core client for 30 sec - exiting 18:02:31 (2492): No heartbeat from core client for 30 sec - exiting 18:02:32 (2492): No heartbeat from core client for 30 sec - exiting 18:02:33 (2492): No heartbeat from core client for 30 sec - exiting 18:02:34 (2492): No heartbeat from core client for 30 sec - exiting 18:02:35 (2492): No heartbeat from core client for 30 sec - exiting 18:02:36 (2492): No heartbeat from core client for 30 sec - exiting 18:02:37 (2492): No heartbeat from core client for 30 sec - exiting 18:02:38 (2492): No heartbeat from core client for 30 sec - exiting 18:02:39 (2492): No heartbeat from core client for 30 sec - exiting 18:02:40 (2492): No heartbeat from core client for 30 sec - exiting 18:02:41 (2492): No heartbeat from core client for 30 sec - exiting 18:02:42 (2492): No heartbeat from core client for 30 sec - exiting 18:02:43 (2492): No heartbeat from core client for 30 sec - exiting 18:02:44 (2492): No heartbeat from core client for 30 sec - exiting 18:02:45 (2492): No heartbeat from core client for 30 sec - exiting 18:02:46 (2492): No heartbeat from core client for 30 sec - exiting 18:02:47 (2492): No heartbeat from core client for 30 sec - exiting 18:02:48 (2492): No heartbeat from core client for 30 sec - exiting 18:02:49 (2492): No heartbeat from core client for 30 sec - exiting 18:02:50 (2492): No heartbeat from core client for 30 sec - exiting 18:02:51 (2492): No heartbeat from core client for 30 sec - exiting 18:02:52 (2492): No heartbeat from core client for 30 sec - exiting 18:02:53 (2492): No heartbeat from core client for 30 sec - exiting 18:02:54 (2492): No heartbeat from core client for 30 sec - exiting 
18:02:55 (2492): No heartbeat from core client for 30 sec - exiting 18:02:56 (2492): No heartbeat from core client for 30 sec - exiting 18:02:57 (2492): No heartbeat from core client for 30 sec - exiting 18:02:58 (2492): No heartbeat from core client for 30 sec - exiting 18:02:59 (2492): No heartbeat from core client for 30 sec - exiting 18:03:00 (2492): No heartbeat from core client for 30 sec - exiting 18:03:01 (2492): No heartbeat from core client for 30 sec - exiting 18:03:02 (2492): No heartbeat from core client for 30 sec - exiting 18:03:03 (2492): No heartbeat from core client for 30 sec - exiting 18:03:04 (2492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4916, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=4084, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3956, selfPID=3956, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3956, selfPID=3736, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
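The `<message>` block at the end of the Stderr field lists one `<file_xfer_error>` entry per failed upload. The following is a minimal sketch of how those entries could be pulled out for counting. It assumes the field is available as a plain string; the `MESSAGE` excerpt holds only the first two entries from the log above, and the function name `upload_failures` is hypothetical. BOINC's error-number list appears to define -161 as `ERR_NOT_FOUND`, but treat that reading as an assumption rather than something this page confirms.

```python
import re
from collections import Counter

# Excerpt copied from the <message> block of the Stderr field
# (first two of the ten entries only).
MESSAGE = (
    "upload failure: "
    "<file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_3.zip</file_name> "
    "<error_code>-161</error_code> </file_xfer_error> "
    "<file_xfer_error> <file_name>hadam3p_eu_9efk_1963_1_007738760_0_4.zip</file_name> "
    "<error_code>-161</error_code> </file_xfer_error>"
)

# One match per <file_xfer_error> element; whitespace between tags is optional.
XFER_ERROR = re.compile(
    r"<file_xfer_error>\s*<file_name>(?P<name>[^<]+)</file_name>\s*"
    r"<error_code>(?P<code>-?\d+)</error_code>\s*</file_xfer_error>"
)


def upload_failures(text):
    """Return (file name, error code) pairs for every upload failure in text."""
    return [(m["name"], int(m["code"])) for m in XFER_ERROR.finditer(text)]


failures = upload_failures(MESSAGE)
codes = Counter(code for _, code in failures)  # how often each error code occurs

for name, code in failures:
    print(f"{name}: error {code}")
```

A regular expression is used instead of an XML parser because the stderr dump is not a well-formed document on its own.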
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
26 Feb 2012 16:52:30 | 962683 | 14011287 | hadam3p_eu_9efk_1963_1_007738760_0 | 23,136 | 42,445 | 1.8346 |
18 Feb 2012 12:29:10 | 962683 | 14011287 | hadam3p_eu_9efk_1963_1_007738760_0 | 11,616 | 21,545 | 1.8548 |
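
The Average (sec/TS) column is consistent with cumulative CPU time divided by the timestep count. That relationship is inferred from the two rows above, not stated on the page. A minimal sketch that recomputes it, with the hypothetical helper `seconds_per_timestep`:

```python
# Rows copied from the trickle table: (time sent, timestep, CPU time in seconds, reported average).
TRICKLES = [
    ("26 Feb 2012 16:52:30", 23136, 42445, 1.8346),
    ("18 Feb 2012 12:29:10", 11616, 21545, 1.8548),
]


def seconds_per_timestep(cpu_seconds, timesteps):
    """Average CPU seconds per model timestep (assumed definition of the column)."""
    return cpu_seconds / timesteps


for sent, timesteps, cpu_seconds, reported in TRICKLES:
    computed = round(seconds_per_timestep(cpu_seconds, timesteps), 4)
    print(f"{sent}: computed {computed}, reported {reported}")

# Rate over the interval between the two trickles only (newer row minus older row).
newer, older = TRICKLES[0], TRICKLES[1]
interval_rate = seconds_per_timestep(newer[2] - older[2], newer[1] - older[1])
```

The interval rate isolates the stretch between the two reports, whereas the table's figure averages over the whole run up to each trickle.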