Name | hadam3p_pnw_q5fh_2013_1_010028874_0 |
Workunit | 10026936 |
Created | 23 Jul 2015, 15:55:20 UTC |
Sent | 28 Jul 2015, 5:48:18 UTC |
Report deadline | 9 Jul 2016, 11:08:18 UTC |
Received | 12 Aug 2015, 19:19:01 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 804413 |
Run time | 2 days 2 hours 17 min 22 sec |
CPU time | 2 days 2 hours 17 min 22 sec |
Validate state | Invalid |
Credit | 1,508.39 |
Device peak FLOPS | 2.99 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v7.27 windows_intelx86 |
Stderr | <core_client_version>6.2.19</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:55:33 (4996): No heartbeat from client for 30 sec - exiting 10:55:33 (4996): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:56:22 (4388): No heartbeat from client for 30 sec - exiting 10:56:22 (4388): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:57:12 (4548): No heartbeat from client for 30 sec - exiting 10:57:12 (4548): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:58:00 (5088): No heartbeat from client for 30 sec - exiting 10:58:00 (5088): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:58:49 (4908): No heartbeat from client for 30 sec - exiting 10:58:49 (4908): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:59:38 (3416): No heartbeat from client for 30 sec - exiting 10:59:38 (3416): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:00:27 (4268): No heartbeat from client for 30 sec - exiting 11:00:27 (4268): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:01:16 (224): No heartbeat from client for 30 sec - exiting 11:01:16 (224): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:02:05 (2220): No heartbeat from client for 30 sec - exiting 11:02:05 (2220): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:02:54 (3256): No heartbeat from client for 30 sec - exiting 11:02:54 (3256): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:03:43 (4364): No heartbeat from client for 30 sec - exiting 11:03:43 (4364): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:04:32 (2492): No heartbeat from client for 30 sec - exiting 11:04:32 (2492): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=4400, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4956, selfPID=4956, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4956, selfPID=4128, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 11:04:56 (4128): called boinc_finish(0) </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_13.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_14.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_15.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_16.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_17.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5fh_2013_1_010028874_0_18.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
12 Aug 2015 19:22:13 | 804413 | 18728820 | hadam3p_pnw_q5fh_2013_1_010028874_0 | 69,419 | 162,365 | 2.3389 |
11 Aug 2015 04:53:55 | 804413 | 18728820 | hadam3p_pnw_q5fh_2013_1_010028874_0 | 57,899 | 135,814 | 2.3457 |
10 Aug 2015 21:26:51 | 804413 | 18728820 | hadam3p_pnw_q5fh_2013_1_010028874_0 | 46,379 | 109,197 | 2.3544 |
10 Aug 2015 14:20:19 | 804413 | 18728820 | hadam3p_pnw_q5fh_2013_1_010028874_0 | 34,859 | 82,347 | 2.3623 |
09 Aug 2015 23:18:00 | 804413 | 18728820 | hadam3p_pnw_q5fh_2013_1_010028874_0 | 23,339 | 55,165 | 2.3636 |
09 Aug 2015 16:26:42 | 804413 | 18728820 | hadam3p_pnw_q5fh_2013_1_010028874_0 | 11,819 | 27,990 | 2.3682 |
©2024 cpdn.org