Name | hadam3p_pnw_avm3_1994_1_007880046_0 |
Workunit | 8035158 |
Created | 16 Apr 2012, 17:25:15 UTC |
Sent | 16 Apr 2012, 17:27:21 UTC |
Report deadline | 29 Mar 2013, 22:47:21 UTC |
Received | 19 May 2012, 13:46:22 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1175815 |
Run time | 5 days 0 hours 51 min 28 sec |
CPU time | 3 days 19 hours 4 min 12 sec |
Validate state | Invalid |
Credit | 2,254.93 |
Device peak FLOPS | 2.91 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3484, selfPID=3744, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 12:38:34 (1764): No heartbeat from core client for 30 sec - exiting 12:38:35 (1764): No heartbeat from core client for 30 sec - exiting 12:38:36 (1764): No heartbeat from core client for 30 sec - exiting 12:38:37 (1764): No heartbeat from core client for 30 sec - exiting 12:38:39 (1764): No heartbeat from core client for 30 sec - exiting 12:38:40 (1764): No heartbeat from core client for 30 sec - exiting 12:38:41 (1764): No heartbeat from core client for 30 sec - exiting 12:38:42 (1764): No heartbeat from core client for 30 sec - exiting 12:38:43 (1764): No heartbeat from core client for 30 sec - exiting 12:38:44 (1764): No heartbeat from core client for 30 sec - exiting 12:38:45 (1764): No heartbeat from core client for 30 sec - exiting 12:38:46 (1764): No heartbeat from core client for 30 sec - exiting 12:38:47 (1764): No heartbeat from core client for 30 sec - exiting 12:38:48 (1764): No heartbeat from core client for 30 sec - exiting 12:38:49 (1764): No heartbeat from core client for 30 sec - exiting 12:38:51 (1764): No heartbeat from core client for 30 sec - exiting 12:38:52 (1764): No heartbeat from core client for 30 sec - exiting 12:38:53 (1764): No heartbeat from core client for 30 sec - exiting 12:38:54 (1764): No heartbeat from core client for 30 sec - exiting 12:38:55 (1764): No heartbeat from core client for 30 sec - exiting 12:38:56 (1764): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:13:15 (2464): No heartbeat from core client for 30 sec - exiting 22:13:16 (2464): No heartbeat from core client for 30 sec - exiting 22:13:17 (2464): No heartbeat from core client for 30 sec - exiting 22:13:18 (2464): No heartbeat from core client for 30 sec - exiting 22:13:19 (2464): No heartbeat from core client for 30 sec - exiting 22:13:20 (2464): No heartbeat from core client for 30 sec - exiting 22:13:21 (2464): No heartbeat from core client for 30 sec - exiting 22:13:22 (2464): No heartbeat from core client for 30 sec - exiting 22:13:24 (2464): No heartbeat from core client for 30 sec - exiting 22:13:25 (2464): No heartbeat from core client for 30 sec - exiting 22:13:26 (2464): No heartbeat from core client for 30 sec - exiting 22:13:27 (2464): No heartbeat from core client for 30 sec - exiting 22:13:28 (2464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:31:26 (316): No heartbeat from core client for 30 sec - exiting 09:31:27 (316): No heartbeat from core client for 30 sec - exiting 09:31:29 (316): No heartbeat from core client for 30 sec - exiting 09:31:30 (316): No heartbeat from core client for 30 sec - exiting 09:31:31 (316): No heartbeat from core client for 30 sec - exiting 09:31:32 (316): No heartbeat from core client for 30 sec - exiting 09:31:33 (316): No heartbeat from core client for 30 sec - exiting 09:31:34 (316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:44:00 (3164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:56:00 (1608): No heartbeat from core client for 30 sec - exiting 18:56:01 (1608): No heartbeat from core client for 30 sec - exiting 18:56:02 (1608): No heartbeat from core client for 30 sec - exiting 18:56:03 (1608): No heartbeat from core client for 30 sec - exiting 18:56:05 (1608): No heartbeat from core client for 30 sec - exiting 18:56:06 (1608): No heartbeat from core client for 30 sec - exiting 18:56:07 (1608): No heartbeat from core client for 30 sec - exiting 18:56:08 (1608): No heartbeat from core client for 30 sec - exiting 18:56:09 (1608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:31:15 (3076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:50:08 (2112): No heartbeat from core client for 30 sec - exiting 12:50:09 (2112): No heartbeat from core client for 30 sec - exiting 12:50:10 (2112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:56:29 (1788): No heartbeat from core client for 30 sec - exiting 12:56:30 (1788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:47:26 (3284): No heartbeat from core client for 30 sec - exiting 10:47:27 (3284): No heartbeat from core client for 30 sec - exiting 10:47:28 (3284): No heartbeat from core client for 30 sec - exiting 10:47:29 (3284): No heartbeat from core client for 30 sec - exiting 10:47:30 (3284): No heartbeat from core client for 30 sec - exiting 10:47:32 (3284): No heartbeat from core client for 30 sec - exiting 10:47:33 (3284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:39:54 (3344): No heartbeat from core client for 30 sec - exiting 10:39:55 (3344): No heartbeat from core client for 30 sec - exiting 10:39:56 (3344): No heartbeat from core client for 30 sec - exiting 10:39:57 (3344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:45:10 (1388): No heartbeat from core client for 30 sec - exiting 19:45:11 (1388): No heartbeat from core client for 30 sec - exiting 19:45:13 (1388): No heartbeat from core client for 30 sec - exiting 19:45:14 (1388): No heartbeat from core client for 30 sec - exiting 19:45:15 (1388): No heartbeat from core client for 30 sec - exiting 19:45:16 (1388): No heartbeat from core client for 30 sec - exiting 19:45:17 (1388): No heartbeat from core client for 30 sec - exiting 19:45:18 (1388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:15:59 (3060): No heartbeat from core client for 30 sec - exiting 10:16:00 (3060): No heartbeat from core client for 30 sec - exiting 10:16:02 (3060): No heartbeat from core client for 30 sec - exiting 10:16:03 (3060): No heartbeat from core client for 30 sec - exiting 10:16:04 (3060): No heartbeat from core client for 30 sec - exiting 10:16:05 (3060): No heartbeat from core client for 30 sec - exiting 10:16:06 (3060): No heartbeat from core client for 30 sec - exiting 10:16:07 (3060): No heartbeat from core client for 30 sec - exiting 10:16:08 (3060): No heartbeat from core client for 30 sec - exiting 10:16:09 (3060): No heartbeat from core client for 30 sec - exiting 10:16:10 (3060): No heartbeat from core client for 30 sec - exiting 10:16:11 (3060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3244, selfPID=2924, iMonCtr=1 Model crash detected, will try to restart... 17:45:14 (3224): No heartbeat from core client for 30 sec - exiting 17:45:15 (3224): No heartbeat from core client for 30 sec - exiting 17:45:17 (3224): No heartbeat from core client for 30 sec - exiting 17:45:18 (3224): No heartbeat from core client for 30 sec - exiting 17:45:19 (3224): No heartbeat from core client for 30 sec - exiting 17:45:20 (3224): No heartbeat from core client for 30 sec - exiting 17:45:21 (3224): No heartbeat from core client for 30 sec - exiting 17:45:22 (3224): No heartbeat from core client for 30 sec - exiting 17:45:23 (3224): No heartbeat from core client for 30 sec - exiting 17:45:24 (3224): No heartbeat from core client for 30 sec - exiting 17:45:25 (3224): No heartbeat from core client for 30 sec - exiting 17:45:26 (3224): No heartbeat from core client for 30 sec - exiting 17:45:27 (3224): No heartbeat from core client for 30 sec - exiting Regional yearly means requires 12 input files got 9 CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048 Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048 Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 9 zip error: Could not create output file (was replacing the original zip file) Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_avm3_1994_1_007880046_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_avm3_1994_1_007880046_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_avm3_1994_1_007880046_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
18 May 2012 12:22:55 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 103,776 | 306,866 | 2.9570 |
16 May 2012 20:01:29 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 92,256 | 274,833 | 2.9790 |
16 May 2012 08:47:21 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 80,736 | 241,212 | 2.9877 |
12 May 2012 12:56:41 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 69,216 | 208,307 | 3.0095 |
05 May 2012 12:28:01 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 57,696 | 165,461 | 2.8678 |
03 May 2012 15:27:47 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 46,176 | 132,507 | 2.8696 |
02 May 2012 11:45:02 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 34,656 | 99,938 | 2.8837 |
01 May 2012 10:23:43 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 23,137 | 67,021 | 2.8967 |
30 Apr 2012 18:57:51 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 23,136 | 66,631 | 2.8800 |
27 Apr 2012 18:13:41 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 11,619 | 33,921 | 2.9194 |
27 Apr 2012 17:03:16 | 1175815 | 14414462 | hadam3p_pnw_avm3_1994_1_007880046_0 | 11,616 | 33,484 | 2.8826 |
©2024 cpdn.org