Name | hadam3p_pnw_qalp_2036_1_008369303_0 |
Workunit | 8520162 |
Created | 15 May 2013, 20:53:42 UTC |
Sent | 15 May 2013, 20:53:57 UTC |
Report deadline | 28 Apr 2014, 2:13:57 UTC |
Received | 1 Jun 2013, 2:04:51 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1264600 |
Run time | 2 days 18 hours 53 min 14 sec |
CPU time | 2 days 17 hours 6 min 32 sec |
Validate state | Invalid |
Credit | 2,004.61 |
Device peak FLOPS | 2.55 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> 19:37:37 (5192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:06:47 (1564): No heartbeat from core client for 30 sec - exiting 07:06:48 (1564): No heartbeat from core client for 30 sec - exiting 07:06:49 (1564): No heartbeat from core client for 30 sec - exiting 07:06:50 (1564): No heartbeat from core client for 30 sec - exiting 07:06:51 (1564): No heartbeat from core client for 30 sec - exiting 07:06:52 (1564): No heartbeat from core client for 30 sec - exiting 07:06:53 (1564): No heartbeat from core client for 30 sec - exiting 07:06:54 (1564): No heartbeat from core client for 30 sec - exiting 07:06:55 (1564): No heartbeat from core client for 30 sec - exiting 07:06:56 (1564): No heartbeat from core client for 30 sec - exiting 07:06:57 (1564): No heartbeat from core client for 30 sec - exiting 07:06:59 (1564): No heartbeat from core client for 30 sec - exiting 07:07:00 (1564): No heartbeat from core client for 30 sec - exiting 07:07:01 (1564): No heartbeat from core client for 30 sec - exiting 07:07:02 (1564): No heartbeat from core client for 30 sec - exiting 07:07:03 (1564): No heartbeat from core client for 30 sec - exiting 07:07:04 (1564): No heartbeat from core client for 30 sec - exiting 07:07:05 (1564): No heartbeat from core client for 30 sec - exiting 07:07:06 (1564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:11:44 (4508): No heartbeat from core client for 30 sec - exiting 13:11:45 (4508): No heartbeat from core client for 30 sec - exiting 13:11:46 (4508): No heartbeat from core client for 30 sec - exiting 13:11:47 (4508): No heartbeat from core client for 30 sec - exiting 13:11:48 (4508): No heartbeat from core client for 30 sec - exiting 13:11:49 (4508): No heartbeat from core client for 30 sec - exiting 13:11:50 (4508): No heartbeat from core client for 30 sec - exiting 13:11:51 (4508): No heartbeat from core client for 30 sec - exiting 13:11:52 (4508): No heartbeat from core client for 30 sec - exiting 13:11:53 (4508): No heartbeat from core client for 30 sec - exiting 13:11:54 (4508): No heartbeat from core client for 30 sec - exiting 13:11:56 (4508): No heartbeat from core client for 30 sec - exiting 13:11:57 (4508): No heartbeat from core client for 30 sec - exiting 13:11:58 (4508): No heartbeat from core client for 30 sec - exiting 13:11:59 (4508): No heartbeat from core client for 30 sec - exiting 13:12:00 (4508): No heartbeat from core client for 30 sec - exiting 13:12:01 (4508): No heartbeat from core client for 30 sec - exiting 13:12:02 (4508): No heartbeat from core client for 30 sec - exiting 13:12:03 (4508): No heartbeat from core client for 30 sec - exiting 13:12:04 (4508): No heartbeat from core client for 30 sec - exiting 13:12:05 (4508): No heartbeat from core client for 30 sec - exiting 13:12:06 (4508): No heartbeat from core client for 30 sec - exiting 13:12:08 (4508): No heartbeat from core client for 30 sec - exiting 13:12:09 (4508): No heartbeat from core client for 30 sec - exiting 13:12:10 (4508): No heartbeat from core client for 30 sec - exiting 13:12:11 (4508): No heartbeat from core client for 30 sec - exiting 13:12:12 (4508): No heartbeat from core client for 30 sec - exiting 13:12:13 (4508): No heartbeat from core client for 30 sec - exiting 13:12:14 (4508): No heartbeat from core client for 30 sec - exiting 13:12:15 (4508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5692, selfPID=4064, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3536, iMonCtr=2 19:20:05 (5824): No heartbeat from core client for 30 sec - exiting 19:20:06 (5824): No heartbeat from core client for 30 sec - exiting 19:20:07 (5824): No heartbeat from core client for 30 sec - exiting 19:20:08 (5824): No heartbeat from core client for 30 sec - exiting 19:20:09 (5824): No heartbeat from core client for 30 sec - exiting 19:20:10 (5824): No heartbeat from core client for 30 sec - exiting 19:20:11 (5824): No heartbeat from core client for 30 sec - exiting 19:20:12 (5824): No heartbeat from core client for 30 sec - exiting 19:20:13 (5824): No heartbeat from core client for 30 sec - exiting 19:20:14 (5824): No heartbeat from core client for 30 sec - exiting 19:20:16 (5824): No heartbeat from core client for 30 sec - exiting 19:20:17 (5824): No heartbeat from core client for 30 sec - exiting 19:20:18 (5824): No heartbeat from core client for 30 sec - exiting 19:20:19 (5824): No heartbeat from core client for 30 sec - exiting 19:20:20 (5824): No heartbeat from core client for 30 sec - exiting 19:20:21 (5824): No heartbeat from core client for 30 sec - exiting 19:20:22 (5824): No heartbeat from core client for 30 sec - exiting 19:20:23 (5824): No heartbeat from core client for 30 sec - exiting 19:20:24 (5824): No heartbeat from core client for 30 sec - exiting 19:20:25 (5824): No heartbeat from core client for 30 sec - exiting 19:20:27 (5824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5564, selfPID=4460, iMonCtr=1 Model crash detected, will try to restart... 13:24:09 (3960): No heartbeat from core client for 30 sec - exiting 13:24:11 (3960): No heartbeat from core client for 30 sec - exiting 13:24:12 (3960): No heartbeat from core client for 30 sec - exiting 13:23:59 (3960): No heartbeat from core client for 30 sec - exiting 13:24:00 (3960): No heartbeat from core client for 30 sec - exiting 13:24:01 (3960): No heartbeat from core client for 30 sec - exiting 13:24:02 (3960): No heartbeat from core client for 30 sec - exiting 13:24:03 (3960): No heartbeat from core client for 30 sec - exiting 13:24:05 (3960): No heartbeat from core client for 30 sec - exiting 13:24:06 (3960): No heartbeat from core client for 30 sec - exiting 13:24:07 (3960): No heartbeat from core client for 30 sec - exiting 13:24:08 (3960): No heartbeat from core client for 30 sec - exiting 13:24:09 (3960): No heartbeat from core client for 30 sec - exiting 13:24:10 (3960): No heartbeat from core client for 30 sec - exiting 13:24:11 (3960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6700, selfPID=3204, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6224, selfPID=4124, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7144, selfPID=5016, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 8 Called boinc_finish </stderr_txt><message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_qalp_2036_1_008369303_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_qalp_2036_1_008369303_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_qalp_2036_1_008369303_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_qalp_2036_1_008369303_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
29 May 2013 00:35:03 | 1264600 | 15785756 | hadam3p_pnw_qalp_2036_1_008369303_0 | 92,256 | 219,529 | 2.3796 |
28 May 2013 17:12:25 | 1264600 | 15785756 | hadam3p_pnw_qalp_2036_1_008369303_0 | 80,736 | 192,712 | 2.3869 |
27 May 2013 16:21:55 | 1264600 | 15785756 | hadam3p_pnw_qalp_2036_1_008369303_0 | 69,216 | 165,812 | 2.3956 |
27 May 2013 16:21:55 | 1264600 | 15785756 | hadam3p_pnw_qalp_2036_1_008369303_0 | 57,696 | 137,988 | 2.3916 |
22 May 2013 00:20:03 | 1264600 | 15785756 | hadam3p_pnw_qalp_2036_1_008369303_0 | 46,176 | 110,374 | 2.3903 |
19 May 2013 23:47:44 | 1264600 | 15785756 | hadam3p_pnw_qalp_2036_1_008369303_0 | 34,656 | 82,754 | 2.3879 |
18 May 2013 02:01:04 | 1264600 | 15785756 | hadam3p_pnw_qalp_2036_1_008369303_0 | 23,136 | 55,214 | 2.3865 |
16 May 2013 22:41:05 | 1264600 | 15785756 | hadam3p_pnw_qalp_2036_1_008369303_0 | 11,616 | 27,751 | 2.3890 |
©2024 cpdn.org