climateprediction.net home page
Task 15785756

Task 15785756

Name hadam3p_pnw_qalp_2036_1_008369303_0
Workunit 8520162
Created 15 May 2013, 20:53:42 UTC
Sent 15 May 2013, 20:53:57 UTC
Report deadline 28 Apr 2014, 2:13:57 UTC
Received 1 Jun 2013, 2:04:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1264600
Run time 2 days 18 hours 53 min 14 sec
CPU time 2 days 17 hours 6 min 32 sec
Validate state Invalid
Credit 2,004.61
Device peak FLOPS 2.55 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
19:37:37 (5192): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:06:47 (1564): No heartbeat from core client for 30 sec - exiting
07:06:48 (1564): No heartbeat from core client for 30 sec - exiting
07:06:49 (1564): No heartbeat from core client for 30 sec - exiting
07:06:50 (1564): No heartbeat from core client for 30 sec - exiting
07:06:51 (1564): No heartbeat from core client for 30 sec - exiting
07:06:52 (1564): No heartbeat from core client for 30 sec - exiting
07:06:53 (1564): No heartbeat from core client for 30 sec - exiting
07:06:54 (1564): No heartbeat from core client for 30 sec - exiting
07:06:55 (1564): No heartbeat from core client for 30 sec - exiting
07:06:56 (1564): No heartbeat from core client for 30 sec - exiting
07:06:57 (1564): No heartbeat from core client for 30 sec - exiting
07:06:59 (1564): No heartbeat from core client for 30 sec - exiting
07:07:00 (1564): No heartbeat from core client for 30 sec - exiting
07:07:01 (1564): No heartbeat from core client for 30 sec - exiting
07:07:02 (1564): No heartbeat from core client for 30 sec - exiting
07:07:03 (1564): No heartbeat from core client for 30 sec - exiting
07:07:04 (1564): No heartbeat from core client for 30 sec - exiting
07:07:05 (1564): No heartbeat from core client for 30 sec - exiting
07:07:06 (1564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:11:44 (4508): No heartbeat from core client for 30 sec - exiting
13:11:45 (4508): No heartbeat from core client for 30 sec - exiting
13:11:46 (4508): No heartbeat from core client for 30 sec - exiting
13:11:47 (4508): No heartbeat from core client for 30 sec - exiting
13:11:48 (4508): No heartbeat from core client for 30 sec - exiting
13:11:49 (4508): No heartbeat from core client for 30 sec - exiting
13:11:50 (4508): No heartbeat from core client for 30 sec - exiting
13:11:51 (4508): No heartbeat from core client for 30 sec - exiting
13:11:52 (4508): No heartbeat from core client for 30 sec - exiting
13:11:53 (4508): No heartbeat from core client for 30 sec - exiting
13:11:54 (4508): No heartbeat from core client for 30 sec - exiting
13:11:56 (4508): No heartbeat from core client for 30 sec - exiting
13:11:57 (4508): No heartbeat from core client for 30 sec - exiting
13:11:58 (4508): No heartbeat from core client for 30 sec - exiting
13:11:59 (4508): No heartbeat from core client for 30 sec - exiting
13:12:00 (4508): No heartbeat from core client for 30 sec - exiting
13:12:01 (4508): No heartbeat from core client for 30 sec - exiting
13:12:02 (4508): No heartbeat from core client for 30 sec - exiting
13:12:03 (4508): No heartbeat from core client for 30 sec - exiting
13:12:04 (4508): No heartbeat from core client for 30 sec - exiting
13:12:05 (4508): No heartbeat from core client for 30 sec - exiting
13:12:06 (4508): No heartbeat from core client for 30 sec - exiting
13:12:08 (4508): No heartbeat from core client for 30 sec - exiting
13:12:09 (4508): No heartbeat from core client for 30 sec - exiting
13:12:10 (4508): No heartbeat from core client for 30 sec - exiting
13:12:11 (4508): No heartbeat from core client for 30 sec - exiting
13:12:12 (4508): No heartbeat from core client for 30 sec - exiting
13:12:13 (4508): No heartbeat from core client for 30 sec - exiting
13:12:14 (4508): No heartbeat from core client for 30 sec - exiting
13:12:15 (4508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5692, selfPID=4064, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3536, iMonCtr=2
19:20:05 (5824): No heartbeat from core client for 30 sec - exiting
19:20:06 (5824): No heartbeat from core client for 30 sec - exiting
19:20:07 (5824): No heartbeat from core client for 30 sec - exiting
19:20:08 (5824): No heartbeat from core client for 30 sec - exiting
19:20:09 (5824): No heartbeat from core client for 30 sec - exiting
19:20:10 (5824): No heartbeat from core client for 30 sec - exiting
19:20:11 (5824): No heartbeat from core client for 30 sec - exiting
19:20:12 (5824): No heartbeat from core client for 30 sec - exiting
19:20:13 (5824): No heartbeat from core client for 30 sec - exiting
19:20:14 (5824): No heartbeat from core client for 30 sec - exiting
19:20:16 (5824): No heartbeat from core client for 30 sec - exiting
19:20:17 (5824): No heartbeat from core client for 30 sec - exiting
19:20:18 (5824): No heartbeat from core client for 30 sec - exiting
19:20:19 (5824): No heartbeat from core client for 30 sec - exiting
19:20:20 (5824): No heartbeat from core client for 30 sec - exiting
19:20:21 (5824): No heartbeat from core client for 30 sec - exiting
19:20:22 (5824): No heartbeat from core client for 30 sec - exiting
19:20:23 (5824): No heartbeat from core client for 30 sec - exiting
19:20:24 (5824): No heartbeat from core client for 30 sec - exiting
19:20:25 (5824): No heartbeat from core client for 30 sec - exiting
19:20:27 (5824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5564, selfPID=4460, iMonCtr=1
Model crash detected, will try to restart...
13:24:09 (3960): No heartbeat from core client for 30 sec - exiting
13:24:11 (3960): No heartbeat from core client for 30 sec - exiting
13:24:12 (3960): No heartbeat from core client for 30 sec - exiting
13:23:59 (3960): No heartbeat from core client for 30 sec - exiting
13:24:00 (3960): No heartbeat from core client for 30 sec - exiting
13:24:01 (3960): No heartbeat from core client for 30 sec - exiting
13:24:02 (3960): No heartbeat from core client for 30 sec - exiting
13:24:03 (3960): No heartbeat from core client for 30 sec - exiting
13:24:05 (3960): No heartbeat from core client for 30 sec - exiting
13:24:06 (3960): No heartbeat from core client for 30 sec - exiting
13:24:07 (3960): No heartbeat from core client for 30 sec - exiting
13:24:08 (3960): No heartbeat from core client for 30 sec - exiting
13:24:09 (3960): No heartbeat from core client for 30 sec - exiting
13:24:10 (3960): No heartbeat from core client for 30 sec - exiting
13:24:11 (3960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6700, selfPID=3204, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6224, selfPID=4124, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7144, selfPID=5016, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Called boinc_finish

</stderr_txt><message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_qalp_2036_1_008369303_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_qalp_2036_1_008369303_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_qalp_2036_1_008369303_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_qalp_2036_1_008369303_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 May 2013 00:35:03 1264600 15785756 hadam3p_pnw_qalp_2036_1_008369303_0 92,256 219,529 2.3796
28 May 2013 17:12:25 1264600 15785756 hadam3p_pnw_qalp_2036_1_008369303_0 80,736 192,712 2.3869
27 May 2013 16:21:55 1264600 15785756 hadam3p_pnw_qalp_2036_1_008369303_0 69,216 165,812 2.3956
27 May 2013 16:21:55 1264600 15785756 hadam3p_pnw_qalp_2036_1_008369303_0 57,696 137,988 2.3916
22 May 2013 00:20:03 1264600 15785756 hadam3p_pnw_qalp_2036_1_008369303_0 46,176 110,374 2.3903
19 May 2013 23:47:44 1264600 15785756 hadam3p_pnw_qalp_2036_1_008369303_0 34,656 82,754 2.3879
18 May 2013 02:01:04 1264600 15785756 hadam3p_pnw_qalp_2036_1_008369303_0 23,136 55,214 2.3865
16 May 2013 22:41:05 1264600 15785756 hadam3p_pnw_qalp_2036_1_008369303_0 11,616 27,751 2.3890


©2024 cpdn.org