climateprediction.net home page
Task 16326385

Task 16326385

Name hadam3p_eu_p0mn_2013_1_008543526_0
Workunit 8691038
Created 3 Mar 2014, 19:34:58 UTC
Sent 3 Mar 2014, 21:13:10 UTC
Report deadline 14 Feb 2015, 2:33:10 UTC
Received 8 Mar 2014, 22:52:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1291948
Run time 22 hours 59 min 40 sec
CPU time 11 hours 39 min 43 sec
Validate state Invalid
Credit 399.11
Device peak FLOPS 3.42 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.2.39</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6984, selfPID=6984, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2484, selfPID=4052, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
19:41:22 (3984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:02:40 (3160): No heartbeat from core client for 30 sec - exiting
11:02:41 (3160): No heartbeat from core client for 30 sec - exiting
11:02:42 (3160): No heartbeat from core client for 30 sec - exiting
11:02:43 (3160): No heartbeat from core client for 30 sec - exiting
11:02:44 (3160): No heartbeat from core client for 30 sec - exiting
11:02:45 (3160): No heartbeat from core client for 30 sec - exiting
11:02:46 (3160): No heartbeat from core client for 30 sec - exiting
11:02:47 (3160): No heartbeat from core client for 30 sec - exiting
11:02:49 (3160): No heartbeat from core client for 30 sec - exiting
11:02:50 (3160): No heartbeat from core client for 30 sec - exiting
11:02:51 (3160): No heartbeat from core client for 30 sec - exiting
11:02:52 (3160): No heartbeat from core client for 30 sec - exiting
11:02:53 (3160): No heartbeat from core client for 30 sec - exiting
11:02:54 (3160): No heartbeat from core client for 30 sec - exiting
11:02:55 (3160): No heartbeat from core client for 30 sec - exiting
11:02:56 (3160): No heartbeat from core client for 30 sec - exiting
11:02:57 (3160): No heartbeat from core client for 30 sec - exiting
11:02:58 (3160): No heartbeat from core client for 30 sec - exiting
11:03:00 (3160): No heartbeat from core client for 30 sec - exiting
11:03:01 (3160): No heartbeat from core client for 30 sec - exiting
11:03:02 (3160): No heartbeat from core client for 30 sec - exiting
11:03:03 (3160): No heartbeat from core client for 30 sec - exiting
11:03:04 (3160): No heartbeat from core client for 30 sec - exiting
11:03:05 (3160): No heartbeat from core client for 30 sec - exiting
11:03:06 (3160): No heartbeat from core client for 30 sec - exiting
11:03:07 (3160): No heartbeat from core client for 30 sec - exiting
11:03:08 (3160): No heartbeat from core client for 30 sec - exiting
11:03:09 (3160): No heartbeat from core client for 30 sec - exiting
11:03:10 (3160): No heartbeat from core client for 30 sec - exiting
11:03:12 (3160): No heartbeat from core client for 30 sec - exiting
11:03:13 (3160): No heartbeat from core client for 30 sec - exiting
11:03:14 (3160): No heartbeat from core client for 30 sec - exiting
11:03:15 (3160): No heartbeat from core client for 30 sec - exiting
11:03:16 (3160): No heartbeat from core client for 30 sec - exiting
11:03:17 (3160): No heartbeat from core client for 30 sec - exiting
11:03:18 (3160): No heartbeat from core client for 30 sec - exiting
11:03:19 (3160): No heartbeat from core client for 30 sec - exiting
11:03:20 (3160): No heartbeat from core client for 30 sec - exiting
11:03:22 (3160): No heartbeat from core client for 30 sec - exiting
11:03:23 (3160): No heartbeat from core client for 30 sec - exiting
11:03:24 (3160): No heartbeat from core client for 30 sec - exiting
11:03:25 (3160): No heartbeat from core client for 30 sec - exiting
11:03:26 (3160): No heartbeat from core client for 30 sec - exiting
11:03:27 (3160): No heartbeat from core client for 30 sec - exiting
11:03:28 (3160): No heartbeat from core client for 30 sec - exiting
11:03:29 (3160): No heartbeat from core client for 30 sec - exiting
11:03:30 (3160): No heartbeat from core client for 30 sec - exiting
11:03:32 (3160): No heartbeat from core client for 30 sec - exiting
11:03:33 (3160): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5724, selfPID=2880, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt><message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_p0mn_2013_1_008543526_0_3.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p0mn_2013_1_008543526_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p0mn_2013_1_008543526_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p0mn_2013_1_008543526_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p0mn_2013_1_008543526_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p0mn_2013_1_008543526_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p0mn_2013_1_008543526_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p0mn_2013_1_008543526_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p0mn_2013_1_008543526_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p0mn_2013_1_008543526_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Mar 2014 15:52:13 1291948 16326385 hadam3p_eu_p0mn_2013_1_008543526_0 23,136 29,099 1.2577
04 Mar 2014 19:54:32 1291948 16326385 hadam3p_eu_p0mn_2013_1_008543526_0 11,616 14,551 1.2527


©2024 cpdn.org