Task 15191672

Name hadam3p_eu_wzxm_1969_1_006909122_1
Workunit 7112438
Created 27 Aug 2012, 6:05:03 UTC
Sent 27 Aug 2012, 9:48:28 UTC
Report deadline 9 Aug 2013, 15:08:28 UTC
Received 29 Aug 2012, 13:34:24 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1224497
Run time 19 hours 21 min 28 sec
CPU time 19 hours 10 min 23 sec
Validate state Invalid
Credit 597.84
Device peak FLOPS 3.66 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4244, selfPID=3376, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5652, selfPID=4984, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:07:50 (4844): No heartbeat from core client for 30 sec - exiting
07:07:51 (4844): No heartbeat from core client for 30 sec - exiting
07:07:52 (4844): No heartbeat from core client for 30 sec - exiting
07:07:53 (4844): No heartbeat from core client for 30 sec - exiting
07:07:54 (4844): No heartbeat from core client for 30 sec - exiting
07:07:55 (4844): No heartbeat from core client for 30 sec - exiting
07:07:56 (4844): No heartbeat from core client for 30 sec - exiting
07:07:57 (4844): No heartbeat from core client for 30 sec - exiting
07:07:58 (4844): No heartbeat from core client for 30 sec - exiting
07:07:59 (4844): No heartbeat from core client for 30 sec - exiting
07:08:00 (4844): No heartbeat from core client for 30 sec - exiting
07:08:01 (4844): No heartbeat from core client for 30 sec - exiting
07:08:02 (4844): No heartbeat from core client for 30 sec - exiting
07:08:04 (4844): No heartbeat from core client for 30 sec - exiting
07:08:05 (4844): No heartbeat from core client for 30 sec - exiting
07:08:06 (4844): No heartbeat from core client for 30 sec - exiting
07:08:07 (4844): No heartbeat from core client for 30 sec - exiting
07:08:08 (4844): No heartbeat from core client for 30 sec - exiting
07:08:09 (4844): No heartbeat from core client for 30 sec - exiting
07:08:10 (4844): No heartbeat from core client for 30 sec - exiting
07:08:11 (4844): No heartbeat from core client for 30 sec - exiting
07:08:12 (4844): No heartbeat from core client for 30 sec - exiting
07:08:13 (4844): No heartbeat from core client for 30 sec - exiting
07:08:14 (4844): No heartbeat from core client for 30 sec - exiting
07:08:16 (4844): No heartbeat from core client for 30 sec - exiting
07:08:17 (4844): No heartbeat from core client for 30 sec - exiting
07:08:18 (4844): No heartbeat from core client for 30 sec - exiting
07:08:19 (4844): No heartbeat from core client for 30 sec - exiting
07:08:20 (4844): No heartbeat from core client for 30 sec - exiting
07:08:21 (4844): No heartbeat from core client for 30 sec - exiting
07:08:22 (4844): No heartbeat from core client for 30 sec - exiting
07:08:23 (4844): No heartbeat from core client for 30 sec - exiting
07:08:24 (4844): No heartbeat from core client for 30 sec - exiting
07:08:25 (4844): No heartbeat from core client for 30 sec - exiting
07:08:26 (4844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:24:10 (4628): No heartbeat from core client for 30 sec - exiting
09:24:11 (4628): No heartbeat from core client for 30 sec - exiting
09:24:12 (4628): No heartbeat from core client for 30 sec - exiting
09:24:13 (4628): No heartbeat from core client for 30 sec - exiting
09:24:14 (4628): No heartbeat from core client for 30 sec - exiting
09:24:16 (4628): No heartbeat from core client for 30 sec - exiting
09:24:17 (4628): No heartbeat from core client for 30 sec - exiting
09:24:18 (4628): No heartbeat from core client for 30 sec - exiting
09:24:19 (4628): No heartbeat from core client for 30 sec - exiting
09:24:20 (4628): No heartbeat from core client for 30 sec - exiting
09:24:21 (4628): No heartbeat from core client for 30 sec - exiting
09:24:22 (4628): No heartbeat from core client for 30 sec - exiting
09:24:23 (4628): No heartbeat from core client for 30 sec - exiting
09:24:24 (4628): No heartbeat from core client for 30 sec - exiting
09:24:25 (4628): No heartbeat from core client for 30 sec - exiting
09:24:26 (4628): No heartbeat from core client for 30 sec - exiting
09:24:27 (4628): No heartbeat from core client for 30 sec - exiting
09:24:28 (4628): No heartbeat from core client for 30 sec - exiting
09:24:29 (4628): No heartbeat from core client for 30 sec - exiting
09:24:30 (4628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:28:02 (4752): No heartbeat from core client for 30 sec - exiting
09:28:03 (4752): No heartbeat from core client for 30 sec - exiting
09:28:04 (4752): No heartbeat from core client for 30 sec - exiting
09:28:05 (4752): No heartbeat from core client for 30 sec - exiting
09:28:06 (4752): No heartbeat from core client for 30 sec - exiting
09:28:07 (4752): No heartbeat from core client for 30 sec - exiting
09:28:08 (4752): No heartbeat from core client for 30 sec - exiting
09:28:09 (4752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4912, selfPID=4912, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4912, selfPID=5532, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_wzxm_1969_1_006909122_1_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wzxm_1969_1_006909122_1_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wzxm_1969_1_006909122_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wzxm_1969_1_006909122_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wzxm_1969_1_006909122_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wzxm_1969_1_006909122_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wzxm_1969_1_006909122_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wzxm_1969_1_006909122_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wzxm_1969_1_006909122_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
29 Aug 2012 09:15:31  1224497  15191672   hadam3p_eu_wzxm_1969_1_006909122_1  34,656    60,743          1.7527
29 Aug 2012 04:04:18  1224497  15191672   hadam3p_eu_wzxm_1969_1_006909122_1  23,136    40,950          1.7700
28 Aug 2012 03:44:47  1224497  15191672   hadam3p_eu_wzxm_1969_1_006909122_1  11,620    20,307          1.7476
28 Aug 2012 01:24:13  1224497  15191672   hadam3p_eu_wzxm_1969_1_006909122_1  11,616    20,016          1.7231
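
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count reported in the same trickle; the figures in the table are consistent with that reading. A minimal Python sketch, assuming that interpretation (the `trickles` list and the `average_sec_per_ts` helper are illustrative names, not part of any CPDN or BOINC tool):

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
trickles = [
    (34656, 60743, 1.7527),
    (23136, 40950, 1.7700),
    (11620, 20307, 1.7476),
    (11616, 20016, 1.7231),
]


def average_sec_per_ts(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_ts(cpu_time_sec, timestep)
    print(f"{timestep:>6} TS  {cpu_time_sec:>6} s  "
          f"computed {computed:.4f}  reported {reported:.4f}")
```

Under this assumption each computed value agrees with the reported one to four decimal places, which suggests the column is a running average over the whole task rather than a per-interval rate.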

