Task 15266223

Name hadam3p_eu_83md_2006_1_008195411_0
Workunit 8350535
Created 11 Sep 2012, 7:26:02 UTC
Sent 11 Sep 2012, 17:44:33 UTC
Report deadline 24 Aug 2013, 23:04:33 UTC
Received 8 Oct 2012, 17:05:59 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1206204
Run time 1 days 0 hours 10 min 49 sec
CPU time 1 hours 29 min 21 sec
Validate state Invalid
Credit 399.25
Device peak FLOPS 2.65 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
14:15:03 (2804): No heartbeat from core client for 30 sec - exiting
14:15:04 (2804): No heartbeat from core client for 30 sec - exiting
14:15:05 (2804): No heartbeat from core client for 30 sec - exiting
14:15:06 (2804): No heartbeat from core client for 30 sec - exiting
14:15:07 (2804): No heartbeat from core client for 30 sec - exiting
14:15:08 (2804): No heartbeat from core client for 30 sec - exiting
14:15:09 (2804): No heartbeat from core client for 30 sec - exiting
14:15:10 (2804): No heartbeat from core client for 30 sec - exiting
14:15:11 (2804): No heartbeat from core client for 30 sec - exiting
14:15:12 (2804): No heartbeat from core client for 30 sec - exiting
14:15:13 (2804): No heartbeat from core client for 30 sec - exiting
14:15:14 (2804): No heartbeat from core client for 30 sec - exiting
14:15:15 (2804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:31:48 (5484): No heartbeat from core client for 30 sec - exiting
14:31:49 (5484): No heartbeat from core client for 30 sec - exiting
14:31:50 (5484): No heartbeat from core client for 30 sec - exiting
14:31:51 (5484): No heartbeat from core client for 30 sec - exiting
14:31:53 (5484): No heartbeat from core client for 30 sec - exiting
16:41:19 (3976): No heartbeat from core client for 30 sec - exiting
16:41:20 (3976): No heartbeat from core client for 30 sec - exiting
16:41:21 (3976): No heartbeat from core client for 30 sec - exiting
16:41:23 (3976): No heartbeat from core client for 30 sec - exiting
16:41:24 (3976): No heartbeat from core client for 30 sec - exiting
16:41:25 (3976): No heartbeat from core client for 30 sec - exiting
16:41:26 (3976): No heartbeat from core client for 30 sec - exiting
16:41:27 (3976): No heartbeat from core client for 30 sec - exiting
16:41:28 (3976): No heartbeat from core client for 30 sec - exiting
16:41:29 (3976): No heartbeat from core client for 30 sec - exiting
16:41:30 (3976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5560, selfPID=2040, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:34:53 (7184): No heartbeat from core client for 30 sec - exiting
18:34:54 (7184): No heartbeat from core client for 30 sec - exiting
18:34:55 (7184): No heartbeat from core client for 30 sec - exiting
18:34:56 (7184): No heartbeat from core client for 30 sec - exiting
18:34:57 (7184): No heartbeat from core client for 30 sec - exiting
18:34:58 (7184): No heartbeat from core client for 30 sec - exiting
18:34:59 (7184): No heartbeat from core client for 30 sec - exiting
18:35:00 (7184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4600, selfPID=4600, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8244, selfPID=8244, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048
Leaving CPDN_Main::Monitor...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
13:00:42 (6820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_83md_2006_1_008195411_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_83md_2006_1_008195411_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_83md_2006_1_008195411_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_83md_2006_1_008195411_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_83md_2006_1_008195411_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_83md_2006_1_008195411_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_83md_2006_1_008195411_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_83md_2006_1_008195411_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_83md_2006_1_008195411_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_83md_2006_1_008195411_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
02 Oct 2012 19:47:12  1206204  15266223   hadam3p_eu_83md_2006_1_008195411_0  23,144    64,977          2.8075
02 Oct 2012 18:45:58  1206204  15266223   hadam3p_eu_83md_2006_1_008195411_0  23,142    64,681          2.7950
02 Oct 2012 18:45:58  1206204  15266223   hadam3p_eu_83md_2006_1_008195411_0  23,140    64,364          2.7815
02 Oct 2012 17:52:04  1206204  15266223   hadam3p_eu_83md_2006_1_008195411_0  23,136    64,049          2.7684
23 Sep 2012 18:12:57  1206204  15266223   hadam3p_eu_83md_2006_1_008195411_0  11,616    34,583          2.9772
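
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. The following is a minimal sketch that checks this against the five rows shown; the helper name `average_sec_per_timestep` and the rounding to four decimal places are assumptions made for illustration, not part of the CPDN or BOINC software.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
TRICKLES = [
    (23144, 64977, 2.8075),
    (23142, 64681, 2.7950),
    (23140, 64364, 2.7815),
    (23136, 64049, 2.7684),
    (11616, 34583, 2.9772),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Hypothetical helper: CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


# Print the recomputed value next to the value reported on the page.
for timestep, cpu_time_sec, reported in TRICKLES:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f}")
```

If the assumption holds, the computed and reported values agree on every row.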

