climateprediction.net home page
Task 18347931

Task 18347931

Name hadam3p_anz_m78h_2012_1_009782342_0
Workunit 9838306
Created 24 Apr 2015, 17:04:12 UTC
Sent 2 May 2015, 17:32:49 UTC
Report deadline 13 Apr 2016, 22:52:49 UTC
Received 18 May 2015, 11:37:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1344064
Run time 4 days 22 hours 22 min 7 sec
CPU time 4 days 4 hours 21 min 35 sec
Validate state Invalid
Credit 2,000.18
Device peak FLOPS 3.48 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:55:11 (6376): No heartbeat from core client for 30 sec - exiting
09:55:12 (6376): No heartbeat from core client for 30 sec - exiting
09:55:13 (6376): No heartbeat from core client for 30 sec - exiting
09:55:14 (6376): No heartbeat from core client for 30 sec - exiting
09:55:15 (6376): No heartbeat from core client for 30 sec - exiting
09:55:16 (6376): No heartbeat from core client for 30 sec - exiting
09:55:17 (6376): No heartbeat from core client for 30 sec - exiting
09:55:18 (6376): No heartbeat from core client for 30 sec - exiting
09:55:19 (6376): No heartbeat from core client for 30 sec - exiting
09:55:20 (6376): No heartbeat from core client for 30 sec - exiting
09:55:21 (6376): No heartbeat from core client for 30 sec - exiting
09:55:22 (6376): No heartbeat from core client for 30 sec - exiting
09:55:23 (6376): No heartbeat from core client for 30 sec - exiting
09:55:24 (6376): No heartbeat from core client for 30 sec - exiting
09:55:25 (6376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:04:00 (6628): No heartbeat from core client for 30 sec - exiting
12:04:01 (6628): No heartbeat from core client for 30 sec - exiting
12:04:02 (6628): No heartbeat from core client for 30 sec - exiting
12:04:03 (6628): No heartbeat from core client for 30 sec - exiting
12:04:04 (6628): No heartbeat from core client for 30 sec - exiting
12:04:05 (6628): No heartbeat from core client for 30 sec - exiting
12:04:06 (6628): No heartbeat from core client for 30 sec - exiting
12:04:07 (6628): No heartbeat from core client for 30 sec - exiting
12:04:08 (6628): No heartbeat from core client for 30 sec - exiting
12:04:09 (6628): No heartbeat from core client for 30 sec - exiting
12:04:10 (6628): No heartbeat from core client for 30 sec - exiting
12:04:11 (6628): No heartbeat from core client for 30 sec - exiting
12:04:12 (6628): No heartbeat from core client for 30 sec - exiting
12:04:13 (6628): No heartbeat from core client for 30 sec - exiting
12:04:14 (6628): No heartbeat from core client for 30 sec - exiting
12:04:15 (6628): No heartbeat from core client for 30 sec - exiting
12:04:16 (6628): No heartbeat from core client for 30 sec - exiting
12:04:17 (6628): No heartbeat from core client for 30 sec - exiting
12:04:18 (6628): No heartbeat from core client for 30 sec - exiting
12:04:19 (6628): No heartbeat from core client for 30 sec - exiting

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
12:04:20 (6628): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7456, selfPID=6236, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_m78h_2012_1_009782342_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m78h_2012_1_009782342_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m78h_2012_1_009782342_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m78h_2012_1_009782342_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m78h_2012_1_009782342_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m78h_2012_1_009782342_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m78h_2012_1_009782342_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m78h_2012_1_009782342_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 May 2015 08:33:43 1344064 18347931 hadam3p_anz_m78h_2012_1_009782342_0 46,379 316,246 6.8187
10 May 2015 20:41:05 1344064 18347931 hadam3p_anz_m78h_2012_1_009782342_0 34,859 237,357 6.8091
08 May 2015 19:25:24 1344064 18347931 hadam3p_anz_m78h_2012_1_009782342_0 23,339 157,962 6.7682
08 May 2015 19:12:06 1344064 18347931 hadam3p_anz_m78h_2012_1_009782342_0 11,819 78,295 6.6245


©2024 climateprediction.net