climateprediction.net home page
Task 16631193

Task 16631193

Name hadam3p_anz_rdgg_2012_1_008747054_0
Workunit 8893032
Created 9 May 2014, 8:53:47 UTC
Sent 10 May 2014, 11:04:23 UTC
Report deadline 22 Apr 2015, 16:24:23 UTC
Received 2 Jun 2014, 12:05:25 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1208839
Run time 6 days 7 hours 41 min 57 sec
CPU time 5 days 8 hours 5 min 50 sec
Validate state Invalid
Credit 2,000.18
Device peak FLOPS 1.86 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:49:33 (5552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:11:26 (10912): No heartbeat from core client for 30 sec - exiting
17:11:27 (10912): No heartbeat from core client for 30 sec - exiting
17:11:28 (10912): No heartbeat from core client for 30 sec - exiting
17:11:29 (10912): No heartbeat from core client for 30 sec - exiting
17:11:30 (10912): No heartbeat from core client for 30 sec - exiting
17:11:31 (10912): No heartbeat from core client for 30 sec - exiting
17:11:32 (10912): No heartbeat from core client for 30 sec - exiting
17:11:33 (10912): No heartbeat from core client for 30 sec - exiting
17:11:34 (10912): No heartbeat from core client for 30 sec - exiting
17:11:35 (10912): No heartbeat from core client for 30 sec - exiting
17:11:36 (10912): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9880, selfPID=8264, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11184, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9484, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2280, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:58:38 (13316): No heartbeat from core client for 30 sec - exiting
09:58:39 (13316): No heartbeat from core client for 30 sec - exiting
09:58:41 (13316): No heartbeat from core client for 30 sec - exiting
09:58:42 (13316): No heartbeat from core client for 30 sec - exiting
09:58:43 (13316): No heartbeat from core client for 30 sec - exiting
09:58:44 (13316): No heartbeat from core client for 30 sec - exiting
09:58:45 (13316): No heartbeat from core client for 30 sec - exiting
09:58:46 (13316): No heartbeat from core client for 30 sec - exiting
09:58:47 (13316): No heartbeat from core client for 30 sec - exiting
09:58:48 (13316): No heartbeat from core client for 30 sec - exiting
09:58:49 (13316): No heartbeat from core client for 30 sec - exiting
09:58:50 (13316): No heartbeat from core client for 30 sec - exiting
09:58:51 (13316): No heartbeat from core client for 30 sec - exiting
09:58:52 (13316): No heartbeat from core client for 30 sec - exiting
09:58:53 (13316): No heartbeat from core client for 30 sec - exiting
09:58:54 (13316): No heartbeat from core client for 30 sec - exiting
09:58:55 (13316): No heartbeat from core client for 30 sec - exiting
09:58:56 (13316): No heartbeat from core client for 30 sec - exiting
09:58:57 (13316): No heartbeat from core client for 30 sec - exiting
09:58:58 (13316): No heartbeat from core client for 30 sec - exiting
09:58:59 (13316): No heartbeat from core client for 30 sec - exiting
09:59:00 (13316): No heartbeat from core client for 30 sec - exiting
09:59:01 (13316): No heartbeat from core client for 30 sec - exiting
09:59:02 (13316): No heartbeat from core client for 30 sec - exiting
09:59:03 (13316): No heartbeat from core client for 30 sec - exiting
09:59:04 (13316): No heartbeat from core client for 30 sec - exiting
09:59:05 (13316): No heartbeat from core client for 30 sec - exiting
09:59:06 (13316): No heartbeat from core client for 30 sec - exiting
09:59:07 (13316): No heartbeat from core client for 30 sec - exiting
09:59:08 (13316): No heartbeat from core client for 30 sec - exiting
09:59:09 (13316): No heartbeat from core client for 30 sec - exiting
09:59:10 (13316): No heartbeat from core client for 30 sec - exiting
09:59:11 (13316): No heartbeat from core client for 30 sec - exiting
09:59:12 (13316): No heartbeat from core client for 30 sec - exiting
09:59:13 (13316): No heartbeat from core client for 30 sec - exiting
09:59:14 (13316): No heartbeat from core client for 30 sec - exiting
09:59:15 (13316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11236, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9608, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8520, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1392, selfPID=1392, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1392, selfPID=7968, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_rdgg_2012_1_008747054_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rdgg_2012_1_008747054_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rdgg_2012_1_008747054_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rdgg_2012_1_008747054_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rdgg_2012_1_008747054_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rdgg_2012_1_008747054_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rdgg_2012_1_008747054_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rdgg_2012_1_008747054_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
30 May 2014 20:08:36 1208839 16631193 hadam3p_anz_rdgg_2012_1_008747054_0 46,379 418,372 9.0207
21 May 2014 16:32:58 1208839 16631193 hadam3p_anz_rdgg_2012_1_008747054_0 34,859 310,249 8.9001
15 May 2014 10:30:30 1208839 16631193 hadam3p_anz_rdgg_2012_1_008747054_0 23,339 206,090 8.8303
12 May 2014 18:22:50 1208839 16631193 hadam3p_anz_rdgg_2012_1_008747054_0 11,819 108,774 9.2033


©2024 cpdn.org