climateprediction.net home page
Task 16422493

Task 16422493

Name hadam3p_anz_n9lt_2012_1_008600065_2
Workunit 8746577
Created 27 Mar 2014, 16:55:48 UTC
Sent 27 Mar 2014, 17:03:49 UTC
Report deadline 9 Mar 2015, 22:23:49 UTC
Received 6 Jul 2014, 5:37:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1295575
Run time 3 days 19 hours 34 min 57 sec
CPU time 3 days 18 hours 24 min 29 sec
Validate state Invalid
Credit 3,490.64
Device peak FLOPS 3.50 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4176, selfPID=4632, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=836, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2468, selfPID=4640, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2484, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4256, selfPID=4564, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5920, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3196, selfPID=4564, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5824, selfPID=4548, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4680, selfPID=4536, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1488, selfPID=4556, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5184, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5584, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4308, selfPID=4524, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4608, selfPID=4580, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5144, selfPID=5144, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2100, selfPID=4528, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6292, selfPID=4524, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6416, selfPID=4532, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6692, selfPID=6692, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3672, iMonCtr=2
Model crash detected, will try to restart...
CSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5664, selfPID=4704, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=5268, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2544, selfPID=4692, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_n9lt_2012_1_008600065_2_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n9lt_2012_1_008600065_2_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n9lt_2012_1_008600065_2_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n9lt_2012_1_008600065_2_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n9lt_2012_1_008600065_2_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Jul 2014 11:31:46 1295575 16422493 hadam3p_anz_n9lt_2012_1_008600065_2 80,939 286,992 3.5458
02 Jul 2014 08:30:41 1295575 16422493 hadam3p_anz_n9lt_2012_1_008600065_2 69,419 250,621 3.6103
22 Apr 2014 17:41:49 1295575 16422493 hadam3p_anz_n9lt_2012_1_008600065_2 57,899 218,160 3.7679
12 Apr 2014 09:10:43 1295575 16422493 hadam3p_anz_n9lt_2012_1_008600065_2 46,379 175,351 3.7808
06 Apr 2014 14:37:56 1295575 16422493 hadam3p_anz_n9lt_2012_1_008600065_2 34,859 130,597 3.7464
04 Apr 2014 06:04:23 1295575 16422493 hadam3p_anz_n9lt_2012_1_008600065_2 23,339 86,593 3.7102
02 Apr 2014 05:56:28 1295575 16422493 hadam3p_anz_n9lt_2012_1_008600065_2 11,819 43,742 3.7010


©2024 climateprediction.net