climateprediction.net home page
Task 18311419

Task 18311419

Name hadam3p_anz_d51i_2013_1_009723314_1
Workunit 9796611
Created 16 Apr 2015, 17:38:10 UTC
Sent 16 Apr 2015, 17:56:18 UTC
Report deadline 28 Mar 2016, 23:16:18 UTC
Received 31 May 2015, 15:02:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1221356
Run time 10 days 18 hours 33 min 51 sec
CPU time 9 days 7 hours 45 min 6 sec
Validate state Invalid
Credit 5,477.92
Device peak FLOPS 2.81 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5824, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=656, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4692, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5140, selfPID=5512, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4780, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2672, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5896, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6724, selfPID=6228, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6944, selfPID=6500, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5148, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3580, selfPID=5716, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5576, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6424, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6508, selfPID=6044, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5364, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6800, selfPID=4288, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6868, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6924, selfPID=2576, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5320, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=880, selfPID=3632, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5912, selfPID=5152, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5516, selfPID=5700, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6468, selfPID=5484, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2320, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
20:52:35 (7080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7032, selfPID=7032, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5588, iMonCtr=2
Model crash detected, will try to restart...
Global WorGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5868, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5204, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5724, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5020, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1028, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5604, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4080, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=664, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3440, selfPID=1600, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4876, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3124, selfPID=5712, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4244, selfPID=4244, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=4188, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8056, selfPID=7436, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_d51i_2013_1_009723314_1_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 May 2015 18:56:11 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 127,019 745,447 5.8688
24 May 2015 11:13:05 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 115,499 677,897 5.8693
22 May 2015 08:20:44 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 103,979 610,355 5.8700
18 May 2015 15:57:38 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 92,459 542,949 5.8723
14 May 2015 20:54:10 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 80,939 475,633 5.8764
10 May 2015 18:50:48 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 69,419 407,729 5.8734
08 May 2015 19:15:36 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 57,899 339,852 5.8697
08 May 2015 19:06:37 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 46,379 271,903 5.8626
29 Apr 2015 18:27:12 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 34,859 204,363 5.8626
26 Apr 2015 12:16:13 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 23,339 136,688 5.8566
23 Apr 2015 22:52:38 1221356 18311419 hadam3p_anz_d51i_2013_1_009723314_1 11,819 69,198 5.8548


©2024 climateprediction.net