Task 12796373


Name hadam3p_saf_0wcu_1989_1_006875110_1
Workunit 7078426
Created 13 Apr 2011, 2:27:51 UTC
Sent 13 Apr 2011, 4:53:27 UTC
Report deadline 25 Mar 2012, 10:13:27 UTC
Received 24 Apr 2011, 17:23:39 UTC
Server state Over
Outcome Didn't need
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1143106
Run time 1 days 13 hours 32 min 31 sec
CPU time 1 days 8 hours 17 min 1 sec
Validate state Invalid
Credit 562.38
Device peak FLOPS 2.44 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>6.6.28</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5968, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2140, selfPID=4132, iMonCtr=1
Model crash detected, will try to restart...
16:21:53 (4316): No heartbeat from core client for 30 sec - exiting
16:21:54 (4316): No heartbeat from core client for 30 sec - exiting
16:21:55 (4316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6040, selfPID=4980, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4488, selfPID=6012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1748, selfPID=764, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4516, selfPID=4308, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=500, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5580, selfPID=5700, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4596, selfPID=2172, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4708, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4848, selfPID=4444, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5952, selfPID=2648, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5836, selfPID=5960, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3460, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3276, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4408, selfPID=2528, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_0wcu_1989_1_006875110_1_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0wcu_1989_1_006875110_1_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0wcu_1989_1_006875110_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0wcu_1989_1_006875110_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0wcu_1989_1_006875110_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0wcu_1989_1_006875110_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0wcu_1989_1_006875110_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0wcu_1989_1_006875110_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0wcu_1989_1_006875110_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
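The stderr output above repeats a small number of event types, so a tally is easier to read than the raw log. The following is a minimal sketch, not part of the original page: the function name `summarize` and the shortened `SAMPLE` text (a few lines copied from the log) are illustrative assumptions. Error code -161 is assumed here to be BOINC's ERR_NOT_FOUND (file not found); check it against the BOINC error numbers before relying on it.

```python
import re
from collections import Counter

# A short excerpt copied from the stderr output above (not the full log).
SAMPLE = """\
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5968, iMonCtr=2
Model crash detected, will try to restart...
16:21:53 (4316): No heartbeat from core client for 30 sec - exiting
<file_xfer_error>
  <file_name>hadam3p_saf_0wcu_1989_1_006875110_1_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
"""

def summarize(text):
    """Count recurring log events and list (file_name, error_code) pairs."""
    counts = Counter()
    for line in text.splitlines():
        if "Model crash detected" in line:
            counts["restarts"] += 1
        elif "No heartbeat from core client" in line:
            counts["heartbeat_timeouts"] += 1
        elif "Quit request from BOINC" in line:
            counts["quit_requests"] += 1
    # Pair each failed upload's file name with the error code that follows it.
    failed = re.findall(
        r"<file_name>(.*?)</file_name>\s*<error_code>(-?\d+)</error_code>",
        text,
    )
    return counts, failed

counts, failed = summarize(SAMPLE)
print(dict(counts))
print(failed)
```

Running the same function over the full stderr text would give the total number of restart attempts and the complete list of result files that failed to transfer.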
Latest Trickles Received
Time Sent (UTC)        Host ID   Result ID   Result Name                           Timestep   CPU Time (sec)   Average (sec/TS)
23 Apr 2011 04:40:22   1143106   12796373    hadam3p_saf_0wcu_1989_1_006875110_1   34,668     101,594          2.9305
23 Apr 2011 04:20:09   1143106   12796373    hadam3p_saf_0wcu_1989_1_006875110_1   34,656     101,104          2.9174
21 Apr 2011 07:20:44   1143106   12796373    hadam3p_saf_0wcu_1989_1_006875110_1   23,142     67,737           2.9270
20 Apr 2011 18:44:59   1143106   12796373    hadam3p_saf_0wcu_1989_1_006875110_1   23,136     67,177           2.9036
20 Apr 2011 18:44:59   1143106   12796373    hadam3p_saf_0wcu_1989_1_006875110_1   11,616     33,656           2.8974
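The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that relationship against the five rows shown. The helper name `average_sec_per_timestep` is hypothetical, and the formula is inferred from the table values, not taken from any CPDN documentation.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (34668, 101594, 2.9305),
    (34656, 101104, 2.9174),
    (23142, 67737, 2.9270),
    (23136, 67177, 2.9036),
    (11616, 33656, 2.8974),
]

def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds per model timestep, to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)

for timestep, cpu_time_sec, reported in TRICKLES:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Flag any row where the assumed formula disagrees with the reported value.
    status = "ok" if abs(computed - reported) < 1e-4 else "MISMATCH"
    print(f"{timestep:>6} {cpu_time_sec:>7} {computed:.4f} {reported:.4f} {status}")
```

All five rows agree with the reported averages to four decimal places, which supports the reading that the column is a running average, not a per-trickle rate.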

