climateprediction.net home page
Task 11879529

Task 11879529

Name hadam3p_eu_vak9_1969_1_006724206_0
Workunit 6927456
Created 10 Sep 2010, 7:35:25 UTC
Sent 10 Sep 2010, 19:42:20 UTC
Report deadline 24 Aug 2011, 1:02:20 UTC
Received 24 Sep 2010, 10:09:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1099533
Run time 2 days 15 hours 22 min 8 sec
CPU time 2 days 3 hours 8 min 56 sec
Validate state Invalid
Credit 598.07
Device peak FLOPS 2.23 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.05
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4300, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3260, selfPID=4520, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4212, selfPID=4800, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3544, selfPID=1204, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6268, selfPID=3812, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2916, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3880, selfPID=1560, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6456, selfPID=1688, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5756, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4024, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
10:56:48 (4796): No heartbeat from core client for 30 sec - exiting
10:56:49 (4796): No heartbeat from core client for 30 sec - exiting
10:56:50 (4796): No heartbeat from core client for 30 sec - exiting
10:56:51 (4796): No heartbeat from core client for 30 sec - exiting
10:56:52 (4796): No heartbeat from core client for 30 sec - exiting
10:56:53 (4796): No heartbeat from core client for 30 sec - exiting
10:56:54 (4796): No heartbeat from core client for 30 sec - exiting
10:56:55 (4796): No heartbeat from core client for 30 sec - exiting
10:56:56 (4796): No heartbeat from core client for 30 sec - exiting
10:56:57 (4796): No heartbeat from core client for 30 sec - exiting
10:56:58 (4796): No heartbeat from core client for 30 sec - exiting
10:56:59 (4796): No heartbeat from core client for 30 sec - exiting
10:57:00 (4796): No heartbeat from core client for 30 sec - exiting
10:57:01 (4796): No heartbeat from core client for 30 sec - exiting
10:57:02 (4796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=872, selfPID=3568, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3780, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5544, selfPID=5240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5512, selfPID=4676, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6068, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4968, selfPID=4384, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5900, selfPID=5664, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_vak9_1969_1_006724206_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_vak9_1969_1_006724206_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_vak9_1969_1_006724206_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_vak9_1969_1_006724206_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_vak9_1969_1_006724206_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_vak9_1969_1_006724206_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_vak9_1969_1_006724206_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_vak9_1969_1_006724206_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_vak9_1969_1_006724206_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Sep 2010 22:54:43 1099533 11879529 hadam3p_eu_vak9_1969_1_006724206_0 34,669 149,510 4.3125
20 Sep 2010 17:58:53 1099533 11879529 hadam3p_eu_vak9_1969_1_006724206_0 34,656 148,747 4.2921
18 Sep 2010 23:47:56 1099533 11879529 hadam3p_eu_vak9_1969_1_006724206_0 23,136 98,837 4.2720
15 Sep 2010 19:17:21 1099533 11879529 hadam3p_eu_vak9_1969_1_006724206_0 11,616 46,542 4.0067


©2024 climateprediction.net