climateprediction.net home page
Task 14446014

Task 14446014

Name hadam3p_pnw_bg5z_1990_1_007907254_0
Workunit 8062366
Created 17 Apr 2012, 18:33:01 UTC
Sent 11 May 2012, 11:55:50 UTC
Report deadline 23 Apr 2013, 17:15:50 UTC
Received 31 May 2012, 22:14:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1196548
Run time 1 days 22 hours 19 min 22 sec
CPU time 16 hours 18 min 11 sec
Validate state Invalid
Credit 502.72
Device peak FLOPS 2.99 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7808, selfPID=696, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5512, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6884, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3608, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6472, selfPID=7408, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2572, selfPID=5252, iMonCtr=1
Model crash detected, will try to restart...
12:25:29 (6040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4960, selfPID=3392, iMonCtr=1
Model crash detected, will try to restart...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4656, selfPID=4656, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4656, selfPID=8088, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
00:59:12 (8088): No heartbeat from core client for 30 sec - exiting
Called boinc_finish
00:59:13 (8088): No heartbeat from core client for 30 sec - exiting
00:59:14 (8088): No heartbeat from core client for 30 sec - exiting
16:47:00 (5392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:59:46 (2356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4700, iMonCtr=2
Model crash detected, will try to restart...
01:11:53 (3088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_bg5z_1990_1_007907254_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bg5z_1990_1_007907254_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bg5z_1990_1_007907254_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bg5z_1990_1_007907254_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bg5z_1990_1_007907254_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bg5z_1990_1_007907254_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bg5z_1990_1_007907254_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bg5z_1990_1_007907254_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bg5z_1990_1_007907254_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bg5z_1990_1_007907254_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 May 2012 09:07:06 1196548 14446014 hadam3p_pnw_bg5z_1990_1_007907254_0 23,136 54,257 2.3451
17 May 2012 08:13:42 1196548 14446014 hadam3p_pnw_bg5z_1990_1_007907254_0 11,616 27,355 2.3549


©2024 cpdn.org