climateprediction.net home page
Task 17549972

Name hadam3p_pnw_hd1j_2011_1_009287661_0
Workunit 9371849
Created 4 Dec 2014, 11:39:57 UTC
Sent 4 Dec 2014, 12:34:53 UTC
Report deadline 16 Nov 2015, 17:54:53 UTC
Received 29 Dec 2014, 11:43:27 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1246680
Run time 1 days 9 hours 23 min 31 sec
CPU time 20 hours 7 min 1 sec
Validate state Invalid
Credit 507.13
Device peak FLOPS 2.52 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v7.22
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4704, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5292, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5824, selfPID=4784, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2572, iMonCtr=2
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4720, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4992, selfPID=4420, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5100, iMonCtr=2t
roller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN p process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=r7ocesis nCtr=2
 running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=7568, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3956, selfPID=7040, iMonCtr=1
Model crash detected, will try to restart...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3956, selfPID=3956, iMonCtr=2
Leaving CPDN_Main::Monitor...
19:52:49 (7040): called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_hd1j_2011_1_009287661_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd1j_2011_1_009287661_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd1j_2011_1_009287661_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd1j_2011_1_009287661_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd1j_2011_1_009287661_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd1j_2011_1_009287661_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd1j_2011_1_009287661_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd1j_2011_1_009287661_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd1j_2011_1_009287661_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd1j_2011_1_009287661_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
15 Dec 2014 11:25:02 | 1246680 | 17549972 | hadam3p_pnw_hd1j_2011_1_009287661_0 | 23,339 | 57,539 | 2.4654
09 Dec 2014 12:48:42 | 1246680 | 17549972 | hadam3p_pnw_hd1j_2011_1_009287661_0 | 11,819 | 28,442 | 2.4065
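
The Average (sec/TS) column in the trickle rows appears to be the CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against the two rows listed here. The function name `average_sec_per_timestep` and the rounding rule are assumptions made for illustration; they are not taken from the CPDN server code.

```python
# Trickle rows copied from the listing: (timestep, cpu_time_sec, reported_average)
trickles = [
    (23339, 57539, 2.4654),
    (11819, 28442, 2.4065),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Hypothetical helper: CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the value shown on the page
    print(f"timestep={timestep} computed={computed} reported={reported}")
```

Under this assumption both recomputed values agree with the reported column, which suggests the model was advancing at a steady rate of roughly 2.4 CPU seconds per timestep up to the last trickle.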