climateprediction.net home page
Task 14590061

Task 14590061

Name hadam3p_pnw_c9mp_1976_1_007945440_1
Workunit 8100552
Created 26 Apr 2012, 0:07:58 UTC
Sent 26 Apr 2012, 0:12:45 UTC
Report deadline 8 Apr 2013, 5:32:45 UTC
Received 16 May 2012, 23:01:17 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1149107
Run time 2 days 1 hours 52 min 40 sec
CPU time 2 days 0 hours 16 min 28 sec
Validate state Invalid
Credit 2,004.61
Device peak FLOPS 4.35 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4792, selfPID=3276, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4320, selfPID=4320, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4744, selfPID=4256, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3340, selfPID=3340, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5024, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4892, selfPID=3844, iMonCtr=1
Model crash detected, will try to restart...
22:31:51 (792): No heartbeat from core client for 30 sec - exiting
22:31:52 (792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4556, selfPID=4488, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3036, selfPID=3808, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3400, selfPID=4572, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3476, selfPID=4404, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4612, selfPID=3848, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4452, selfPID=4800, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4012, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4708, iMonCtr=2
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_c9mp_1976_1_007945440_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_c9mp_1976_1_007945440_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_c9mp_1976_1_007945440_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_c9mp_1976_1_007945440_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 May 2012 03:26:57 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 92,256 157,107 1.7029
11 May 2012 21:16:22 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 80,736 137,473 1.7027
10 May 2012 01:06:32 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 69,225 119,013 1.7192
10 May 2012 00:05:26 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 69,216 118,790 1.7162
06 May 2012 23:00:53 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 57,704 98,348 1.7044
06 May 2012 21:59:41 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 57,696 98,070 1.6998
04 May 2012 11:58:53 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 46,176 78,153 1.6925
02 May 2012 12:41:08 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 34,665 58,542 1.6888
02 May 2012 11:39:58 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 34,656 58,311 1.6826
27 Apr 2012 01:12:11 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 23,136 39,004 1.6859
26 Apr 2012 14:18:47 1149107 14590061 hadam3p_pnw_c9mp_1976_1_007945440_1 11,616 19,655 1.6921


©2024 cpdn.org