climateprediction.net home page
Task 15141043

Task 15141043

Name hadam3p_saf_1h0x_1992_1_006959497_1
Workunit 7162813
Created 18 Aug 2012, 22:38:14 UTC
Sent 18 Aug 2012, 23:38:20 UTC
Report deadline 1 Aug 2013, 4:58:20 UTC
Received 11 Sep 2012, 23:16:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1228315
Run time 5 days 11 hours 13 min 32 sec
CPU time 2 days 1 hours 40 min 9 sec
Validate state Invalid
Credit 1,496.58
Device peak FLOPS 2.68 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3896, selfPID=3896, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5572, selfPID=3716, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5440, selfPID=5440, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
09:46:56 (4092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6128, selfPID=3368, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5672, selfPID=4892, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4420, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2936, selfPID=3672, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4792, selfPID=4792, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5580, selfPID=3820, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5160, selfPID=2156, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2052, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4552, selfPID=2420, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Glontrobler:: CPDN procel Wosksr:i CPDN process  n not ununningexitinging, bRetVal = 1,, ccehePcD=0,D=e0,P selfP96,=3M35Ct, 2i
Monel r=ash
 detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:11:14 (4452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3964, selfPID=4848, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:52:11 (3636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3368, iMonCtr=2
Model crash detected, will try to restart...

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_saf_1h0x_1992_1_006959497_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1h0x_1992_1_006959497_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1h0x_1992_1_006959497_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1h0x_1992_1_006959497_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
01 Sep 2012 05:59:52 1228315 15141043 hadam3p_saf_1h0x_1992_1_006959497_1 92,256 269,602 2.9223
30 Aug 2012 04:34:58 1228315 15141043 hadam3p_saf_1h0x_1992_1_006959497_1 80,736 236,455 2.9287
29 Aug 2012 01:53:42 1228315 15141043 hadam3p_saf_1h0x_1992_1_006959497_1 69,216 202,727 2.9289
27 Aug 2012 07:11:01 1228315 15141043 hadam3p_saf_1h0x_1992_1_006959497_1 57,696 170,647 2.9577
25 Aug 2012 04:28:53 1228315 15141043 hadam3p_saf_1h0x_1992_1_006959497_1 46,176 137,458 2.9768
23 Aug 2012 07:01:05 1228315 15141043 hadam3p_saf_1h0x_1992_1_006959497_1 34,656 103,308 2.9810
22 Aug 2012 03:10:05 1228315 15141043 hadam3p_saf_1h0x_1992_1_006959497_1 23,136 68,621 2.9660
20 Aug 2012 06:34:02 1228315 15141043 hadam3p_saf_1h0x_1992_1_006959497_1 11,619 34,352 2.9565
20 Aug 2012 06:34:02 1228315 15141043 hadam3p_saf_1h0x_1992_1_006959497_1 11,616 33,924 2.9205


©2024 cpdn.org