climateprediction.net home page
Task 14680623

Task 14680623

Name hadam3p_eu_d0bn_2001_1_007966629_0
Workunit 8121743
Created 16 May 2012, 15:39:19 UTC
Sent 3 Jun 2012, 21:55:10 UTC
Report deadline 17 May 2013, 3:15:10 UTC
Received 3 Jul 2012, 18:13:57 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1153648
Run time 2 days 9 hours 25 min 33 sec
CPU time 2 days 6 hours 17 min 26 sec
Validate state Invalid
Credit 995.30
Device peak FLOPS 1.87 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
21:17:13 (4236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:28:02 (3340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3860, selfPID=5068, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6736, selfPID=1004, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4676, iMonCtr=2
Model crash detected, will try to restart...
13:20:11 (4712): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5636, selfPID=5636, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3068, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5312, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2312, selfPID=3908, iMonCtr=1
Model crash detected, will try to restart...
22:48:00 (2268): No heartbeat from core client for 30 sec - exiting
22:48:01 (2268): No heartbeat from core client for 30 sec - exiting
22:48:02 (2268): No heartbeat from core client for 30 sec - exiting
22:48:03 (2268): No heartbeat from core client for 30 sec - exiting
22:48:04 (2268): No heartbeat from core client for 30 sec - exiting
22:48:05 (2268): No heartbeat from core client for 30 sec - exiting
22:48:06 (2268): No heartbeat from core client for 30 sec - exiting
22:48:07 (2268): No heartbeat from core client for 30 sec - exiting
22:48:08 (2268): No heartbeat from core client for 30 sec - exiting
22:48:09 (2268): No heartbeat from core client for 30 sec - exiting
22:48:10 (2268): No heartbeat from core client for 30 sec - exiting
22:48:11 (2268): No heartbeat from core client for 30 sec - exiting
22:48:12 (2268): No heartbeat from core client for 30 sec - exiting
22:48:13 (2268): No heartbeat from core client for 30 sec - exiting
22:48:14 (2268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:53:27 (3488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4084, selfPID=4084, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6140, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4988, selfPID=3740, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5716, selfPID=3796, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4724, selfPID=3964, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_d0bn_2001_1_007966629_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_d0bn_2001_1_007966629_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_d0bn_2001_1_007966629_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_d0bn_2001_1_007966629_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_d0bn_2001_1_007966629_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_d0bn_2001_1_007966629_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_d0bn_2001_1_007966629_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Jun 2012 11:21:22 1153648 14680623 hadam3p_eu_d0bn_2001_1_007966629_0 57,696 174,624 3.0266
27 Jun 2012 10:04:13 1153648 14680623 hadam3p_eu_d0bn_2001_1_007966629_0 46,176 139,609 3.0234
26 Jun 2012 11:48:40 1153648 14680623 hadam3p_eu_d0bn_2001_1_007966629_0 34,656 104,752 3.0226
25 Jun 2012 12:54:43 1153648 14680623 hadam3p_eu_d0bn_2001_1_007966629_0 23,136 69,586 3.0077
18 Jun 2012 14:57:06 1153648 14680623 hadam3p_eu_d0bn_2001_1_007966629_0 11,616 34,501 2.9701


©2024 cpdn.org