climateprediction.net home page
Task 15300965

Task 15300965

Name hadam3p_eu_w98e_1973_1_006807318_1
Workunit 7010634
Created 22 Sep 2012, 13:14:12 UTC
Sent 23 Sep 2012, 15:24:34 UTC
Report deadline 5 Sep 2013, 20:44:34 UTC
Received 21 Mar 2013, 22:48:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 967258
Run time 7 days 6 hours 54 min 59 sec
CPU time
Validate state Invalid
Credit 1,392.75
Device peak FLOPS 1.91 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:29:33 (7792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8092, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6216, selfPID=6216, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:17:13 (7232): No heartbeat from core client for 30 sec - exiting
14:17:15 (7232): No heartbeat from core client for 30 sec - exiting
14:17:16 (7232): No heartbeat from core client for 30 sec - exiting
14:17:17 (7232): No heartbeat from core client for 30 sec - exiting
14:17:18 (7232): No heartbeat from core client for 30 sec - exiting
14:17:19 (7232): No heartbeat from core client for 30 sec - exiting
14:17:20 (7232): No heartbeat from core client for 30 sec - exiting
14:17:21 (7232): No heartbeat from core client for 30 sec - exiting
14:17:22 (7232): No heartbeat from core client for 30 sec - exiting
14:17:23 (7232): No heartbeat from core client for 30 sec - exiting
14:17:24 (7232): No heartbeat from core client for 30 sec - exiting
14:17:25 (7232): No heartbeat from core client for 30 sec - exiting
14:17:27 (7232): No heartbeat from core client for 30 sec - exiting
14:17:28 (7232): No heartbeat from core client for 30 sec - exiting
14:17:29 (7232): No heartbeat from core client for 30 sec - exiting
14:17:30 (7232): No heartbeat from core client for 30 sec - exiting
14:17:31 (7232): No heartbeat from core client for 30 sec - exiting
14:17:32 (7232): No heartbeat from core client for 30 sec - exiting
14:17:33 (7232): No heartbeat from core client for 30 sec - exiting
14:17:34 (7232): No heartbeat from core client for 30 sec - exiting
14:17:35 (7232): No heartbeat from core client for 30 sec - exiting
14:17:36 (7232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6680, selfPID=6680, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5544, selfPID=5544, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
14:52:27 (6484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6668, selfPID=6668, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6668, selfPID=5428, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
17:16:52 (5428): No heartbeat from core client for 30 sec - exiting
17:16:53 (5428): No heartbeat from core client for 30 sec - exiting
17:16:54 (5428): No heartbeat from core client for 30 sec - exiting
17:16:55 (5428): No heartbeat from core client for 30 sec - exiting
17:16:57 (5428): No heartbeat from core client for 30 sec - exiting
17:16:58 (5428): No heartbeat from core client for 30 sec - exiting
17:16:59 (5428): No heartbeat from core client for 30 sec - exiting
17:17:00 (5428): No heartbeat from core client for 30 sec - exiting
17:17:01 (5428): No heartbeat from core client for 30 sec - exiting
17:17:02 (5428): No heartbeat from core client for 30 sec - exiting
17:17:03 (5428): No heartbeat from core client for 30 sec - exiting
17:17:04 (5428): No heartbeat from core client for 30 sec - exiting
17:17:05 (5428): No heartbeat from core client for 30 sec - exiting
17:17:06 (5428): No heartbeat from core client for 30 sec - exiting
17:17:07 (5428): No heartbeat from core client for 30 sec - exiting
17:17:08 (5428): No heartbeat from core client for 30 sec - exiting
17:17:10 (5428): No heartbeat from core client for 30 sec - exiting
17:17:11 (5428): No heartbeat from core client for 30 sec - exiting
17:17:12 (5428): No heartbeat from core client for 30 sec - exiting
Called boinc_finish
17:17:17 (5428): No heartbeat from core client for 30 sec - exiting
17:17:18 (5428): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4660, selfPID=4660, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6916, selfPID=5268, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=544, selfPID=6012, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
21:46:58 (3384): No heartbeat from core client for 30 sec - exiting
21:46:59 (3384): No heartbeat from core client for 30 sec - exiting
21:47:00 (3384): No heartbeat from core client for 30 sec - exiting
21:47:01 (3384): No heartbeat from core client for 30 sec - exiting
21:47:02 (3384): No heartbeat from core client for 30 sec - exiting
21:47:03 (3384): No heartbeat from core client for 30 sec - exiting
21:47:04 (3384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:47:06 (3384): No heartbeat from core client for 30 sec - exiting
21:47:07 (3384): No heartbeat from core client for 30 sec - exiting
21:47:08 (3384): No heartbeat from core client for 30 sec - exiting

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_w98e_1973_1_006807318_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_w98e_1973_1_006807318_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_w98e_1973_1_006807318_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_w98e_1973_1_006807318_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_w98e_1973_1_006807318_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
25 Feb 2013 19:33:51 967258 15300965 hadam3p_eu_w98e_1973_1_006807318_1 80,736 455,020 5.6359
04 Feb 2013 12:45:14 967258 15300965 hadam3p_eu_w98e_1973_1_006807318_1 69,216 384,898 5.5608
22 Jan 2013 13:50:24 967258 15300965 hadam3p_eu_w98e_1973_1_006807318_1 57,696 320,219 5.5501
16 Jan 2013 16:51:31 967258 15300965 hadam3p_eu_w98e_1973_1_006807318_1 46,176 261,541 5.6640
10 Jan 2013 15:20:41 967258 15300965 hadam3p_eu_w98e_1973_1_006807318_1 34,656 218,026 6.2911
20 Oct 2012 15:47:28 967258 15300965 hadam3p_eu_w98e_1973_1_006807318_1 23,136 144,342 6.2388
30 Sep 2012 14:52:40 967258 15300965 hadam3p_eu_w98e_1973_1_006807318_1 11,616 71,457 6.1516


©2024 climateprediction.net