climateprediction.net home page
Task 13260199

Task 13260199

Name hadam3p_saf_2ial_1975_1_007406607_2
Workunit 7604037
Created 16 Aug 2011, 0:35:52 UTC
Sent 16 Aug 2011, 1:05:32 UTC
Report deadline 28 Jul 2012, 6:25:32 UTC
Received 26 Aug 2011, 1:21:10 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 890420
Run time 3 days 4 hours 56 min 19 sec
CPU time 2 days 20 hours 47 min 28 sec
Validate state Invalid
Credit 1,309.70
Device peak FLOPS 2.31 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4100, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4236, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4288, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3072, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
09:18:55 (5164): No heartbeat from core client for 30 sec - exiting
09:18:56 (5164): No heartbeat from core client for 30 sec - exiting
09:18:57 (5164): No heartbeat from core client for 30 sec - exiting
09:18:58 (5164): No heartbeat from core client for 30 sec - exiting
09:18:59 (5164): No heartbeat from core client for 30 sec - exiting
09:19:00 (5164): No heartbeat from core client for 30 sec - exiting
09:19:01 (5164): No heartbeat from core client for 30 sec - exiting
09:19:02 (5164): No heartbeat from core client for 30 sec - exiting
09:19:03 (5164): No heartbeat from core client for 30 sec - exiting
09:19:04 (5164): No heartbeat from core client for 30 sec - exiting
09:19:05 (5164): No heartbeat from core client for 30 sec - exiting
09:19:06 (5164): No heartbeat from core client for 30 sec - exiting
09:19:07 (5164): No heartbeat from core client for 30 sec - exiting
09:19:08 (5164): No heartbeat from core client for 30 sec - exiting
09:19:09 (5164): No heartbeat from core client for 30 sec - exiting
09:19:10 (5164): No heartbeat from core client for 30 sec - exiting
09:19:11 (5164): No heartbeat from core client for 30 sec - exiting
09:19:12 (5164): No heartbeat from core client for 30 sec - exiting
09:19:13 (5164): No heartbeat from core client for 30 sec - exiting
09:19:14 (5164): No heartbeat from core client for 30 sec - exiting
09:19:15 (5164): No heartbeat from core client for 30 sec - exiting
09:19:16 (5164): No heartbeat from core client for 30 sec - exiting
09:19:17 (5164): No heartbeat from core client for 30 sec - exiting
09:19:18 (5164): No heartbeat from core client for 30 sec - exiting
09:19:19 (5164): No heartbeat from core client for 30 sec - exiting
09:19:20 (5164): No heartbeat from core client for 30 sec - exiting
09:19:21 (5164): No heartbeat from core client for 30 sec - exiting
09:19:22 (5164): No heartbeat from core client for 30 sec - exiting
09:19:23 (5164): No heartbeat from core client for 30 sec - exiting
09:19:24 (5164): No heartbeat from core client for 30 sec - exiting
09:19:25 (5164): No heartbeat from core client for 30 sec - exiting
09:19:26 (5164): No heartbeat from core client for 30 sec - exiting
09:19:27 (5164): No heartbeat from core client for 30 sec - exiting
09:19:28 (5164): No heartbeat from core client for 30 sec - exiting
09:19:29 (5164): No heartbeat from core client for 30 sec - exiting
09:19:30 (5164): No heartbeat from core client for 30 sec - exiting
09:19:31 (5164): No heartbeat from core client for 30 sec - exiting
09:19:32 (5164): No heartbeat from core client for 30 sec - exiting
09:19:33 (5164): No heartbeat from core client for 30 sec - exiting
09:19:34 (5164): No heartbeat from core client for 30 sec - exiting
09:19:35 (5164): No heartbeat from core client for 30 sec - exiting
09:19:36 (5164): No heartbeat from core client for 30 sec - exiting
09:19:37 (5164): No heartbeat from core client for 30 sec - exiting
09:19:38 (5164): No heartbeat from core client for 30 sec - exiting
09:19:39 (5164): No heartbeat from core client for 30 sec - exiting
09:19:40 (5164): No heartbeat from core client for 30 sec - exiting
09:19:41 (5164): No heartbeat from core client for 30 sec - exiting
09:19:42 (5164): No heartbeat from core client for 30 sec - exiting
09:19:43 (5164): No heartbeat from core client for 30 sec - exiting
09:19:44 (5164): No heartbeat from core client for 30 sec - exiting
09:19:45 (5164): No heartbeat from core client for 30 sec - exiting
09:19:46 (5164): No heartbeat from core client for 30 sec - exiting
09:19:47 (5164): No heartbeat from core client for 30 sec - exiting
09:19:48 (5164): No heartbeat from core client for 30 sec - exiting
09:19:49 (5164): No heartbeat from core client for 30 sec - exiting
09:19:50 (5164): No heartbeat from core client for 30 sec - exiting
09:19:51 (5164): No heartbeat from core client for 30 sec - exiting
09:19:52 (5164): No heartbeat from core client for 30 sec - exiting
09:19:53 (5164): No heartbeat from core client for 30 sec - exiting
09:19:54 (5164): No heartbeat from core client for 30 sec - exiting
09:19:55 (5164): No heartbeat from core client for 30 sec - exiting
09:19:56 (5164): No heartbeat from core client for 30 sec - exiting
09:19:57 (5164): No heartbeat from core client for 30 sec - exiting
09:19:58 (5164): No heartbeat from core client for 30 sec - exiting
09:19:59 (5164): No heartbeat from core client for 30 sec - exiting
09:20:00 (5164): No heartbeat from core client for 30 sec - exiting
09:20:01 (5164): No heartbeat from core client for 30 sec - exiting
09:20:02 (5164): No heartbeat from core client for 30 sec - exiting
09:20:03 (5164): No heartbeat from core client for 30 sec - exiting
09:20:04 (5164): No heartbeat from core client for 30 sec - exiting
09:20:05 (5164): No heartbeat from core client for 30 sec - exiting
09:20:06 (5164): No heartbeat from core client for 30 sec - exiting
09:20:07 (5164): No heartbeat from core client for 30 sec - exiting
09:20:08 (5164): No heartbeat from core client for 30 sec - exiting
09:20:09 (5164): No heartbeat from core client for 30 sec - exiting
09:20:10 (5164): No heartbeat from core client for 30 sec - exiting
09:20:11 (5164): No heartbeat from core client for 30 sec - exiting
09:20:12 (5164): No heartbeat from core client for 30 sec - exiting
09:20:13 (5164): No heartbeat from core client for 30 sec - exiting
09:20:14 (5164): No heartbeat from core client for 30 sec - exiting
09:20:15 (5164): No heartbeat from core client for 30 sec - exiting
09:20:16 (5164): No heartbeat from core client for 30 sec - exiting
09:20:17 (5164): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4248, iMonCtr=2
Model crash detected, will try to restart...
09:14:20 (4992): No heartbeat from core client for 30 sec - exiting
09:14:21 (4992): No heartbeat from core client for 30 sec - exiting
09:14:22 (4992): No heartbeat from core client for 30 sec - exiting
09:14:23 (4992): No heartbeat from core client for 30 sec - exiting
09:14:24 (4992): No heartbeat from core client for 30 sec - exiting
09:14:25 (4992): No heartbeat from core client for 30 sec - exiting
09:14:26 (4992): No heartbeat from core client for 30 sec - exiting
09:14:27 (4992): No heartbeat from core client for 30 sec - exiting
09:14:28 (4992): No heartbeat from core client for 30 sec - exiting
09:14:29 (4992): No heartbeat from core client for 30 sec - exiting
09:14:30 (4992): No heartbeat from core client for 30 sec - exiting
09:14:31 (4992): No heartbeat from core client for 30 sec - exiting
09:14:32 (4992): No heartbeat from core client for 30 sec - exiting
09:14:33 (4992): No heartbeat from core client for 30 sec - exiting
09:14:34 (4992): No heartbeat from core client for 30 sec - exiting
09:14:35 (4992): No heartbeat from core client for 30 sec - exiting
09:14:36 (4992): No heartbeat from core client for 30 sec - exiting
09:14:37 (4992): No heartbeat from core client for 30 sec - exiting
09:14:38 (4992): No heartbeat from core client for 30 sec - exiting
09:14:39 (4992): No heartbeat from core client for 30 sec - exiting
09:14:40 (4992): No heartbeat from core client for 30 sec - exiting
09:14:41 (4992): No heartbeat from core client for 30 sec - exiting
09:14:42 (4992): No heartbeat from core client for 30 sec - exiting
09:14:43 (4992): No heartbeat from core client for 30 sec - exiting
09:14:44 (4992): No heartbeat from core client for 30 sec - exiting
09:14:45 (4992): No heartbeat from core client for 30 sec - exiting
09:14:46 (4992): No heartbeat from core client for 30 sec - exiting
09:14:47 (4992): No heartbeat from core client for 30 sec - exiting
09:14:48 (4992): No heartbeat from core client for 30 sec - exiting
09:14:49 (4992): No heartbeat from core client for 30 sec - exiting
09:14:50 (4992): No heartbeat from core client for 30 sec - exiting
09:14:51 (4992): No heartbeat from core client for 30 sec - exiting
09:14:52 (4992): No heartbeat from core client for 30 sec - exiting
09:14:53 (4992): No heartbeat from core client for 30 sec - exiting
09:14:54 (4992): No heartbeat from core client for 30 sec - exiting
09:14:55 (4992): No heartbeat from core client for 30 sec - exiting
09:14:56 (4992): No heartbeat from core client for 30 sec - exiting
09:14:57 (4992): No heartbeat from core client for 30 sec - exiting
09:14:58 (4992): No heartbeat from core client for 30 sec - exiting
09:14:59 (4992): No heartbeat from core client for 30 sec - exiting
09:15:00 (4992): No heartbeat from core client for 30 sec - exiting
09:15:01 (4992): No heartbeat from core client for 30 sec - exiting
09:15:02 (4992): No heartbeat from core client for 30 sec - exiting
09:15:03 (4992): No heartbeat from core client for 30 sec - exiting
09:15:04 (4992): No heartbeat from core client for 30 sec - exiting
09:15:05 (4992): No heartbeat from core client for 30 sec - exiting
09:15:06 (4992): No heartbeat from core client for 30 sec - exiting
09:15:07 (4992): No heartbeat from core client for 30 sec - exiting
09:15:08 (4992): No heartbeat from core client for 30 sec - exiting
09:15:09 (4992): No heartbeat from core client for 30 sec - exiting
09:15:10 (4992): No heartbeat from core client for 30 sec - exiting
09:15:11 (4992): No heartbeat from core client for 30 sec - exiting
09:15:12 (4992): No heartbeat from core client for 30 sec - exiting
09:15:13 (4992): No heartbeat from core client for 30 sec - exiting
09:15:14 (4992): No heartbeat from core client for 30 sec - exiting
09:15:15 (4992): No heartbeat from core client for 30 sec - exiting
09:15:16 (4992): No heartbeat from core client for 30 sec - exiting
09:15:17 (4992): No heartbeat from core client for 30 sec - exiting
09:15:18 (4992): No heartbeat from core client for 30 sec - exiting
09:15:19 (4992): No heartbeat from core client for 30 sec - exiting
09:15:20 (4992): No heartbeat from core client for 30 sec - exiting
09:15:21 (4992): No heartbeat from core client for 30 sec - exiting
09:15:22 (4992): No heartbeat from core client for 30 sec - exiting
09:15:23 (4992): No heartbeat from core client for 30 sec - exiting
09:15:24 (4992): No heartbeat from core client for 30 sec - exiting
09:15:25 (4992): No heartbeat from core client for 30 sec - exiting
09:15:26 (4992): No heartbeat from core client for 30 sec - exiting
09:15:27 (4992): No heartbeat from core client for 30 sec - exiting
09:15:28 (4992): No heartbeat from core client for 30 sec - exiting
09:15:29 (4992): No heartbeat from core client for 30 sec - exiting
09:15:30 (4992): No heartbeat from core client for 30 sec - exiting
09:15:31 (4992): No heartbeat from core client for 30 sec - exiting
09:15:32 (4992): No heartbeat from core client for 30 sec - exiting
09:15:33 (4992): No heartbeat from core client for 30 sec - exiting
09:15:34 (4992): No heartbeat from core client for 30 sec - exiting
09:15:35 (4992): No heartbeat from core client for 30 sec - exiting
09:15:36 (4992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:15:37 (4992): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4280, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5812, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_2ial_1975_1_007406607_2_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2ial_1975_1_007406607_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2ial_1975_1_007406607_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2ial_1975_1_007406607_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2ial_1975_1_007406607_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
25 Aug 2011 04:41:11 890420 13260199 hadam3p_saf_2ial_1975_1_007406607_2 80,736 227,438 2.8171
24 Aug 2011 04:47:40 890420 13260199 hadam3p_saf_2ial_1975_1_007406607_2 69,216 195,328 2.8220
23 Aug 2011 04:49:41 890420 13260199 hadam3p_saf_2ial_1975_1_007406607_2 57,696 163,172 2.8281
22 Aug 2011 03:37:30 890420 13260199 hadam3p_saf_2ial_1975_1_007406607_2 46,176 130,597 2.8282
19 Aug 2011 03:10:26 890420 13260199 hadam3p_saf_2ial_1975_1_007406607_2 34,656 98,065 2.8297
18 Aug 2011 01:51:41 890420 13260199 hadam3p_saf_2ial_1975_1_007406607_2 23,136 65,410 2.8272
17 Aug 2011 01:28:57 890420 13260199 hadam3p_saf_2ial_1975_1_007406607_2 11,616 32,982 2.8394


©2024 climateprediction.net