Task 12982665

Name hadam3p_eu_2tc2_1971_1_007297024_0
Workunit 7494448
Created 15 Jun 2011, 20:48:25 UTC
Sent 15 Jun 2011, 20:48:33 UTC
Report deadline 28 May 2012, 2:08:33 UTC
Received 7 Jul 2011, 18:13:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1114687
Run time 3 days 14 hours 6 min 55 sec
CPU time 2 hours 46 min
Validate state Invalid
Credit 2,187.67
Device peak FLOPS 3.04 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6136, selfPID=2264, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5440, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5360, selfPID=5632, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
11:25:43 (5300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:08:32 (5968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4352, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5408, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7104, selfPID=6720, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5448, iMonCtr=2
Model crash detected, will try to restart...
09:35:47 (1664): No heartbeat from core client for 30 sec - exiting
09:35:48 (1664): No heartbeat from core client for 30 sec - exiting
09:35:49 (1664): No heartbeat from core client for 30 sec - exiting
09:35:50 (1664): No heartbeat from core client for 30 sec - exiting
09:35:51 (1664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:57:48 (5256): No heartbeat from core client for 30 sec - exiting
13:57:50 (5256): No heartbeat from core client for 30 sec - exiting
13:57:51 (5256): No heartbeat from core client for 30 sec - exiting
13:57:52 (5256): No heartbeat from core client for 30 sec - exiting
13:57:53 (5256): No heartbeat from core client for 30 sec - exiting
13:57:54 (5256): No heartbeat from core client for 30 sec - exiting
13:57:55 (5256): No heartbeat from core client for 30 sec - exiting
13:57:56 (5256): No heartbeat from core client for 30 sec - exiting
13:57:57 (5256): No heartbeat from core client for 30 sec - exiting
13:57:58 (5256): No heartbeat from core client for 30 sec - exiting
13:57:59 (5256): No heartbeat from core client for 30 sec - exiting
13:58:00 (5256): No heartbeat from core client for 30 sec - exiting
13:58:01 (5256): No heartbeat from core client for 30 sec - exiting
13:58:02 (5256): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5356, selfPID=4516, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4520, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5408, selfPID=5740, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2560, selfPID=264, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4672, selfPID=3932, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5172, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5788, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6592, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5272, selfPID=4040, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5608, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5580, selfPID=5212, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2220, selfPID=4920, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_2tc2_1971_1_007297024_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
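A stderr log like the one above is easier to read once its repeated messages are tallied. The following is a minimal sketch, not part of BOINC or the CPDN application: the substrings are copied from the log, while the category labels and the `tally_events` helper are hypothetical names chosen for illustration.

```python
from collections import Counter

# Substrings copied from the stderr text; the labels on the left are
# hypothetical names used only in this sketch.
MARKERS = {
    "model_crash": "Model crash detected",
    "no_heartbeat": "No heartbeat from core client",
    "suspend": "Suspend request from BOINC",
    "finish": "Called boinc_finish",
}


def tally_events(lines):
    """Count how many lines contain each marker substring."""
    counts = Counter()
    for line in lines:
        for label, marker in MARKERS.items():
            if marker in line:
                counts[label] += 1
    return counts


# A short excerpt of lines taken from the log, not the full output.
sample = [
    "Model crash detected, will try to restart...",
    "11:25:43 (5300): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "Called boinc_finish",
]
counts = tally_events(sample)
print(dict(counts))
```

Running the same function over the full stderr text would show how often the model restarted compared with how often the BOINC client stopped sending heartbeats.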
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
07 Jul 2011 18:20:28  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  126,816   281,192         2.2173
04 Jul 2011 19:21:21  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  115,296   253,264         2.1966
02 Jul 2011 21:33:46  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  103,776   225,776         2.1756
02 Jul 2011 11:32:33  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  92,256    194,591         2.1093
30 Jun 2011 20:27:58  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  80,736    162,387         2.0113
29 Jun 2011 13:31:18  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  69,216    130,137         1.8802
26 Jun 2011 20:45:01  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  57,696    98,731          1.7112
26 Jun 2011 14:41:35  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  46,176    76,150          1.6491
26 Jun 2011 09:34:48  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  34,656    57,200          1.6505
24 Jun 2011 22:53:17  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  23,137    37,976          1.6414
24 Jun 2011 21:32:33  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  23,136    37,740          1.6312
24 Jun 2011 12:53:36  1114687  12982665   hadam3p_eu_2tc2_1971_1_007297024_0  11,616    18,742          1.6135
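The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is an assumption inferred from the numbers. A small sketch checking it against the earliest and latest trickles, with `seconds_per_timestep` as a hypothetical helper name:

```python
# (timestep, cumulative CPU seconds) pairs copied from the earliest and
# latest rows of the trickle table.
trickles = [(11_616, 18_742), (126_816, 281_192)]


def seconds_per_timestep(timestep, cpu_seconds):
    """Assumed formula: cumulative CPU seconds / timesteps, to 4 decimals."""
    return round(cpu_seconds / timestep, 4)


averages = [seconds_per_timestep(ts, cpu) for ts, cpu in trickles]
print(averages)
```

Under that assumption the two results match the table's 1.6135 and 2.2173, so the per-timestep cost rose by roughly a third over the course of the run.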

