climateprediction.net home page
Task 12503260

Task 12503260

Name hadam3p_eu_xti0_1983_1_007024240_2
Workunit 7227556
Created 18 Jan 2011, 13:26:47 UTC
Sent 18 Jan 2011, 18:00:24 UTC
Report deadline 31 Dec 2011, 23:20:24 UTC
Received 11 Feb 2011, 21:44:02 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1108766
Run time 4 days 20 hours 13 min 51 sec
CPU time 3 days 3 hours 7 min 38 sec
Validate state Invalid
Credit 1,792.85
Device peak FLOPS 2.66 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.08
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7912, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6784, selfPID=6300, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller::17:07:55 (5392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5604, selfPID=6432, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4176, selfPID=5676, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6816, selfPID=6816, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3720, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6324, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6316, selfPID=6064, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7380, selfPID=7380, iMonCtr=2
12:55:38 (8148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4964, selfPID=7800, iMonCtr=1
Model crash detected, will try to restart...
19:42:49 (6204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:48:07 (6504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:52:59 (8080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:02:22 (6064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:27:30 (2880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7056, selfPID=5212, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:16:22 (8388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:24:25 (7692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:52:49 (7696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:23:52 (7052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:28:01 (756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:31:21 (7348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6428, selfPID=6428, iMonCtr=2
14:39:19 (3992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:42:34 (5824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:48:31 (7092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5524, selfPID=5524, iMonCtr=2
14:53:40 (7404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Re14:59:46 (7052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:02:27 (3708): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:04:55 (6396): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:10:19 (7496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:18:21 (4908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:24:12 (6604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:31:36 (7992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:50:14 (5596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
15:53:41 (4256): No heartbeat from core client for 30 sec - exiting
16:09:54 (2904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:12:55 (780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:35:44 (1132): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:44:11 (7592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:44:12 (7592): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3532, selfPID=3532, iMonCtr=2
17:21:12 (7308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:47:10 (5752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:47:11 (5752): No heartbeat from core client for 30 sec - exiting
21:44:25 (7776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
15:51:51 (3608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:09:00 (7760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:10:59 (6384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Re16:40:06 (7332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
12:04:47 (6952): No heartbeat from core client for 30 sec - exiting
12:05:05 (6952): No heartbeat from core client for 30 sec - exiting
12:05:06 (6952): No heartbeat from core client for 30 sec - exiting
12:05:07 (6952): No heartbeat from core client for 30 sec - exiting
12:05:08 (6952): No heartbeat from core client for 30 sec - exiting
12:05:09 (6952): No heartbeat from core client for 30 sec - exiting
12:05:10 (6952): No heartbeat from core client for 30 sec - exiting
12:05:11 (6952): No heartbeat from core client for 30 sec - exiting
12:05:12 (6952): No heartbeat from core client for 30 sec - exiting
12:05:13 (6952): No heartbeat from core client for 30 sec - exiting
12:05:14 (6952): No heartbeat from core client for 30 sec - exiting
12:05:16 (6952): No heartbeat from core client for 30 sec - exiting
12:05:17 (6952): No heartbeat from core client for 30 sec - exiting
12:05:18 (6952): No heartbeat from core client for 30 sec - exiting
12:05:19 (6952): No heartbeat from core client for 30 sec - exiting
12:05:20 (6952): No heartbeat from core client for 30 sec - exiting
12:05:21 (6952): No heartbeat from core client for 30 sec - exiting
12:05:22 (6952): No heartbeat from core client for 30 sec - exiting
12:05:23 (6952): No heartbeat from core client for 30 sec - exiting
12:05:24 (6952): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:05:25 (6952): No heartbeat from core client for 30 sec - exiting
12:13:07 (3868): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:16:52 (3148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8184, selfPID=8184, iMonCtr=2
12:27:40 (4736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6540, selfPID=6540, iMonCtr=2
12:41:34 (6448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:20:25 (4596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:21:26 (4892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:23:30 (5556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:25:48 (6020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6296, selfPID=6296, iMonCtr=2
13:27:51 (6440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6764, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
13:37:10 (6764): No heartbeat from core client for 30 sec - exiting
13:37:11 (6764): No heartbeat from core client for 30 sec - exiting
13:37:12 (6764): No heartbeat from core client for 30 sec - exiting
13:37:13 (6764): No heartbeat from core client for 30 sec - exiting
13:37:14 (6764): No heartbeat from core client for 30 sec - exiting
13:37:15 (6764): No heartbeat from core client for 30 sec - exiting
13:37:16 (6764): No heartbeat from core client for 30 sec - exiting
13:37:17 (6764): No heartbeat from core client for 30 sec - exiting
13:37:18 (6764): No heartbeat from core client for 30 sec - exiting
13:37:19 (6764): No heartbeat from core client for 30 sec - exiting
13:37:20 (6764): No heartbeat from core client for 30 sec - exiting
13:37:21 (6764): No heartbeat from core client for 30 sec - exiting
13:37:22 (6764): No heartbeat from core client for 30 sec - exiting
13:37:23 (6764): No heartbeat from core client for 30 sec - exiting
13:37:24 (6764): No heartbeat from core client for 30 sec - exiting
13:37:25 (6764): No heartbeat from core client for 30 sec - exiting
13:37:26 (6764): No heartbeat from core client for 30 sec - exiting
13:37:27 (6764): No heartbeat from core client for 30 sec - exiting
13:37:28 (6764): No heartbeat from core client for 30 sec - exiting
13:37:29 (6764): No heartbeat from core client for 30 sec - exiting
13:37:30 (6764): No heartbeat from core client for 30 sec - exiting
13:37:31 (6764): No heartbeat from core client for 30 sec - exiting
13:37:32 (6764): No heartbeat from core client for 30 sec - exiting
13:37:33 (6764): No heartbeat from core client for 30 sec - exiting
13:37:34 (6764): No heartbeat from core client for 30 sec - exiting
13:37:35 (6764): No heartbeat from core client for 30 sec - exiting
13:37:36 (6764): No heartbeat from core client for 30 sec - exiting
13:37:37 (6764): No heartbeat from core client for 30 sec - exiting
13:37:38 (6764): No heartbeat from core client for 30 sec - exiting
13:38:40 (6764): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_xti0_1983_1_007024240_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xti0_1983_1_007024240_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xti0_1983_1_007024240_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Feb 2011 01:07:50 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 103,929 253,478 2.4390
10 Feb 2011 00:36:31 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 103,889 252,722 2.4326
09 Feb 2011 22:17:55 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 103,867 252,003 2.4262
09 Feb 2011 19:48:13 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 103,776 251,384 2.4224
08 Feb 2011 01:38:16 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 92,256 224,130 2.4294
31 Jan 2011 22:26:14 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 80,736 197,259 2.4433
28 Jan 2011 22:01:09 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 69,216 169,303 2.4460
26 Jan 2011 04:20:54 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 57,696 141,241 2.4480
24 Jan 2011 23:09:52 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 46,176 112,902 2.4450
23 Jan 2011 15:55:18 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 34,656 84,196 2.4295
23 Jan 2011 12:33:25 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 23,136 55,556 2.4013
19 Jan 2011 21:53:25 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 11,624 27,668 2.3802
19 Jan 2011 21:37:45 1108766 12503260 hadam3p_eu_xti0_1983_1_007024240_2 11,616 27,258 2.3466


©2024 climateprediction.net