climateprediction.net home page
Task 16117957

Task 16117957

Name hadam3p_eu_iz9o_2006_1_008490897_0
Workunit 8641710
Created 3 Dec 2013, 21:04:50 UTC
Sent 4 Dec 2013, 20:34:18 UTC
Report deadline 17 Nov 2014, 1:54:18 UTC
Received 3 Jan 2014, 19:39:33 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1115566
Run time 7 days 3 hours 49 min 29 sec
CPU time 5 days 1 hours 26 min 31 sec
Validate state Invalid
Credit 1,790.21
Device peak FLOPS 1.25 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=2
Model crash detected, will try to restart...
22:10:41 (5324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:49:59 (1596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:23:33 (6464): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6096, selfPID=4884, iMonCtr=1
Model crash detected, will try to restart...
12:28:43 (288): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4696, selfPID=4696, iMonCtr=2
12:33:44 (6072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=2
Model crash detected, will try to restart...
12:19:59 (4196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:54:46 (5980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:29:30 (5084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:04:12 (4312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:38:57 (204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:48:27 (4360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:23:13 (5208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:07:58 (4176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:44:47 (1576): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:29:26 (660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:25:16 (5500): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:25:17 (5500): No heartbeat from core client for 30 sec - exiting
23:25:18 (5500): No heartbeat from core client for 30 sec - exiting
23:25:19 (5500): No heartbeat from core client for 30 sec - exiting
23:25:20 (5500): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1744, selfPID=4980, iMonCtr=1
Model crash detected, will try to restart...
19:55:21 (5520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:25:40 (7072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5768, selfPID=5768, iMonCtr=2
22:03:45 (6876): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:43:23 (3672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5004, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3488, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
11:26:53 (4024): No heartbeat from core client for 30 sec - exiting
11:26:55 (4024): No heartbeat from core client for 30 sec - exiting
11:26:56 (4024): No heartbeat from core client for 30 sec - exiting
11:26:57 (4024): No heartbeat from core client for 30 sec - exiting
11:26:58 (4024): No heartbeat from core client for 30 sec - exiting
11:26:59 (4024): No heartbeat from core client for 30 sec - exiting
11:27:00 (4024): No heartbeat from core client for 30 sec - exiting
11:27:01 (4024): No heartbeat from core client for 30 sec - exiting
11:27:02 (4024): No heartbeat from core client for 30 sec - exiting
11:27:03 (4024): No heartbeat from core client for 30 sec - exiting
11:27:04 (4024): No heartbeat from core client for 30 sec - exiting
11:27:06 (4024): No heartbeat from core client for 30 sec - exiting
11:27:07 (4024): No heartbeat from core client for 30 sec - exiting
11:27:08 (4024): No heartbeat from core client for 30 sec - exiting
11:27:09 (4024): No heartbeat from core client for 30 sec - exiting
11:27:10 (4024): No heartbeat from core client for 30 sec - exiting
11:27:11 (4024): No heartbeat from core client for 30 sec - exiting
11:27:12 (4024): No heartbeat from core client for 30 sec - exiting
11:27:13 (4024): No heartbeat from core client for 30 sec - exiting
11:27:14 (4024): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:14:16 (4556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:59:42 (4412): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:59:44 (4412): No heartbeat from core client for 30 sec - exiting
21:59:45 (4412): No heartbeat from core client for 30 sec - exiting
21:59:46 (4412): No heartbeat from core client for 30 sec - exiting
21:59:47 (4412): No heartbeat from core client for 30 sec - exiting
21:59:48 (4412): No heartbeat from core client for 30 sec - exiting
21:59:49 (4412): No heartbeat from core client for 30 sec - exiting
21:59:50 (4412): No heartbeat from core client for 30 sec - exiting
21:59:51 (4412): No heartbeat from core client for 30 sec - exiting
21:59:52 (4412): No heartbeat from core client for 30 sec - exiting
21:59:53 (4412): No heartbeat from core client for 30 sec - exiting
21:59:54 (4412): No heartbeat from core client for 30 sec - exiting
21:59:55 (4412): No heartbeat from core client for 30 sec - exiting
23:35:32 (4680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:18:24 (4972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:02:20 (2772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6048, selfPID=6048, iMonCtr=2
16:13:45 (5424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:57:23 (2080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:40:55 (4572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:51:39 (2456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:35:40 (576): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:20:58 (4716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:34:57 (6304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:18:41 (7108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:01:20 (6120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:44:40 (2816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:36:07 (6156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:20:32 (6024): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:04:12 (5416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:28:11 (2668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5928, iMonCtr=2
14:00:06 (5788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:00:07 (5788): No heartbeat from core client for 30 sec - exiting
16:43:13 (1768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:40:04 (6996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

zip error: Could not create output file (was replacing the original zip file)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=204, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3860, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4628, selfPID=4348, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5600, selfPID=3176, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_iz9o_2006_1_008490897_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_iz9o_2006_1_008490897_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_iz9o_2006_1_008490897_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Jan 2014 17:46:32 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 103,776 416,745 4.0158
01 Jan 2014 18:16:03 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 92,256 373,992 4.0539
31 Dec 2013 21:16:29 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 80,736 331,794 4.1096
30 Dec 2013 18:00:55 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 69,227 288,239 4.1637
30 Dec 2013 17:00:10 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 69,216 287,543 4.1543
29 Dec 2013 11:47:48 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 57,696 244,380 4.2356
24 Dec 2013 19:27:03 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 46,176 199,579 4.3221
21 Dec 2013 20:56:35 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 34,662 148,648 4.2885
21 Dec 2013 20:06:23 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 34,656 147,909 4.2679
10 Dec 2013 20:23:54 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 23,149 102,895 4.4449
09 Dec 2013 23:58:12 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 23,142 102,189 4.4157
09 Dec 2013 23:17:59 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 23,136 101,368 4.3814
07 Dec 2013 20:59:00 1115566 16117957 hadam3p_eu_iz9o_2006_1_008490897_0 11,616 46,926 4.0398


©2024 climateprediction.net