Task 13719141

Name hadam3p_pnw_70u7_2009_1_007596327_0
Workunit 7774457
Created 5 Dec 2011, 11:54:15 UTC
Sent 19 Dec 2011, 15:39:37 UTC
Report deadline 30 Nov 2012, 20:59:37 UTC
Received 27 Dec 2011, 3:13:12 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1259863
Run time 1 days 9 hours 28 min 27 sec
CPU time 1 days 0 hours 4 min 6 sec
Validate state Invalid
Credit 502.74
Device peak FLOPS 2.16 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
19:15:03 (1748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:15:05 (1748): No heartbeat from core client for 30 sec - exiting
19:15:06 (1748): No heartbeat from core client for 30 sec - exiting
19:15:07 (1748): No heartbeat from core client for 30 sec - exiting
19:15:08 (1748): No heartbeat from core client for 30 sec - exiting
19:15:09 (1748): No heartbeat from core client for 30 sec - exiting
19:15:10 (1748): No heartbeat from core client for 30 sec - exiting
19:15:11 (1748): No heartbeat from core client for 30 sec - exiting
19:15:12 (1748): No heartbeat from core client for 30 sec - exiting
19:15:13 (1748): No heartbeat from core client for 30 sec - exiting
19:15:14 (1748): No heartbeat from core client for 30 sec - exiting
19:15:16 (1748): No heartbeat from core client for 30 sec - exiting
19:15:17 (1748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2808, selfPID=2808, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
20:35:22 (4084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:35:24 (4084): No heartbeat from core client for 30 sec - exiting
20:35:25 (4084): No heartbeat from core client for 30 sec - exiting
20:35:26 (4084): No heartbeat from core client for 30 sec - exiting
20:35:27 (4084): No heartbeat from core client for 30 sec - exiting
20:35:28 (4084): No heartbeat from core client for 30 sec - exiting
20:35:29 (4084): No heartbeat from core client for 30 sec - exiting
20:35:30 (4084): No heartbeat from core client for 30 sec - exiting
20:35:31 (4084): No heartbeat from core client for 30 sec - exiting
20:35:32 (4084): No heartbeat from core client for 30 sec - exiting
20:35:33 (4084): No heartbeat from core client for 30 sec - exiting
20:35:34 (4084): No heartbeat from core client for 30 sec - exiting
20:35:36 (4084): No heartbeat from core client for 30 sec - exiting
20:35:37 (4084): No heartbeat from core client for 30 sec - exiting
20:35:38 (4084): No heartbeat from core client for 30 sec - exiting
20:35:39 (4084): No heartbeat from core client for 30 sec - exiting
20:35:40 (4084): No heartbeat from core client for 30 sec - exiting
20:35:41 (4084): No heartbeat from core client for 30 sec - exiting
20:35:42 (4084): No heartbeat from core client for 30 sec - exiting
20:35:43 (4084): No heartbeat from core client for 30 sec - exiting
20:35:44 (4084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:15:48 (3188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:15:50 (3188): No heartbeat from core client for 30 sec - exiting
18:15:51 (3188): No heartbeat from core client for 30 sec - exiting
18:15:52 (3188): No heartbeat from core client for 30 sec - exiting
18:15:53 (3188): No heartbeat from core client for 30 sec - exiting
18:15:54 (3188): No heartbeat from core client for 30 sec - exiting
18:15:55 (3188): No heartbeat from core client for 30 sec - exiting
18:15:56 (3188): No heartbeat from core client for 30 sec - exiting
18:15:57 (3188): No heartbeat from core client for 30 sec - exiting
18:15:58 (3188): No heartbeat from core client for 30 sec - exiting
18:15:59 (3188): No heartbeat from core client for 30 sec - exiting
18:16:01 (3188): No heartbeat from core client for 30 sec - exiting
18:16:02 (3188): No heartbeat from core client for 30 sec - exiting
18:16:03 (3188): No heartbeat from core client for 30 sec - exiting
18:35:32 (1396): Can't acquire lockfile (32) - waiting 35s
18:36:07 (1396): Can't acquire lockfile (32) - exiting
18:36:07 (1396): Error: The process cannot access the file because it is being used by another process. (0x20)
18:37:54 (1088): Can't acquire lockfile (32) - waiting 35s
18:38:29 (1088): Can't acquire lockfile (32) - exiting
18:38:29 (1088): Error: The process cannot access the file because it is being used by another process. (0x20)
18:47:07 (4220): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:02:56 (4036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:02:57 (4036): No heartbeat from core client for 30 sec - exiting
19:02:58 (4036): No heartbeat from core client for 30 sec - exiting
19:02:59 (4036): No heartbeat from core client for 30 sec - exiting
19:03:00 (4036): No heartbeat from core client for 30 sec - exiting
19:03:01 (4036): No heartbeat from core client for 30 sec - exiting
19:03:02 (4036): No heartbeat from core client for 30 sec - exiting
19:03:03 (4036): No heartbeat from core client for 30 sec - exiting
19:03:04 (4036): No heartbeat from core client for 30 sec - exiting
19:03:05 (4036): No heartbeat from core client for 30 sec - exiting
19:03:06 (4036): No heartbeat from core client for 30 sec - exiting
19:03:07 (4036): No heartbeat from core client for 30 sec - exiting
19:03:08 (4036): No heartbeat from core client for 30 sec - exiting
19:03:09 (4036): No heartbeat from core client for 30 sec - exiting
19:03:10 (4036): No heartbeat from core client for 30 sec - exiting
19:03:11 (4036): No heartbeat from core client for 30 sec - exiting
19:03:12 (4036): No heartbeat from core client for 30 sec - exiting
19:03:13 (4036): No heartbeat from core client for 30 sec - exiting
19:03:14 (4036): No heartbeat from core client for 30 sec - exiting
19:03:15 (4036): No heartbeat from core client for 30 sec - exiting
19:03:16 (4036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4416, selfPID=996, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4584, selfPID=4020, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4780, selfPID=3648, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5040, selfPID=5040, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5040, selfPID=3680, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_70u7_2009_1_007596327_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_70u7_2009_1_007596327_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_70u7_2009_1_007596327_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_70u7_2009_1_007596327_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_70u7_2009_1_007596327_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_70u7_2009_1_007596327_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_70u7_2009_1_007596327_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_70u7_2009_1_007596327_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_70u7_2009_1_007596327_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_70u7_2009_1_007596327_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
26 Dec 2011 22:54:50  1186039  13719141   hadam3p_pnw_70u7_2009_1_007596327_0  23,137    77,030          3.3293
26 Dec 2011 21:54:32  1186039  13719141   hadam3p_pnw_70u7_2009_1_007596327_0  23,136    76,504          3.3067
25 Dec 2011 23:38:19  1186039  13719141   hadam3p_pnw_70u7_2009_1_007596327_0  11,616    38,842          3.3438
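
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. That reading is an assumption based on the three rows shown, not something the page states. A minimal Python sketch to check it, with the function name `sec_per_timestep` chosen only for illustration:

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
rows = [
    (23137, 77030, 3.3293),
    (23136, 76504, 3.3067),
    (11616, 38842, 3.3438),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in rows:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Print the computed value next to the value reported in the table.
    print(timestep, computed, reported)
```

For these three rows the computed values agree with the reported column to four decimal places.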

