climateprediction.net home page
Task 14083263

Task 14083263

Name hadam3p_eu_2owj_1964_1_007165771_1
Workunit 7350611
Created 9 Feb 2012, 22:14:01 UTC
Sent 9 Feb 2012, 22:50:11 UTC
Report deadline 22 Jan 2013, 4:10:11 UTC
Received 2 Mar 2012, 13:40:54 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 984012
Run time 3 days 15 hours 38 min 54 sec
CPU time 3 days 9 hours 57 min 51 sec
Validate state Invalid
Credit 1,392.75
Device peak FLOPS 1.94 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.6.28</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6964, selfPID=6964, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5784, selfPID=5784, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6060, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4976, selfPID=3728, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5036, selfPID=1976, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3324, selfPID=4980, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
10:52:06 (4312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:53:45 (5860): No heartbeat from core client for 30 sec - exiting
10:53:46 (5860): No heartbeat from core client for 30 sec - exiting
10:53:47 (5860): No heartbeat from core client for 30 sec - exiting
10:53:48 (5860): No heartbeat from core client for 30 sec - exiting
10:53:49 (5860): No heartbeat from core client for 30 sec - exiting
10:53:50 (5860): No heartbeat from core client for 30 sec - exiting
10:53:51 (5860): No heartbeat from core client for 30 sec - exiting
10:53:52 (5860): No heartbeat from core client for 30 sec - exiting
10:53:53 (5860): No heartbeat from core client for 30 sec - exiting
10:53:55 (5860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4724, selfPID=4724, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4936, selfPID=4104, iMonCtr=1
Model crash detected, will try to restart...
17:25:24 (3284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:27:50 (4040): No heartbeat from core client for 30 sec - exiting
17:27:51 (4040): No heartbeat from core client for 30 sec - exiting
17:27:52 (4040): No heartbeat from core client for 30 sec - exiting
17:27:53 (4040): No heartbeat from core client for 30 sec - exiting
17:27:54 (4040): No heartbeat from core client for 30 sec - exiting
17:27:55 (4040): No heartbeat from core client for 30 sec - exiting
17:27:56 (4040): No heartbeat from core client for 30 sec - exiting
17:27:57 (4040): No heartbeat from core client for 30 sec - exiting
17:27:58 (4040): No heartbeat from core client for 30 sec - exiting
17:27:59 (4040): No heartbeat from core client for 30 sec - exiting
17:28:00 (4040): No heartbeat from core client for 30 sec - exiting
17:28:01 (4040): No heartbeat from core client for 30 sec - exiting
17:28:02 (4040): No heartbeat from core client for 30 sec - exiting
17:28:03 (4040): No heartbeat from core client for 30 sec - exiting
17:28:04 (4040): No heartbeat from core client for 30 sec - exiting
17:28:36 (4040): No heartbeat from core client for 30 sec - exiting
17:28:37 (4040): No heartbeat from core client for 30 sec - exiting
17:28:38 (4040): No heartbeat from core client for 30 sec - exiting
17:28:39 (4040): No heartbeat from core client for 30 sec - exiting
17:29:15 (4040): No heartbeat from core client for 30 sec - exiting
17:29:16 (4040): No heartbeat from core client for 30 sec - exiting
17:29:17 (4040): No heartbeat from core client for 30 sec - exiting
17:29:18 (4040): No heartbeat from core client for 30 sec - exiting
17:29:19 (4040): No heartbeat from core client for 30 sec - exiting
17:29:20 (4040): No heartbeat from core client for 30 sec - exiting
17:29:21 (4040): No heartbeat from core client for 30 sec - exiting
17:29:22 (4040): No heartbeat from core client for 30 sec - exiting
17:29:23 (4040): No heartbeat from core client for 30 sec - exiting
17:29:24 (4040): No heartbeat from core client for 30 sec - exiting
17:29:25 (4040): No heartbeat from core client for 30 sec - exiting
17:29:26 (4040): No heartbeat from core client for 30 sec - exiting
17:29:27 (4040): No heartbeat from core client for 30 sec - exiting
17:29:28 (4040): No heartbeat from core client for 30 sec - exiting
17:29:29 (4040): No heartbeat from core client for 30 sec - exiting
17:29:30 (4040): No heartbeat from core client for 30 sec - exiting
17:29:31 (4040): No heartbeat from core client for 30 sec - exiting
17:29:32 (4040): No heartbeat from core client for 30 sec - exiting
17:29:33 (4040): No heartbeat from core client for 30 sec - exiting
17:29:34 (4040): No heartbeat from core client for 30 sec - exiting
17:29:35 (4040): No heartbeat from core client for 30 sec - exiting
17:29:36 (4040): No heartbeat from core client for 30 sec - exiting
17:29:37 (4040): No heartbeat from core client for 30 sec - exiting
17:29:38 (4040): No heartbeat from core client for 30 sec - exiting
17:29:39 (4040): No heartbeat from core client for 30 sec - exiting
17:29:40 (4040): No heartbeat from core client for 30 sec - exiting
17:29:41 (4040): No heartbeat from core client for 30 sec - exiting
17:29:42 (4040): No heartbeat from core client for 30 sec - exiting
17:29:43 (4040): No heartbeat from core client for 30 sec - exiting
17:29:44 (4040): No heartbeat from core client for 30 sec - exiting
17:29:45 (4040): No heartbeat from core client for 30 sec - exiting
17:29:46 (4040): No heartbeat from core client for 30 sec - exiting
17:29:47 (4040): No heartbeat from core client for 30 sec - exiting
17:29:48 (4040): No heartbeat from core client for 30 sec - exiting
17:29:49 (4040): No heartbeat from core client for 30 sec - exiting
17:29:50 (4040): No heartbeat from core client for 30 sec - exiting
17:29:51 (4040): No heartbeat from core client for 30 sec - exiting
17:29:52 (4040): No heartbeat from core client for 30 sec - exiting
17:29:53 (4040): No heartbeat from core client for 30 sec - exiting
17:29:54 (4040): No heartbeat from core client for 30 sec - exiting
17:29:55 (4040): No heartbeat from core client for 30 sec - exiting
17:29:56 (4040): No heartbeat from core client for 30 sec - exiting
17:29:57 (4040): No heartbeat from core client for 30 sec - exiting
17:29:58 (4040): No heartbeat from core client for 30 sec - exiting
17:29:59 (4040): No heartbeat from core client for 30 sec - exiting
17:30:00 (4040): No heartbeat from core client for 30 sec - exiting
17:30:01 (4040): No heartbeat from core client for 30 sec - exiting
17:30:02 (4040): No heartbeat from core client for 30 sec - exiting
17:30:03 (4040): No heartbeat from core client for 30 sec - exiting
17:30:04 (4040): No heartbeat from core client for 30 sec - exiting
17:30:05 (4040): No heartbeat from core client for 30 sec - exiting
17:30:06 (4040): No heartbeat from core client for 30 sec - exiting
17:30:07 (4040): No heartbeat from core client for 30 sec - exiting
17:30:08 (4040): No heartbeat from core client for 30 sec - exiting
17:30:09 (4040): No heartbeat from core client for 30 sec - exiting
17:30:10 (4040): No heartbeat from core client for 30 sec - exiting
17:30:11 (4040): No heartbeat from core client for 30 sec - exiting
17:30:12 (4040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:21:49 (3060): No heartbeat from core client for 30 sec - exiting
18:21:50 (3060): No heartbeat from core client for 30 sec - exiting
18:21:51 (3060): No heartbeat from core client for 30 sec - exiting
18:21:52 (3060): No heartbeat from core client for 30 sec - exiting
18:21:53 (3060): No heartbeat from core client for 30 sec - exiting
18:21:54 (3060): No heartbeat from core client for 30 sec - exiting
18:21:55 (3060): No heartbeat from core client for 30 sec - exiting
18:21:56 (3060): No heartbeat from core client for 30 sec - exiting
18:21:57 (3060): No heartbeat from core client for 30 sec - exiting
18:21:58 (3060): No heartbeat from core client for 30 sec - exiting
18:21:59 (3060): No heartbeat from core client for 30 sec - exiting
18:22:03 (3060): No heartbeat from core client for 30 sec - exiting
18:22:04 (3060): No heartbeat from core client for 30 sec - exiting
18:22:05 (3060): No heartbeat from core client for 30 sec - exiting
18:22:06 (3060): No heartbeat from core client for 30 sec - exiting
18:22:07 (3060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_2owj_1964_1_007165771_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2owj_1964_1_007165771_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2owj_1964_1_007165771_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2owj_1964_1_007165771_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2owj_1964_1_007165771_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Mar 2012 13:46:04 984012 14083263 hadam3p_eu_2owj_1964_1_007165771_1 80,736 286,185 3.5447
28 Feb 2012 20:37:07 984012 14083263 hadam3p_eu_2owj_1964_1_007165771_1 69,216 247,551 3.5765
27 Feb 2012 19:14:43 984012 14083263 hadam3p_eu_2owj_1964_1_007165771_1 57,696 209,543 3.6318
25 Feb 2012 16:54:32 984012 14083263 hadam3p_eu_2owj_1964_1_007165771_1 46,176 168,122 3.6409
16 Feb 2012 22:12:40 984012 14083263 hadam3p_eu_2owj_1964_1_007165771_1 34,656 126,121 3.6392
15 Feb 2012 19:34:20 984012 14083263 hadam3p_eu_2owj_1964_1_007165771_1 23,136 84,350 3.6458
14 Feb 2012 16:26:22 984012 14083263 hadam3p_eu_2owj_1964_1_007165771_1 11,616 41,826 3.6007


©2024 cpdn.org