climateprediction.net home page
Task 17653675

Task 17653675

Name hadam3p_pnw_w1h6_2005_1_009351203_1
Workunit 9435336
Created 25 Dec 2014, 20:13:19 UTC
Sent 25 Dec 2014, 21:11:54 UTC
Report deadline 8 Dec 2015, 2:31:54 UTC
Received 13 Jan 2015, 17:51:39 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1124633
Run time 21 hours 49 min 14 sec
CPU time 21 hours 49 min 14 sec
Validate state Invalid
Credit 757.44
Device peak FLOPS 3.21 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v7.22
windows_intelx86
Stderr
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<stderr_txt>
17:51:30 (6872): No heartbeat from client for 30 sec - exiting
17:51:30 (6872): timer handler: client dead, exiting
17:51:31 (6872): No heartbeat from client for 30 sec - exiting
17:51:31 (6872): timer handler: client dead, exiting
17:51:32 (6872): No heartbeat from client for 30 sec - exiting
17:51:32 (6872): timer handler: client dead, exiting
17:51:33 (6872): No heartbeat from client for 30 sec - exiting
17:51:33 (6872): timer handler: client dead, exiting
17:51:34 (6872): No heartbeat from client for 30 sec - exiting
17:51:34 (6872): timer handler: client dead, exiting
17:51:35 (6872): No heartbeat from client for 30 sec - exiting
17:51:35 (6872): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:55:10 (10328): No heartbeat from client for 30 sec - exiting
17:55:10 (10328): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
23:12:32 (17664): No heartbeat from client for 30 sec - exiting
23:12:32 (17664): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:12:33 (17664): No heartbeat from client for 30 sec - exiting
23:12:33 (17664): timer handler: client dead, exiting
23:12:34 (17664): No heartbeat from client for 30 sec - exiting
23:12:34 (17664): timer handler: client dead, exiting
01:38:01 (13204): No heartbeat from client for 30 sec - exiting
01:38:01 (13204): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:38:02 (13204): No heartbeat from client for 30 sec - exiting
01:38:02 (13204): timer handler: client dead, exiting
01:38:03 (13204): No heartbeat from client for 30 sec - exiting
01:38:03 (13204): timer handler: client dead, exiting
01:39:16 (13712): No heartbeat from client for 30 sec - exiting
01:39:16 (13712): timer handler: client dead, exiting
01:39:18 (13712): No heartbeat from client for 30 sec - exiting
01:39:18 (13712): timer handler: client dead, exiting
01:39:19 (13712): No heartbeat from client for 30 sec - exiting
01:39:19 (13712): timer handler: client dead, exiting
01:39:20 (13712): No heartbeat from client for 30 sec - exiting
01:39:20 (13712): timer handler: client dead, exiting
01:39:21 (13712): No heartbeat from client for 30 sec - exiting
01:39:21 (13712): timer handler: client dead, exiting
01:39:22 (13712): No heartbeat from client for 30 sec - exiting
01:39:22 (13712): timer handler: client dead, exiting
01:39:23 (13712): No heartbeat from client for 30 sec - exiting
01:39:23 (13712): timer handler: client dead, exiting
01:39:24 (13712): No heartbeat from client for 30 sec - exiting
01:39:24 (13712): timer handler: client dead, exiting
01:39:25 (13712): No heartbeat from client for 30 sec - exiting
01:39:25 (13712): timer handler: client dead, exiting
01:39:26 (13712): No heartbeat from client for 30 sec - exiting
01:39:26 (13712): timer handler: client dead, exiting
01:39:27 (13712): No heartbeat from client for 30 sec - exiting
01:39:27 (13712): timer handler: client dead, exiting
01:39:29 (13712): No heartbeat from client for 30 sec - exiting
01:39:29 (13712): timer handler: client dead, exiting
01:39:30 (13712): No heartbeat from client for 30 sec - exiting
01:39:30 (13712): timer handler: client dead, exiting
01:39:31 (13712): No heartbeat from client for 30 sec - exiting
01:39:31 (13712): timer handler: client dead, exiting
01:39:32 (13712): No heartbeat from client for 30 sec - exiting
01:39:32 (13712): timer handler: client dead, exiting
01:39:33 (13712): No heartbeat from client for 30 sec - exiting
01:39:33 (13712): timer handler: client dead, exiting
01:39:34 (13712): No heartbeat from client for 30 sec - exiting
01:39:34 (13712): timer handler: client dead, exiting
01:39:35 (13712): No heartbeat from client for 30 sec - exiting
01:39:35 (13712): timer handler: client dead, exiting
01:39:36 (13712): No heartbeat from client for 30 sec - exiting
01:39:36 (13712): timer handler: client dead, exiting
01:39:37 (13712): No heartbeat from client for 30 sec - exiting
01:39:37 (13712): timer handler: client dead, exiting
01:39:38 (13712): No heartbeat from client for 30 sec - exiting
01:39:38 (13712): timer handler: client dead, exiting
01:39:39 (13712): No heartbeat from client for 30 sec - exiting
01:39:39 (13712): timer handler: client dead, exiting
01:39:41 (13712): No heartbeat from client for 30 sec - exiting
01:39:41 (13712): timer handler: client dead, exiting
01:39:42 (13712): No heartbeat from client for 30 sec - exiting
01:39:42 (13712): timer handler: client dead, exiting
01:39:43 (13712): No heartbeat from client for 30 sec - exiting
01:39:43 (13712): timer handler: client dead, exiting
01:39:44 (13712): No heartbeat from client for 30 sec - exiting
01:39:44 (13712): timer handler: client dead, exiting
01:39:45 (13712): No heartbeat from client for 30 sec - exiting
01:39:45 (13712): timer handler: client dead, exiting
01:39:46 (13712): No heartbeat from client for 30 sec - exiting
01:39:46 (13712): timer handler: client dead, exiting
01:39:47 (13712): No heartbeat from client for 30 sec - exiting
01:39:47 (13712): timer handler: client dead, exiting
01:39:48 (13712): No heartbeat from client for 30 sec - exiting
01:39:48 (13712): timer handler: client dead, exiting
01:39:49 (13712): No heartbeat from client for 30 sec - exiting
01:39:49 (13712): timer handler: client dead, exiting
01:39:50 (13712): No heartbeat from client for 30 sec - exiting
01:39:50 (13712): timer handler: client dead, exiting
01:39:51 (13712): No heartbeat from client for 30 sec - exiting
01:39:51 (13712): timer handler: client dead, exiting
01:39:53 (13712): No heartbeat from client for 30 sec - exiting
01:39:53 (13712): timer handler: client dead, exiting
01:39:54 (13712): No heartbeat from client for 30 sec - exiting
01:39:54 (13712): timer handler: client dead, exiting
01:39:55 (13712): No heartbeat from client for 30 sec - exiting
01:39:55 (13712): timer handler: client dead, exiting
01:39:56 (13712): No heartbeat from client for 30 sec - exiting
01:39:56 (13712): timer handler: client dead, exiting
01:39:57 (13712): No heartbeat from client for 30 sec - exiting
01:39:57 (13712): timer handler: client dead, exiting
01:39:58 (13712): No heartbeat from client for 30 sec - exiting
01:39:58 (13712): timer handler: client dead, exiting
01:39:59 (13712): No heartbeat from client for 30 sec - exiting
01:39:59 (13712): timer handler: client dead, exiting
01:40:00 (13712): No heartbeat from client for 30 sec - exiting
01:40:00 (13712): timer handler: client dead, exiting
01:40:01 (13712): No heartbeat from client for 30 sec - exiting
01:40:01 (13712): timer handler: client dead, exiting
01:40:02 (13712): No heartbeat from client for 30 sec - exiting
01:40:02 (13712): timer handler: client dead, exiting
01:40:03 (13712): No heartbeat from client for 30 sec - exiting
01:40:03 (13712): timer handler: client dead, exiting
01:40:05 (13712): No heartbeat from client for 30 sec - exiting
01:40:05 (13712): timer handler: client dead, exiting
01:40:06 (13712): No heartbeat from client for 30 sec - exiting
01:40:06 (13712): timer handler: client dead, exiting
01:40:07 (13712): No heartbeat from client for 30 sec - exiting
01:40:07 (13712): timer handler: client dead, exiting
01:40:08 (13712): No heartbeat from client for 30 sec - exiting
01:40:08 (13712): timer handler: client dead, exiting
01:40:09 (13712): No heartbeat from client for 30 sec - exiting
01:40:09 (13712): timer handler: client dead, exiting
01:40:10 (13712): No heartbeat from client for 30 sec - exiting
01:40:10 (13712): timer handler: client dead, exiting
01:40:11 (13712): No heartbeat from client for 30 sec - exiting
01:40:11 (13712): timer handler: client dead, exiting
01:40:12 (13712): No heartbeat from client for 30 sec - exiting
01:40:12 (13712): timer handler: client dead, exiting
01:40:13 (13712): No heartbeat from client for 30 sec - exiting
01:40:13 (13712): timer handler: client dead, exiting
01:40:14 (13712): No heartbeat from client for 30 sec - exiting
01:40:14 (13712): timer handler: client dead, exiting
01:40:15 (13712): No heartbeat from client for 30 sec - exiting
01:40:15 (13712): timer handler: client dead, exiting
01:40:17 (13712): No heartbeat from client for 30 sec - exiting
01:40:17 (13712): timer handler: client dead, exiting
01:40:18 (13712): No heartbeat from client for 30 sec - exiting
01:40:18 (13712): timer handler: client dead, exiting
01:40:19 (13712): No heartbeat from client for 30 sec - exiting
01:40:19 (13712): timer handler: client dead, exiting
01:40:20 (13712): No heartbeat from client for 30 sec - exiting
01:40:20 (13712): timer handler: client dead, exiting
01:40:21 (13712): No heartbeat from client for 30 sec - exiting
01:40:21 (13712): timer handler: client dead, exiting
01:40:22 (13712): No heartbeat from client for 30 sec - exiting
01:40:22 (13712): timer handler: client dead, exiting
01:40:23 (13712): No heartbeat from client for 30 sec - exiting
01:40:23 (13712): timer handler: client dead, exiting
01:40:24 (13712): No heartbeat from client for 30 sec - exiting
01:40:24 (13712): timer handler: client dead, exiting
01:40:25 (13712): No heartbeat from client for 30 sec - exiting
01:40:26 (13712): timer handler: client dead, exiting
01:40:27 (13712): No heartbeat from client for 30 sec - exiting
01:40:27 (13712): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:40:28 (13712): No heartbeat from client for 30 sec - exiting
01:40:28 (13712): timer handler: client dead, exiting
07:13:41 (12616): No heartbeat from client for 30 sec - exiting
07:13:41 (12616): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8056, selfPID=8056, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8792, selfPID=8792, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8792, selfPID=292, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
00:55:07 (292): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_w1h6_2005_1_009351203_1_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_w1h6_2005_1_009351203_1_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_w1h6_2005_1_009351203_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_w1h6_2005_1_009351203_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_w1h6_2005_1_009351203_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_w1h6_2005_1_009351203_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_w1h6_2005_1_009351203_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_w1h6_2005_1_009351203_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_w1h6_2005_1_009351203_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 Jan 2015 15:58:32 1124633 17653675 hadam3p_pnw_w1h6_2005_1_009351203_1 34,859 66,237 1.9001
13 Jan 2015 15:58:32 1124633 17653675 hadam3p_pnw_w1h6_2005_1_009351203_1 23,339 48,975 2.0984
13 Jan 2015 15:58:32 1124633 17653675 hadam3p_pnw_w1h6_2005_1_009351203_1 11,819 31,477 2.6633


©2024 cpdn.org