Task 12296992

Name hadam3p_pnw_zppp_1972_1_007011845_0
Workunit 7215161
Created 24 Nov 2010, 14:21:00 UTC
Sent 23 Jan 2011, 18:04:07 UTC
Report deadline 5 Jan 2012, 23:24:07 UTC
Received 14 Feb 2011, 21:47:35 UTC
Server state Over
Outcome Didn't need
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1013631
Run time 2 days 12 hours 30 min 30 sec
CPU time 2 days 1 hours 43 min 15 sec
Validate state Invalid
Credit 1,003.35
Device peak FLOPS 2.27 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3760, selfPID=3760, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5784, selfPID=5784, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5948, selfPID=5948, iMonCtr=2
20:22:53 (3452): No heartbeat from core client for 30 sec - exiting
20:22:54 (3452): No heartbeat from core client for 30 sec - exiting
20:22:55 (3452): No heartbeat from core client for 30 sec - exiting
20:22:56 (3452): No heartbeat from core client for 30 sec - exiting
20:22:57 (3452): No heartbeat from core client for 30 sec - exiting
20:22:58 (3452): No heartbeat from core client for 30 sec - exiting
20:22:59 (3452): No heartbeat from core client for 30 sec - exiting
20:23:00 (3452): No heartbeat from core client for 30 sec - exiting
20:23:02 (3452): No heartbeat from core client for 30 sec - exiting
20:23:03 (3452): No heartbeat from core client for 30 sec - exiting
20:23:04 (3452): No heartbeat from core client for 30 sec - exiting
20:23:05 (3452): No heartbeat from core client for 30 sec - exiting
20:23:06 (3452): No heartbeat from core client for 30 sec - exiting
20:23:07 (3452): No heartbeat from core client for 30 sec - exiting
20:23:08 (3452): No heartbeat from core client for 30 sec - exiting
20:23:09 (3452): No heartbeat from core client for 30 sec - exiting
20:23:10 (3452): No heartbeat from core client for 30 sec - exiting
20:23:11 (3452): No heartbeat from core client for 30 sec - exiting
20:23:12 (3452): No heartbeat from core client for 30 sec - exiting
20:23:13 (3452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7124, selfPID=7124, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6852, selfPID=6852, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
17:05:12 (2688): No heartbeat from core client for 30 sec - exiting
17:05:13 (2688): No heartbeat from core client for 30 sec - exiting
17:05:14 (2688): No heartbeat from core client for 30 sec - exiting
17:05:16 (2688): No heartbeat from core client for 30 sec - exiting
17:05:17 (2688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1592, selfPID=1592, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5292, selfPID=5292, iMonCtr=2
22:28:21 (4852): No heartbeat from core client for 30 sec - exiting
22:28:22 (4852): No heartbeat from core client for 30 sec - exiting
22:28:23 (4852): No heartbeat from core client for 30 sec - exiting
22:28:24 (4852): No heartbeat from core client for 30 sec - exiting

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5716, selfPID=5172, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
22:28:56 (5172): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_zppp_1972_1_007011845_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zppp_1972_1_007011845_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zppp_1972_1_007011845_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zppp_1972_1_007011845_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zppp_1972_1_007011845_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zppp_1972_1_007011845_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zppp_1972_1_007011845_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zppp_1972_1_007011845_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
13 Feb 2011 14:33:16  1013631  12296992   hadam3p_pnw_zppp_1972_1_007011845_0  46,176    153,436         3.3229
09 Feb 2011 16:44:16  1013631  12296992   hadam3p_pnw_zppp_1972_1_007011845_0  34,656    115,009         3.3186
06 Feb 2011 11:20:04  1013631  12296992   hadam3p_pnw_zppp_1972_1_007011845_0  23,136    76,142          3.2911
04 Feb 2011 21:52:59  1013631  12296992   hadam3p_pnw_zppp_1972_1_007011845_0  11,616    38,566          3.3201
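
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The page itself does not state this; it is an assumption that happens to reproduce all four listed values. A minimal Python sketch, with the hypothetical names `TRICKLES` and `average_sec_per_timestep` introduced only for illustration:

```python
# Rows copied from the trickle table: (timestep, cumulative CPU seconds).
TRICKLES = [
    (46176, 153436),
    (34656, 115009),
    (23136, 76142),
    (11616, 38566),
]


def average_sec_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Assumed definition: cumulative CPU seconds per cumulative timestep."""
    return round(cpu_seconds / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_seconds in TRICKLES:
        average = average_sec_per_timestep(timestep, cpu_seconds)
        print(timestep, cpu_seconds, average)
```

Successive trickles in the table are 11,520 timesteps apart, so the near-constant average suggests the host ran at a steady rate of roughly 3.3 CPU seconds per timestep until the task stopped.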

