climateprediction.net home page
Task 13979227

Task 13979227

Name hadam3p_pnw_8sjj_2002_1_007709947_0
Workunit 7865055
Created 25 Jan 2012, 19:03:29 UTC
Sent 3 Feb 2012, 14:42:16 UTC
Report deadline 15 Jan 2013, 20:02:16 UTC
Received 21 Feb 2012, 23:33:27 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1183660
Run time 3 days 13 hours 9 min
CPU time 3 days 11 hours 39 min 27 sec
Validate state Invalid
Credit 2,004.61
Device peak FLOPS 2.82 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8012, selfPID=8012, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4752, selfPID=4752, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10156, selfPID=10156, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9716, selfPID=9716, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10208, selfPID=10208, iMonCtr=2
08:54:44 (1016): No heartbeat from core client for 30 sec - exiting
08:54:45 (1016): No heartbeat from core client for 30 sec - exiting
08:54:46 (1016): No heartbeat from core client for 30 sec - exiting
08:54:47 (1016): No heartbeat from core client for 30 sec - exiting
08:54:48 (1016): No heartbeat from core client for 30 sec - exiting
08:54:49 (1016): No heartbeat from core client for 30 sec - exiting
08:54:50 (1016): No heartbeat from core client for 30 sec - exiting
08:54:51 (1016): No heartbeat from core client for 30 sec - exiting
08:54:52 (1016): No heartbeat from core client for 30 sec - exiting
08:54:53 (1016): No heartbeat from core client for 30 sec - exiting
08:54:54 (1016): No heartbeat from core client for 30 sec - exiting
08:54:55 (1016): No heartbeat from core client for 30 sec - exiting
08:54:56 (1016): No heartbeat from core client for 30 sec - exiting
08:54:57 (1016): No heartbeat from core client for 30 sec - exiting
08:54:58 (1016): No heartbeat from core client for 30 sec - exiting
08:54:59 (1016): No heartbeat from core client for 30 sec - exiting
08:55:00 (1016): No heartbeat from core client for 30 sec - exiting
08:55:01 (1016): No heartbeat from core client for 30 sec - exiting
08:55:02 (1016): No heartbeat from core client for 30 sec - exiting
08:55:03 (1016): No heartbeat from core client for 30 sec - exiting
08:55:04 (1016): No heartbeat from core client for 30 sec - exiting
08:55:05 (1016): No heartbeat from core client for 30 sec - exiting
08:55:06 (1016): No heartbeat from core client for 30 sec - exiting
08:55:07 (1016): No heartbeat from core client for 30 sec - exiting
08:55:08 (1016): No heartbeat from core client for 30 sec - exiting
08:55:09 (1016): No heartbeat from core client for 30 sec - exiting
08:55:10 (1016): No heartbeat from core client for 30 sec - exiting
08:55:11 (1016): No heartbeat from core client for 30 sec - exiting
08:55:12 (1016): No heartbeat from core client for 30 sec - exiting
08:55:13 (1016): No heartbeat from core client for 30 sec - exiting
08:55:14 (1016): No heartbeat from core client for 30 sec - exiting
08:55:15 (1016): No heartbeat from core client for 30 sec - exiting
08:55:16 (1016): No heartbeat from core client for 30 sec - exiting
08:55:17 (1016): No heartbeat from core client for 30 sec - exiting
08:55:18 (1016): No heartbeat from core client for 30 sec - exiting
08:55:19 (1016): No heartbeat from core client for 30 sec - exiting
08:55:20 (1016): No heartbeat from core client for 30 sec - exiting
08:55:21 (1016): No heartbeat from core client for 30 sec - exiting
08:55:22 (1016): No heartbeat from core client for 30 sec - exiting
08:55:23 (1016): No heartbeat from core client for 30 sec - exiting
08:55:24 (1016): No heartbeat from core client for 30 sec - exiting
08:55:25 (1016): No heartbeat from core client for 30 sec - exiting
08:55:26 (1016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8096, selfPID=5244, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_8sjj_2002_1_007709947/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_8sjj_2002_1_007709947/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 0
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_8sjj_2002_1_007709947_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_8sjj_2002_1_007709947_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_8sjj_2002_1_007709947_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_8sjj_2002_1_007709947_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 Feb 2012 03:18:44 1183660 13979227 hadam3p_pnw_8sjj_2002_1_007709947_0 92,256 288,345 3.1255
10 Feb 2012 03:02:19 1183660 13979227 hadam3p_pnw_8sjj_2002_1_007709947_0 80,736 252,588 3.1286
09 Feb 2012 16:08:55 1183660 13979227 hadam3p_pnw_8sjj_2002_1_007709947_0 69,216 216,136 3.1226
09 Feb 2012 06:13:00 1183660 13979227 hadam3p_pnw_8sjj_2002_1_007709947_0 57,696 179,211 3.1061
08 Feb 2012 18:34:58 1183660 13979227 hadam3p_pnw_8sjj_2002_1_007709947_0 46,178 143,326 3.1038
08 Feb 2012 17:38:13 1183660 13979227 hadam3p_pnw_8sjj_2002_1_007709947_0 46,176 142,992 3.0967
08 Feb 2012 04:53:20 1183660 13979227 hadam3p_pnw_8sjj_2002_1_007709947_0 34,656 108,446 3.1292
07 Feb 2012 14:37:34 1183660 13979227 hadam3p_pnw_8sjj_2002_1_007709947_0 23,136 72,616 3.1387
07 Feb 2012 01:59:11 1183660 13979227 hadam3p_pnw_8sjj_2002_1_007709947_0 11,616 37,576 3.2348


©2024 cpdn.org