climateprediction.net home page
Task 11910602

Task 11910602

Name hadam3p_saf_v68v_1990_1_006724916_1
Workunit 6928159
Created 17 Sep 2010, 23:41:26 UTC
Sent 17 Sep 2010, 23:52:07 UTC
Report deadline 31 Aug 2011, 5:12:07 UTC
Received 16 Oct 2010, 8:12:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1101724
Run time 2 days 17 hours 53 min 35 sec
CPU time 2 days 15 hours 35 min 13 sec
Validate state Invalid
Credit 1,496.58
Device peak FLOPS 2.69 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.05
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3500, selfPID=3500, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
RegiCPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3380, selfPID=3380, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2572, selfPID=2572, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4504, selfPID=4504, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=652, selfPID=652, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4796, selfPID=4796, iMonCtr=2
03:28:02 (3960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:31:26 (4048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2732, selfPID=2732, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4544, selfPID=4544, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1832, selfPID=1832, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=328, selfPID=328, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:27:25 (3668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
03:50:09 (3376): No heartbeat from core client for 30 sec - exiting
03:50:10 (3376): No heartbeat from core client for 30 sec - exiting
03:50:11 (3376): No heartbeat from core client for 30 sec - exiting
03:50:12 (3376): No heartbeat from core client for 30 sec - exiting
03:50:13 (3376): No heartbeat from core client for 30 sec - exiting
03:50:14 (3376): No heartbeat from core client for 30 sec - exiting
03:50:15 (3376): No heartbeat from core client for 30 sec - exiting
03:50:16 (3376): No heartbeat from core client for 30 sec - exiting
03:50:17 (3376): No heartbeat from core client for 30 sec - exiting
03:50:18 (3376): No heartbeat from core client for 30 sec - exiting
03:50:20 (3376): No heartbeat from core client for 30 sec - exiting
03:50:21 (3376): No heartbeat from core client for 30 sec - exiting
03:50:22 (3376): No heartbeat from core client for 30 sec - exiting
03:50:23 (3376): No heartbeat from core client for 30 sec - exiting
03:50:24 (3376): No heartbeat from core client for 30 sec - exiting
03:50:25 (3376): No heartbeat from core client for 30 sec - exiting
03:50:26 (3376): No heartbeat from core client for 30 sec - exiting
03:50:27 (3376): No heartbeat from core client for 30 sec - exiting
03:50:28 (3376): No heartbeat from core client for 30 sec - exiting
03:50:29 (3376): No heartbeat from core client for 30 sec - exiting
03:50:31 (3376): No heartbeat from core client for 30 sec - exiting

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...

zip error: Could not create output file (was replacing the original zip file)
03:51:01 (3320): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_v68v_1990_1_006724916_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v68v_1990_1_006724916_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v68v_1990_1_006724916_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v68v_1990_1_006724916_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Oct 2010 06:30:42 1101724 11910602 hadam3p_saf_v68v_1990_1_006724916_1 92,256 222,973 2.4169
15 Oct 2010 13:22:00 1101724 11910602 hadam3p_saf_v68v_1990_1_006724916_1 80,736 195,673 2.4236
15 Oct 2010 03:41:05 1101724 11910602 hadam3p_saf_v68v_1990_1_006724916_1 69,216 168,669 2.4368
14 Oct 2010 15:15:36 1101724 11910602 hadam3p_saf_v68v_1990_1_006724916_1 57,696 140,994 2.4437
14 Oct 2010 04:14:02 1101724 11910602 hadam3p_saf_v68v_1990_1_006724916_1 46,176 113,013 2.4474
13 Oct 2010 15:18:12 1101724 11910602 hadam3p_saf_v68v_1990_1_006724916_1 34,656 85,139 2.4567
12 Oct 2010 05:03:41 1101724 11910602 hadam3p_saf_v68v_1990_1_006724916_1 23,136 56,817 2.4558
11 Oct 2010 20:29:09 1101724 11910602 hadam3p_saf_v68v_1990_1_006724916_1 11,616 28,719 2.4724


©2024 climateprediction.net