Task 12325559

Name: hadam3p_saf_24vi_1989_1_007038406_0
Workunit: 7241722
Created: 25 Nov 2010, 11:02:55 UTC
Sent: 10 Dec 2010, 11:18:18 UTC
Report deadline: 22 Nov 2011, 16:38:18 UTC
Received: 5 Jan 2011, 15:19:45 UTC
Server state: Over
Outcome: No reply
Client state: Compute error
Exit status: 0 (0x00000000)
Computer ID: 974347
Run time: 2 days 21 hours 59 min 3 sec
CPU time: 2 days 19 hours 1 min 43 sec
Validate state: Invalid
Credit: 1,496.58
Device peak FLOPS: 2.36 GFLOPS
Application version: UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7660, selfPID=7660, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20168, selfPID=20168, iMonCtr=2
03:34:09 (21200): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVaCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:03:32 (22404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
16:16:03 (29104): No heartbeat from core client for 30 sec - exiting
16:16:04 (29104): No heartbeat from core client for 30 sec - exiting
16:16:05 (29104): No heartbeat from core client for 30 sec - exiting
16:16:06 (29104): No heartbeat from core client for 30 sec - exiting
16:16:07 (29104): No heartbeat from core client for 30 sec - exiting
16:16:08 (29104): No heartbeat from core client for 30 sec - exiting
16:16:09 (29104): No heartbeat from core client for 30 sec - exiting
16:16:10 (29104): No heartbeat from core client for 30 sec - exiting
16:16:11 (29104): No heartbeat from core client for 30 sec - exiting
16:16:12 (29104): No heartbeat from core client for 30 sec - exiting
16:16:13 (29104): No heartbeat from core client for 30 sec - exiting
16:16:14 (29104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:51:50 (25092): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=28844, selfPID=28844, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN processRegional Worker::CPPD process is not  runnng,, eitinng, RRetaal = 1, checPIID=3400, selfPID=34300, iMonCtr=1

Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=34120, selfPID=CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:33:17 (25956): No heartbeat from core client for 30 sec - exiting
08:33:19 (25956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
08:48:46 (25276): No heartbeat from core client for 30 sec - exiting
08:48:47 (25276): No heartbeat from core client for 30 sec - exiting
08:48:48 (25276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
12:12:47 (35168): No heartbeat from core client for 30 sec - exiting
12:12:48 (35168): No heartbeat from core client for 30 sec - exiting
12:12:49 (35168): No heartbeat from core client for 30 sec - exiting
12:12:50 (35168): No heartbeat from core client for 30 sec - exiting
12:12:51 (35168): No heartbeat from core client for 30 sec - exiting
12:12:52 (35168): No heartbeat from core client for 30 sec - exiting
12:12:53 (35168): No heartbeat from core client for 30 sec - exiting
12:12:54 (35168): No heartbeat from core client for 30 sec - exiting
12:12:55 (35168): No heartbeat from core client for 30 sec - exiting
12:12:56 (35168): No heartbeat from core client for 30 sec - exiting
12:12:57 (35168): No heartbeat from core client for 30 sec - exiting
12:12:59 (35168): No heartbeat from core client for 30 sec - exiting
12:13:00 (35168): No heartbeat from core client for 30 sec - exiting
12:13:01 (35168): No heartbeat from core client for 30 sec - exiting
12:13:02 (35168): No heartbeat from core client for 30 sec - exiting
12:13:03 (35168): No heartbeat from core client for 30 sec - exiting
12:13:04 (35168): No heartbeat from core client for 30 sec - exiting
12:13:05 (35168): No heartbeat from core client for 30 sec - exiting
12:13:06 (35168): No heartbeat from core client for 30 sec - exiting
12:13:07 (35168): No heartbeat from core client for 30 sec - exiting
12:13:08 (35168): No heartbeat from core client for 30 sec - exiting
12:13:09 (35168): No heartbeat from core client for 30 sec - exiting
12:13:10 (35168): No heartbeat from core client for 30 sec - exiting
12:13:11 (35168): No heartbeat from core client for 30 sec - exiting
12:13:12 (35168): No heartbeat from core client for 30 sec - exiting
12:13:13 (35168): No heartbeat from core client for 30 sec - exiting
12:13:14 (35168): No heartbeat from core client for 30 sec - exiting
12:13:15 (35168): No heartbeat from core client for 30 sec - exiting
12:13:16 (35168): No heartbeat from core client for 30 sec - exiting
12:13:17 (35168): No heartbeat from core client for 30 sec - exiting
12:13:18 (35168): No heartbeat from core client for 30 sec - exiting
12:13:19 (35168): No heartbeat from core client for 30 sec - exiting
12:13:20 (35168): No heartbeat from core client for 30 sec - exiting
12:13:21 (35168): No heartbeat from core client for 30 sec - exiting
12:13:22 (35168): No heartbeat from core client for 30 sec - exiting
12:13:23 (35168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=34920, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=37944, selfPID=37944, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=37944, selfPID=37384, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
16:13:05 (37384): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_24vi_1989_1_007038406_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_24vi_1989_1_007038406_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_24vi_1989_1_007038406_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_24vi_1989_1_007038406_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
05 Jan 2011 14:22:33  974347   12325559   hadam3p_saf_24vi_1989_1_007038406_0  92,256    239,085         2.5915
05 Jan 2011 03:25:54  974347   12325559   hadam3p_saf_24vi_1989_1_007038406_0  80,736    209,220         2.5914
04 Jan 2011 16:36:00  974347   12325559   hadam3p_saf_24vi_1989_1_007038406_0  69,216    179,742         2.5968
04 Jan 2011 06:02:33  974347   12325559   hadam3p_saf_24vi_1989_1_007038406_0  57,696    149,616         2.5932
03 Jan 2011 18:33:23  974347   12325559   hadam3p_saf_24vi_1989_1_007038406_0  46,176    119,960         2.5979
02 Jan 2011 23:12:36  974347   12325559   hadam3p_saf_24vi_1989_1_007038406_0  34,656    89,581          2.5849
02 Jan 2011 13:25:11  974347   12325559   hadam3p_saf_24vi_1989_1_007038406_0  23,136    60,546          2.6170
26 Dec 2010 19:56:38  974347   12325559   hadam3p_saf_24vi_1989_1_007038406_0  11,616    31,033          2.6716
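
The Average (sec/TS) column in the trickle table looks consistent with CPU Time (sec) divided by Timestep, rounded to four decimals. The sketch below checks that reading against the listed rows. It is an assumption about how the column is derived, not something the page states, and the function name `seconds_per_timestep` and the tuple layout are made up for illustration.

```python
# Trickle rows copied from the table: (time sent, timestep, CPU seconds, reported average).
# The tuple layout is a hypothetical choice for this sketch.
TRICKLES = [
    ("05 Jan 2011 14:22:33", 92256, 239085, 2.5915),
    ("05 Jan 2011 03:25:54", 80736, 209220, 2.5914),
    ("04 Jan 2011 16:36:00", 69216, 179742, 2.5968),
    ("04 Jan 2011 06:02:33", 57696, 149616, 2.5932),
    ("03 Jan 2011 18:33:23", 46176, 119960, 2.5979),
    ("02 Jan 2011 23:12:36", 34656, 89581, 2.5849),
    ("02 Jan 2011 13:25:11", 23136, 60546, 2.6170),
    ("26 Dec 2010 19:56:38", 11616, 31033, 2.6716),
]


def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Assumed derivation: cumulative CPU seconds per completed timestep."""
    return round(cpu_seconds / timestep, 4)


if __name__ == "__main__":
    for sent, timestep, cpu_seconds, reported in TRICKLES:
        computed = seconds_per_timestep(cpu_seconds, timestep)
        # Print the computed value next to the value shown on the page.
        print(f"{sent}  computed={computed:.4f}  reported={reported:.4f}")
```

If the assumption holds, each computed value agrees with the reported one to four decimal places, which suggests the average is cumulative over the whole run rather than per trickle interval.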

