climateprediction.net home page
Task 14377578

Task 14377578

Name hadam3p_saf_0obd_1984_1_006845489_1
Workunit 7048805
Created 7 Apr 2012, 16:33:00 UTC
Sent 7 Apr 2012, 16:33:45 UTC
Report deadline 20 Mar 2013, 21:53:45 UTC
Received 16 Apr 2012, 12:18:02 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1194961
Run time 3 days 23 hours 15 min 5 sec
CPU time 3 days 19 hours 2 min 19 sec
Validate state Invalid
Credit 1,870.33
Device peak FLOPS 2.08 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2916, selfPID=2916, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:44:58 (3724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2108, selfPID=2852, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5024, selfPID=4592, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4848, selfPID=4848, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3304, selfPID=4172, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
08:11:46 (2904): No heartbeat from core client for 30 sec - exiting
08:11:47 (2904): No heartbeat from core client for 30 sec - exiting
08:11:48 (2904): No heartbeat from core client for 30 sec - exiting
08:11:49 (2904): No heartbeat from core client for 30 sec - exiting
08:11:50 (2904): No heartbeat from core client for 30 sec - exiting
08:11:51 (2904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:53:17 (3448): No heartbeat from core client for 30 sec - exiting
12:53:18 (3448): No heartbeat from core client for 30 sec - exiting
12:53:19 (3448): No heartbeat from core client for 30 sec - exiting
12:53:20 (3448): No heartbeat from core client for 30 sec - exiting
12:53:21 (3448): No heartbeat from core client for 30 sec - exiting
12:53:22 (3448): No heartbeat from core client for 30 sec - exiting
12:53:23 (3448): No heartbeat from core client for 30 sec - exiting
12:53:24 (3448): No heartbeat from core client for 30 sec - exiting
12:53:25 (3448): No heartbeat from core client for 30 sec - exiting
12:53:26 (3448): No heartbeat from core client for 30 sec - exiting
12:53:27 (3448): No heartbeat from core client for 30 sec - exiting
12:53:28 (3448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:49:58 (2860): No heartbeat from core client for 30 sec - exiting
04:49:59 (2860): No heartbeat from core client for 30 sec - exiting
04:50:00 (2860): No heartbeat from core client for 30 sec - exiting
04:50:01 (2860): No heartbeat from core client for 30 sec - exiting
04:50:02 (2860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3836, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3840, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_saf_0obd_1984_1_006845489_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_0obd_1984_1_006845489_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Apr 2012 06:31:48 1194961 14377578 hadam3p_saf_0obd_1984_1_006845489_1 115,296 311,264 2.6997
14 Apr 2012 18:34:40 1194961 14377578 hadam3p_saf_0obd_1984_1_006845489_1 103,776 280,932 2.7071
14 Apr 2012 06:34:01 1194961 14377578 hadam3p_saf_0obd_1984_1_006845489_1 92,256 250,623 2.7166
13 Apr 2012 15:34:11 1194961 14377578 hadam3p_saf_0obd_1984_1_006845489_1 80,736 220,816 2.7350
12 Apr 2012 12:18:16 1194961 14377578 hadam3p_saf_0obd_1984_1_006845489_1 69,216 189,885 2.7434
11 Apr 2012 14:30:21 1194961 14377578 hadam3p_saf_0obd_1984_1_006845489_1 57,696 160,143 2.7756
10 Apr 2012 16:45:43 1194961 14377578 hadam3p_saf_0obd_1984_1_006845489_1 46,176 129,434 2.8031
10 Apr 2012 12:47:58 1194961 14377578 hadam3p_saf_0obd_1984_1_006845489_1 34,656 98,480 2.8416
09 Apr 2012 12:08:10 1194961 14377578 hadam3p_saf_0obd_1984_1_006845489_1 23,136 67,282 2.9081
08 Apr 2012 17:50:25 1194961 14377578 hadam3p_saf_0obd_1984_1_006845489_1 11,616 34,453 2.9660


©2024 cpdn.org