climateprediction.net home page
Task 15087889

Task 15087889

Name hadam3p_saf_1rm4_1987_1_007002020_1
Workunit 7205336
Created 10 Aug 2012, 7:50:19 UTC
Sent 10 Aug 2012, 7:59:27 UTC
Report deadline 23 Jul 2013, 13:19:27 UTC
Received 19 Aug 2012, 8:52:56 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1205594
Run time 2 days 19 hours 2 min 4 sec
CPU time 2 days 4 hours 38 min 27 sec
Validate state Invalid
Credit 1,496.58
Device peak FLOPS 3.13 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1672, selfPID=1104, iMonCtr=1
Model crash detected, will try to restart...
23:14:42 (2944): No heartbeat from core client for 30 sec - exiting
23:14:43 (2944): No heartbeat from core client for 30 sec - exiting
23:14:44 (2944): No heartbeat from core client for 30 sec - exiting
23:14:45 (2944): No heartbeat from core client for 30 sec - exiting
23:14:46 (2944): No heartbeat from core client for 30 sec - exiting
23:14:47 (2944): No heartbeat from core client for 30 sec - exiting
23:14:48 (2944): No heartbeat from core client for 30 sec - exiting
23:14:49 (2944): No heartbeat from core client for 30 sec - exiting
23:14:50 (2944): No heartbeat from core client for 30 sec - exiting
23:14:51 (2944): No heartbeat from core client for 30 sec - exiting
23:14:52 (2944): No heartbeat from core client for 30 sec - exiting
23:14:53 (2944): No heartbeat from core client for 30 sec - exiting
07:58:23 (2944): No heartbeat from core client for 30 sec - exiting
07:58:24 (2944): No heartbeat from core client for 30 sec - exiting
07:58:25 (2944): No heartbeat from core client for 30 sec - exiting
07:58:26 (2944): No heartbeat from core client for 30 sec - exiting
07:58:27 (2944): No heartbeat from core client for 30 sec - exiting
07:58:28 (2944): No heartbeat from core client for 30 sec - exiting
07:58:29 (2944): No heartbeat from core client for 30 sec - exiting
07:58:30 (2944): No heartbeat from core client for 30 sec - exiting
07:58:31 (2944): No heartbeat from core client for 30 sec - exiting
07:58:32 (2944): No heartbeat from core client for 30 sec - exiting
07:58:33 (2944): No heartbeat from core client for 30 sec - exiting
07:58:34 (2944): No heartbeat from core client for 30 sec - exiting
07:58:36 (2944): No heartbeat from core client for 30 sec - exiting
07:58:37 (2944): No heartbeat from core client for 30 sec - exiting
07:58:38 (2944): No heartbeat from core client for 30 sec - exiting
07:58:39 (2944): No heartbeat from core client for 30 sec - exiting
07:58:40 (2944): No heartbeat from core client for 30 sec - exiting
07:58:41 (2944): No heartbeat from core client for 30 sec - exiting
07:58:42 (2944): No heartbeat from core client for 30 sec - exiting
07:58:43 (2944): No heartbeat from core client for 30 sec - exiting
07:58:44 (2944): No heartbeat from core client for 30 sec - exiting
07:58:45 (2944): No heartbeat from core client for 30 sec - exiting
07:58:46 (2944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=732, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2644, iMonCtr=2
Model crash detected, will try to restart...
07:29:29 (3060): No heartbeat from core client for 30 sec - exiting
07:29:30 (3060): No heartbeat from core client for 30 sec - exiting
07:29:32 (3060): No heartbeat from core client for 30 sec - exiting
07:29:33 (3060): No heartbeat from core client for 30 sec - exiting
07:29:34 (3060): No heartbeat from core client for 30 sec - exiting
07:29:35 (3060): No heartbeat from core client for 30 sec - exiting
07:29:36 (3060): No heartbeat from core client for 30 sec - exiting
07:29:37 (3060): No heartbeat from core client for 30 sec - exiting
07:29:38 (3060): No heartbeat from core client for 30 sec - exiting
07:29:39 (3060): No heartbeat from core client for 30 sec - exiting
07:29:40 (3060): No heartbeat from core client for 30 sec - exiting
07:29:41 (3060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3784, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3860, selfPID=3048, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3872, selfPID=2672, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_saf_1rm4_1987_1_007002020\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_saf_um_6.  00C5A39A  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00C02CD0  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00C01E9A  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00BE2819  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00AE2287  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00B7E7B2  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00B7F2DA  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  008F9BD2  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00C3E638  Unknown               Unknown  Unknown
kernel32.dll       761A339A  Unknown               Unknown  Unknown
ntdll.dll          77569EF2  Unknown               Unknown  Unknown
ntdll.dll          77569EC5  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3760, selfPID=3388, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_saf_1rm4_1987_1_007002020_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1rm4_1987_1_007002020_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1rm4_1987_1_007002020_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1rm4_1987_1_007002020_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 Aug 2012 12:14:43 1205594 15087889 hadam3p_saf_1rm4_1987_1_007002020_1 92,256 188,503 2.0433
18 Aug 2012 04:38:23 1205594 15087889 hadam3p_saf_1rm4_1987_1_007002020_1 80,736 166,319 2.0600
17 Aug 2012 20:51:21 1205594 15087889 hadam3p_saf_1rm4_1987_1_007002020_1 69,216 144,323 2.0851
17 Aug 2012 06:09:22 1205594 15087889 hadam3p_saf_1rm4_1987_1_007002020_1 57,696 122,354 2.1207
15 Aug 2012 07:59:50 1205594 15087889 hadam3p_saf_1rm4_1987_1_007002020_1 46,176 97,952 2.1213
12 Aug 2012 06:24:57 1205594 15087889 hadam3p_saf_1rm4_1987_1_007002020_1 34,656 72,411 2.0894
11 Aug 2012 11:17:19 1205594 15087889 hadam3p_saf_1rm4_1987_1_007002020_1 23,136 47,461 2.0514
10 Aug 2012 18:30:27 1205594 15087889 hadam3p_saf_1rm4_1987_1_007002020_1 11,616 24,014 2.0673


©2024 cpdn.org