Task 14467066

Name hadam3p_pnw_bu4e_1997_1_007925341_0
Workunit 8080453
Created 18 Apr 2012, 13:22:52 UTC
Sent 3 May 2012, 20:03:58 UTC
Report deadline 16 Apr 2013, 1:23:58 UTC
Received 1 Jun 2012, 8:37:17 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1215667
Run time 1 days 18 hours 33 min 41 sec
CPU time 10 hours 17 min 48 sec
Validate state Invalid
Credit 502.72
Device peak FLOPS 2.93 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 (windows_intelx86)
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:38:58 (3576): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4024, selfPID=4024, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3516, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
11:03:54 (3196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4488, selfPID=3928, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1344, selfPID=4788, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4928, selfPID=4928, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=636, selfPID=636, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4912, selfPID=4104, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
22:29:44 (3936): No heartbeat from core client for 30 sec - exiting
22:29:45 (3936): No heartbeat from core client for 30 sec - exiting
22:29:46 (3936): No heartbeat from core client for 30 sec - exiting
22:29:47 (3936): No heartbeat from core client for 30 sec - exiting
22:29:48 (3936): No heartbeat from core client for 30 sec - exiting
22:29:49 (3936): No heartbeat from core client for 30 sec - exiting
22:29:50 (3936): No heartbeat from core client for 30 sec - exiting
22:29:52 (3936): No heartbeat from core client for 30 sec - exiting
22:29:53 (3936): No heartbeat from core client for 30 sec - exiting
22:29:54 (3936): No heartbeat from core client for 30 sec - exiting
22:29:55 (3936): No heartbeat from core client for 30 sec - exiting
22:29:56 (3936): No heartbeat from core client for 30 sec - exiting
22:29:57 (3936): No heartbeat from core client for 30 sec - exiting
22:29:58 (3936): No heartbeat from core client for 30 sec - exiting
22:29:59 (3936): No heartbeat from core client for 30 sec - exiting
22:30:00 (3936): No heartbeat from core client for 30 sec - exiting
22:30:01 (3936): No heartbeat from core client for 30 sec - exiting
22:30:02 (3936): No heartbeat from core client for 30 sec - exiting
22:30:04 (3936): No heartbeat from core client for 30 sec - exiting
22:30:05 (3936): No heartbeat from core client for 30 sec - exiting
22:30:06 (3936): No heartbeat from core client for 30 sec - exiting
22:30:07 (3936): No heartbeat from core client for 30 sec - exiting
22:30:08 (3936): No heartbeat from core client for 30 sec - exiting
22:30:09 (3936): No heartbeat from core client for 30 sec - exiting
22:30:10 (3936): No heartbeat from core client for 30 sec - exiting
22:30:11 (3936): No heartbeat from core client for 30 sec - exiting
22:30:12 (3936): No heartbeat from core client for 30 sec - exiting
22:30:13 (3936): No heartbeat from core client for 30 sec - exiting
22:30:14 (3936): No heartbeat from core client for 30 sec - exiting
22:30:16 (3936): No heartbeat from core client for 30 sec - exiting
22:30:17 (3936): No heartbeat from core client for 30 sec - exiting
22:30:18 (3936): No heartbeat from core client for 30 sec - exiting
22:30:19 (3936): No heartbeat from core client for 30 sec - exiting
22:30:20 (3936): No heartbeat from core client for 30 sec - exiting
22:30:21 (3936): No heartbeat from core client for 30 sec - exiting
22:30:22 (3936): No heartbeat from core client for 30 sec - exiting
22:30:23 (3936): No heartbeat from core client for 30 sec - exiting
22:30:24 (3936): No heartbeat from core client for 30 sec - exiting
22:30:25 (3936): No heartbeat from core client for 30 sec - exiting
22:30:26 (3936): No heartbeat from core client for 30 sec - exiting
22:30:28 (3936): No heartbeat from core client for 30 sec - exiting
22:30:29 (3936): No heartbeat from core client for 30 sec - exiting
22:30:30 (3936): No heartbeat from core client for 30 sec - exiting
22:30:31 (3936): No heartbeat from core client for 30 sec - exiting
22:30:32 (3936): No heartbeat from core client for 30 sec - exiting
22:30:33 (3936): No heartbeat from core client for 30 sec - exiting
22:30:34 (3936): No heartbeat from core client for 30 sec - exiting
22:30:35 (3936): No heartbeat from core client for 30 sec - exiting
22:30:36 (3936): No heartbeat from core client for 30 sec - exiting
22:30:38 (3936): No heartbeat from core client for 30 sec - exiting
22:30:39 (3936): No heartbeat from core client for 30 sec - exiting
22:30:40 (3936): No heartbeat from core client for 30 sec - exiting
22:30:41 (3936): No heartbeat from core client for 30 sec - exiting
22:30:42 (3936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4212, selfPID=4212, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3156, selfPID=3156, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4828, selfPID=3256, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:20:30 (3972): No heartbeat from core client for 30 sec - exiting
11:20:31 (3972): No heartbeat from core client for 30 sec - exiting
11:20:32 (3972): No heartbeat from core client for 30 sec - exiting
11:20:33 (3972): No heartbeat from core client for 30 sec - exiting
11:20:34 (3972): No heartbeat from core client for 30 sec - exiting
11:20:35 (3972): No heartbeat from core client for 30 sec - exiting
11:20:36 (3972): No heartbeat from core client for 30 sec - exiting
11:20:37 (3972): No heartbeat from core client for 30 sec - exiting
11:20:38 (3972): No heartbeat from core client for 30 sec - exiting
11:20:40 (3972): No heartbeat from core client for 30 sec - exiting
11:20:41 (3972): No heartbeat from core client for 30 sec - exiting
11:20:42 (3972): No heartbeat from core client for 30 sec - exiting
11:20:43 (3972): No heartbeat from core client for 30 sec - exiting
11:20:44 (3972): No heartbeat from core client for 30 sec - exiting
11:20:45 (3972): No heartbeat from core client for 30 sec - exiting
11:20:46 (3972): No heartbeat from core client for 30 sec - exiting
11:20:47 (3972): No heartbeat from core client for 30 sec - exiting
11:20:48 (3972): No heartbeat from core client for 30 sec - exiting
11:20:49 (3972): No heartbeat from core client for 30 sec - exiting
11:20:50 (3972): No heartbeat from core client for 30 sec - exiting
11:20:52 (3972): No heartbeat from core client for 30 sec - exiting
11:20:53 (3972): No heartbeat from core client for 30 sec - exiting
11:20:54 (3972): No heartbeat from core client for 30 sec - exiting
11:20:55 (3972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:34:51 (3324): No heartbeat from core client for 30 sec - exiting
16:34:52 (3324): No heartbeat from core client for 30 sec - exiting
16:34:53 (3324): No heartbeat from core client for 30 sec - exiting
16:34:54 (3324): No heartbeat from core client for 30 sec - exiting
16:34:55 (3324): No heartbeat from core client for 30 sec - exiting
16:34:56 (3324): No heartbeat from core client for 30 sec - exiting
16:34:57 (3324): No heartbeat from core client for 30 sec - exiting
16:34:58 (3324): No heartbeat from core client for 30 sec - exiting
16:35:00 (3324): No heartbeat from core client for 30 sec - exiting
16:35:01 (3324): No heartbeat from core client for 30 sec - exiting
16:35:02 (3324): No heartbeat from core client for 30 sec - exiting
16:35:03 (3324): No heartbeat from core client for 30 sec - exiting
16:35:04 (3324): No heartbeat from core client for 30 sec - exiting
16:35:05 (3324): No heartbeat from core client for 30 sec - exiting
16:35:06 (3324): No heartbeat from core client for 30 sec - exiting
16:35:07 (3324): No heartbeat from core client for 30 sec - exiting
16:35:08 (3324): No heartbeat from core client for 30 sec - exiting
16:35:09 (3324): No heartbeat from core client for 30 sec - exiting
16:35:11 (3324): No heartbeat from core client for 30 sec - exiting
16:35:12 (3324): No heartbeat from core client for 30 sec - exiting
16:35:13 (3324): No heartbeat from core client for 30 sec - exiting
16:35:14 (3324): No heartbeat from core client for 30 sec - exiting
16:35:15 (3324): No heartbeat from core client for 30 sec - exiting
16:35:16 (3324): No heartbeat from core client for 30 sec - exiting
16:35:17 (3324): No heartbeat from core client for 30 sec - exiting
16:35:18 (3324): No heartbeat from core client for 30 sec - exiting
16:35:19 (3324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3012, selfPID=4628, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_bu4e_1997_1_007925341/dataout/atmos_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_pnw_bu4e_1997_1_007925341\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_pnw_um_6.  016FC52A  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  016A4460  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  016A362A  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  01682469  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  015866EB  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  01622AE2  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  016235AF  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  013C9860  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  016E0893  Unknown               Unknown  Unknown
kernel32.dll       7657339A  Unknown               Unknown  Unknown
ntdll.dll          76F69EF2  Unknown               Unknown  Unknown
ntdll.dll          76F69EC5  Unknown               Unknown  Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_pnw_bu4e_1997_1_007925341\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_pnw_um_6.  00D6A39A  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00D12CD0  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00D11E9A  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00CF2819  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00BF2287  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00C8E7B2  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00C8F2DA  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00A09BD2  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00D4E638  Unknown               Unknown  Unknown
kernel32.dll       7657339A  Unknown               Unknown  Unknown
ntdll.dll          76F69EF2  Unknown               Unknown  Unknown
ntdll.dll          76F69EC5  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2472, selfPID=2376, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 0
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_bu4e_1997_1_007925341_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bu4e_1997_1_007925341_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bu4e_1997_1_007925341_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bu4e_1997_1_007925341_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bu4e_1997_1_007925341_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bu4e_1997_1_007925341_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bu4e_1997_1_007925341_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bu4e_1997_1_007925341_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bu4e_1997_1_007925341_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bu4e_1997_1_007925341_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
| Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
|---|---|---|---|---|---|---|
| 07 May 2012 22:13:44 | 1215667 | 14467066 | hadam3p_pnw_bu4e_1997_1_007925341_0 | 23,136 | 59,027 | 2.5513 |
| 20 May 2012 09:17:15 | 1215667 | 14467066 | hadam3p_pnw_bu4e_1997_1_007925341_0 | 11,632 | 29,596 | 2.5444 |
| 19 May 2012 14:51:57 | 1215667 | 14467066 | hadam3p_pnw_bu4e_1997_1_007925341_0 | 11,625 | 29,205 | 2.5123 |
| 18 May 2012 15:19:32 | 1215667 | 14467066 | hadam3p_pnw_bu4e_1997_1_007925341_0 | 11,619 | 28,827 | 2.4810 |
| 04 May 2012 23:21:33 | 1215667 | 14467066 | hadam3p_pnw_bu4e_1997_1_007925341_0 | 11,616 | 28,591 | 2.4613 |
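
The Average (sec/TS) column in the trickle table looks like CPU time divided by timestep. That relationship is an inference from the numbers, not something the page states. A minimal Python sketch to check it against the five rows above (the names `TRICKLES`, `sec_per_timestep`, and `max_deviation` are made up for this illustration):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
TRICKLES = [
    (23136, 59027, 2.5513),
    (11632, 29596, 2.5444),
    (11625, 29205, 2.5123),
    (11619, 28827, 2.4810),
    (11616, 28591, 2.4613),
]


def sec_per_timestep(timestep, cpu_time_sec):
    """Assumed formula: CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


def max_deviation(rows):
    """Largest gap between the assumed formula and the reported average."""
    return max(abs(sec_per_timestep(ts, cpu) - avg) for ts, cpu, avg in rows)


if __name__ == "__main__":
    for ts, cpu, avg in TRICKLES:
        print(ts, cpu, avg, round(sec_per_timestep(ts, cpu), 4))
    print("max deviation:", max_deviation(TRICKLES))
```

If the assumption holds, every recomputed value agrees with the reported one to the four decimal places shown in the table.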