Task 14810528

Name hadam3p_pnw_bn48_1960_1_008009658_0
Workunit 8164772
Created 21 Jun 2012, 9:28:26 UTC
Sent 21 Jun 2012, 9:29:40 UTC
Report deadline 3 Jun 2013, 14:49:40 UTC
Received 7 Aug 2012, 10:32:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1114818
Run time 3 days 4 hours 5 min 38 sec
CPU time 2 days 22 hours 23 min 9 sec
Validate state Invalid
Credit 1,503.98
Device peak FLOPS 2.88 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5868, selfPID=5704, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5028, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5720, selfPID=5096, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5248, selfPID=5536, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
GCPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=2
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1528, selfPID=4928, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3760, selfPID=5256, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2220, selfPID=5044, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5972, selfPID=5236, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3160, selfPID=4480, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4292, selfPID=4528, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4968, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6124, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2296, selfPID=5944, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5244, selfPID=4192, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5912, selfPID=5440, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5744, selfPID=4720, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_pnw_bn48_1960_1_008009658\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_pnw_um_6.  004BC52A  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00464460  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  0046362A  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00442469  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  003466EB  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  003E2AE2  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  003E35AF  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  00189860  Unknown               Unknown  Unknown
hadrm3p_pnw_um_6.  004A0893  Unknown               Unknown  Unknown
kernel32.dll       767D3677  Unknown               Unknown  Unknown
ntdll.dll          77D69F42  Unknown               Unknown  Unknown
ntdll.dll          77D69F15  Unknown               Unknown  Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_pnw_bn48_1960_1_008009658\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_pnw_um_6.  00C1A39A  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00BC2CD0  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00BC1E9A  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00BA2819  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00AA2287  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00B3E7B2  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00B3F2DA  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  008B9BD2  Unknown               Unknown  Unknown
hadam3p_pnw_um_6.  00BFE638  Unknown               Unknown  Unknown
kernel32.dll       767D3677  Unknown               Unknown  Unknown
ntdll.dll          77D69F42  Unknown               Unknown  Unknown
ntdll.dll          77D69F15  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3228, selfPID=5148, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_bn48_1960_1_008009658_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bn48_1960_1_008009658_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bn48_1960_1_008009658_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bn48_1960_1_008009658_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bn48_1960_1_008009658_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bn48_1960_1_008009658_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
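The upload failures in the message block above can be pulled out mechanically. The following is a minimal sketch, assuming the `<file_xfer_error>` entries always have the shape shown in this log; the function name `parse_upload_failures` and the regular expression are illustrative and not part of BOINC. Only the first two entries are reproduced in the sample text.

```python
import re

# Sample copied from the <message> block of this task (first two entries only).
message = """upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_bn48_1960_1_008009658_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bn48_1960_1_008009658_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
"""

# Hypothetical pattern: one file name followed by one integer error code per entry.
XFER_ERROR = re.compile(
    r"<file_xfer_error>\s*"
    r"<file_name>(.*?)</file_name>\s*"
    r"<error_code>(-?\d+)</error_code>\s*"
    r"</file_xfer_error>"
)

def parse_upload_failures(text):
    """Return a list of (file_name, error_code) tuples found in text."""
    return [(name, int(code)) for name, code in XFER_ERROR.findall(text)]

failures = parse_upload_failures(message)
for name, code in failures:
    print(f"{name}: error {code}")
```

Run against the full message, the same function would list all six failed uploads (`_7.zip` through `_12.zip`), each with error code -161.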
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
31 Jul 2012 15:49:22  1114818  14810528   hadam3p_pnw_bn48_1960_1_008009658_0  69,216    222,532         3.2150
25 Jul 2012 10:12:29  1114818  14810528   hadam3p_pnw_bn48_1960_1_008009658_0  57,696    184,311         3.1945
16 Jul 2012 16:48:40  1114818  14810528   hadam3p_pnw_bn48_1960_1_008009658_0  46,176    146,149         3.1650
09 Jul 2012 13:20:42  1114818  14810528   hadam3p_pnw_bn48_1960_1_008009658_0  34,656    108,342         3.1262
03 Jul 2012 12:38:46  1114818  14810528   hadam3p_pnw_bn48_1960_1_008009658_0  23,136    70,653          3.0538
27 Jun 2012 10:29:36  1114818  14810528   hadam3p_pnw_bn48_1960_1_008009658_0  11,616    37,238          3.2058
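
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the six rows above; the variable and function names are illustrative, and the formula is an inference from the numbers rather than something the page states.

```python
# (timestep, cpu_seconds, reported_average) copied from the trickle table, newest first.
trickles = [
    (69216, 222532, 3.2150),
    (57696, 184311, 3.1945),
    (46176, 146149, 3.1650),
    (34656, 108342, 3.1262),
    (23136, 70653, 3.0538),
    (11616, 37238, 3.2058),
]

def seconds_per_timestep(cpu_seconds, timestep):
    """Assumed definition: cumulative CPU seconds divided by timesteps completed."""
    return cpu_seconds / timestep

# Compare the computed value with the reported one, allowing for 4-decimal rounding.
checks = []
for timestep, cpu_seconds, reported in trickles:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    matches = abs(computed - reported) < 1e-4
    checks.append(matches)
    print(f"timestep {timestep}: computed {computed:.4f}, reported {reported:.4f}, match={matches}")

# Spacing between consecutive trickles, in timesteps.
intervals = [a[0] - b[0] for a, b in zip(trickles, trickles[1:])]
print("timesteps between trickles:", intervals)
```

Under this assumption every row matches to four decimal places, and consecutive trickles are spaced 11,520 timesteps apart.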

