climateprediction.net home page
Task 13022234

Task 13022234

Name hadam3p_eu_2kcv_1980_1_007303116_1
Workunit 7500540
Created 28 Jun 2011, 17:47:55 UTC
Sent 28 Jun 2011, 17:48:48 UTC
Report deadline 9 Jun 2012, 23:08:48 UTC
Received 24 Jul 2011, 16:59:27 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1004102
Run time 3 days 22 hours 44 min 39 sec
CPU time 3 days 13 hours 36 min 11 sec
Validate state Invalid
Credit 1,790.21
Device peak FLOPS 2.34 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<stderr_txt>
15:27:16 (5512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4280, selfPID=5032, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2128, selfPID=664, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5140, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1644, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6028, selfPID=5228, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2016, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4544, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4976, selfPID=4976, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2864, selfPID=2864, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=444, selfPID=5240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4316, selfPID=4960, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3160, selfPID=5328, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3412, selfPID=4108, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3224, selfPID=5288, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6120, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3864, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4420, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3360, selfPID=2896, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3224, selfPID=4888, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=
2
del crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2kcv_1980_1_007303116\tmp\xaakm.namelists
Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  013AA39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01352CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01351E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01332819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01232287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  012CE7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  012CF2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01049BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0138E638  Unknown               Unknown  Unknown
kernel32.dll       76E4ED6C  Unknown               Unknown  Unknown
ntdll.dll          774337F5  Unknown               Unknown  Unknown
ntdll.dll          774337C8  Unknown               Unknown  Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2kcv_1980_1_007303116\tmp\xaakg.namelists
Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  00C6C52A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00C14460  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00C1362A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00BF2469  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00AF66EB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00B92AE2  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00B935AF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00939860  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00C50893  Unknown               Unknown  Unknown
kernel32.dll       76E4ED6C  Unknown               Unknown  Unknown
ntdll.dll          774337F5  Unknown               Unknown  Unknown
ntdll.dll          774337C8  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5012, selfPID=4648, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_2kcv_1980_1_007303116_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2kcv_1980_1_007303116_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2kcv_1980_1_007303116_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
25 Jul 2011 22:04:49 1004102 13022234 hadam3p_eu_2kcv_1980_1_007303116_1 103,776 308,590 2.9736
25 Jul 2011 20:55:41 1004102 13022234 hadam3p_eu_2kcv_1980_1_007303116_1 92,256 275,870 2.9903
25 Jul 2011 19:40:35 1004102 13022234 hadam3p_eu_2kcv_1980_1_007303116_1 80,736 243,783 3.0195
25 Jul 2011 19:22:44 1004102 13022234 hadam3p_eu_2kcv_1980_1_007303116_1 69,216 212,861 3.0753
25 Jul 2011 19:22:44 1004102 13022234 hadam3p_eu_2kcv_1980_1_007303116_1 57,696 181,423 3.1445
25 Jul 2011 18:11:38 1004102 13022234 hadam3p_eu_2kcv_1980_1_007303116_1 46,176 149,811 3.2443
25 Jul 2011 17:20:18 1004102 13022234 hadam3p_eu_2kcv_1980_1_007303116_1 34,656 114,750 3.3111
25 Jul 2011 14:41:27 1004102 13022234 hadam3p_eu_2kcv_1980_1_007303116_1 23,136 79,437 3.4335
25 Jul 2011 14:06:15 1004102 13022234 hadam3p_eu_2kcv_1980_1_007303116_1 11,616 40,438 3.4812


©2024 cpdn.org