climateprediction.net home page
Task 14119482

Task 14119482

Name hadam3p_eu_9jhx_1988_1_007761237_0
Workunit 7916346
Created 20 Feb 2012, 17:11:16 UTC
Sent 20 Feb 2012, 17:14:57 UTC
Report deadline 1 Feb 2013, 22:34:57 UTC
Received 6 Mar 2012, 7:13:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1149893
Run time 5 days 19 hours 35 min 5 sec
CPU time 4 days 11 hours 9 min 22 sec
Validate state Invalid
Credit 1,790.21
Device peak FLOPS 1.64 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6924, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4412, selfPID=6432, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7216, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5564, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=556, selfPID=5504, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
06:35:57 (5712): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:36:21 (5712): No heartbeat from core client for 30 sec - exiting
06:36:23 (5712): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7000, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3340, selfPID=4076, iMonCtr=1
Model crash detected, will try to restart...
07:05:47 (6436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:05:49 (6436): No heartbeat from core client for 30 sec - exiting
07:05:50 (6436): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1940, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4048, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6812, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
23:08:13 (6304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3328, selfPID=7224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6236, selfPID=6820, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3204, selfPID=6524, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_9jhx_1988_1_007761237\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  00E4C52A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00DF4460  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00DF362A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00DD2469  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00CD66EB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00D72AE2  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00D735AF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00B19860  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00E30893  Unknown               Unknown  Unknown
kernel32.dll       75CC339A  Unknown               Unknown  Unknown
ntdll.dll          77439EF2  Unknown               Unknown  Unknown
ntdll.dll          77439EC5  Unknown               Unknown  Unknown
rrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_9jhx_1988_1_007761237\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  0174A39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  016F2CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  016F1E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  016D2819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  015D2287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0166E7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0166F2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  013E9BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0172E638  Unknown               Unknown  Unknown
kernel32.dll       75CC339A  Unknown               Unknown  Unknown
ntdll.dll          77439EF2  Unknown               Unknown  Unknown
ntdll.dll          77439EC5  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7460, selfPID=6932, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_9jhx_1988_1_007761237_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_9jhx_1988_1_007761237_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_9jhx_1988_1_007761237_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
04 Mar 2012 12:52:56 1149893 14119482 hadam3p_eu_9jhx_1988_1_007761237_0 103,776 359,132 3.4606
03 Mar 2012 12:59:18 1149893 14119482 hadam3p_eu_9jhx_1988_1_007761237_0 92,256 319,942 3.4680
02 Mar 2012 12:28:18 1149893 14119482 hadam3p_eu_9jhx_1988_1_007761237_0 80,736 280,889 3.4791
01 Mar 2012 15:14:26 1149893 14119482 hadam3p_eu_9jhx_1988_1_007761237_0 69,216 242,053 3.4971
29 Feb 2012 20:17:52 1149893 14119482 hadam3p_eu_9jhx_1988_1_007761237_0 57,696 202,448 3.5089
28 Feb 2012 22:30:58 1149893 14119482 hadam3p_eu_9jhx_1988_1_007761237_0 46,176 162,450 3.5181
28 Feb 2012 11:14:24 1149893 14119482 hadam3p_eu_9jhx_1988_1_007761237_0 34,656 124,405 3.5897
22 Feb 2012 20:34:17 1149893 14119482 hadam3p_eu_9jhx_1988_1_007761237_0 23,136 84,027 3.6319
21 Feb 2012 20:57:15 1149893 14119482 hadam3p_eu_9jhx_1988_1_007761237_0 11,616 41,905 3.6075


©2024 climateprediction.net