climateprediction.net home page
Task 16440330

Name hadam3p_eu_n17l_2013_1_008541220_2
Workunit 8688732
Created 1 Apr 2014, 19:18:50 UTC
Sent 1 Apr 2014, 19:28:47 UTC
Report deadline 15 Mar 2015, 0:48:47 UTC
Received 17 Jun 2014, 14:33:34 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1294522
Run time 2 days 5 hours 37 min 25 sec
CPU time 2 days 1 hour 55 min 4 sec
Validate state Invalid
Credit 1,194.02
Device peak FLOPS 2.83 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1892, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=188, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3220, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6108, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6088, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4204, selfPID=5092, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=248, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5352, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3784, selfPID=4420, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5864, selfPID=4468, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4500, selfPID=4604, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7452, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4120, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:59:31 (5944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5616, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4440, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_n17l_2013_1_008541220/dataout/atmos_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_n17l_2013_1_008541220\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  0057C52A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00524460  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0052362A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00502469  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  004066EB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  004A2AE2  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  004A35AF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00249860  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00560893  Unknown               Unknown  Unknown
kernel32.dll       75CCEE1C  Unknown               Unknown  Unknown
ntdll.dll          777037EB  Unknown               Unknown  Unknown
ntdll.dll          777037BE  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=2
Model crash detected, will try to restart...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_n17l_2013_1_008541220\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  0069A39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00642CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00641E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00622819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00522287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  005BE7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  005BF2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00339BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0067E638  Unknown               Unknown  Unknown
kernel32.dll       75CCEE1C  Unknown               Unknown  Unknown
ntdll.dll          777037EB  Unknown               Unknown  Unknown
ntdll.dll          777037BE  Unknown               Unknown  Unknown
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_n17l_2013_1_008541220_2_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_n17l_2013_1_008541220_2_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_n17l_2013_1_008541220_2_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_n17l_2013_1_008541220_2_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_n17l_2013_1_008541220_2_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_n17l_2013_1_008541220_2_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
16 Jun 2014 15:03:56  1294522  16440330   hadam3p_eu_n17l_2013_1_008541220_2  69,216    155,677         2.2491
10 Jun 2014 21:20:52  1294522  16440330   hadam3p_eu_n17l_2013_1_008541220_2  57,696    130,309         2.2585
07 Apr 2014 15:21:43  1294522  16440330   hadam3p_eu_n17l_2013_1_008541220_2  46,176    104,619         2.2657
04 Apr 2014 16:40:17  1294522  16440330   hadam3p_eu_n17l_2013_1_008541220_2  34,656    78,282          2.2588
03 Apr 2014 17:36:34  1294522  16440330   hadam3p_eu_n17l_2013_1_008541220_2  23,136    52,225          2.2573
02 Apr 2014 18:35:26  1294522  16440330   hadam3p_eu_n17l_2013_1_008541220_2  11,616    26,082          2.2454
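
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. That is an inference from the numbers, not something the page states. A minimal sketch that recomputes it from the rows above; the names `trickles` and `average_sec_per_timestep` are illustrative and are not part of BOINC or the CPDN software:

```python
# Trickle rows copied from the table above, oldest first:
# (timestep, cumulative CPU time in seconds).
trickles = [
    (11616, 26082),
    (23136, 52225),
    (34656, 78282),
    (46176, 104619),
    (57696, 130309),
    (69216, 155677),
]


def average_sec_per_timestep(timestep, cpu_seconds):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_seconds / timestep


averages = [average_sec_per_timestep(ts, cpu) for ts, cpu in trickles]

# Print each row with the recomputed average, rounded to four decimals
# for comparison with the Average (sec/TS) column.
for (ts, cpu), avg in zip(trickles, averages):
    print(f"{ts:>6} timesteps  {cpu:>7} s  {avg:.4f} s/TS")
```

Under that assumption the recomputed values agree with the listed averages to within rounding, which suggests the model ran at a steady rate of roughly 2.25 CPU seconds per timestep up to the last trickle.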