climateprediction.net home page
Task 16292815

Task 16292815

Name hadam3p_eu_q47i_2009_1_008328589_1
Workunit 8479450
Created 23 Feb 2014, 2:19:31 UTC
Sent 23 Feb 2014, 2:19:38 UTC
Report deadline 5 Feb 2015, 7:39:38 UTC
Received 28 Feb 2014, 0:15:01 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 957844
Run time 2 days 10 hours 45 min 32 sec
CPU time 2 days 10 hours 45 min 32 sec
Validate state Invalid
Credit 1,591.48
Device peak FLOPS 2.96 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.4.7</core_client_version>
<![CDATA[
<stderr_txt>
01:08:35 (5364): No heartbeat from core client for 30 sec - exiting
01:08:36 (5364): No heartbeat from core client for 30 sec - exiting
01:08:37 (5364): No heartbeat from core client for 30 sec - exiting
01:08:38 (5364): No heartbeat from core client for 30 sec - exiting
01:08:39 (5364): No heartbeat from core client for 30 sec - exiting
01:08:40 (5364): No heartbeat from core client for 30 sec - exiting
01:08:41 (5364): No heartbeat from core client for 30 sec - exiting
01:08:43 (5364): No heartbeat from core client for 30 sec - exiting
01:08:44 (5364): No heartbeat from core client for 30 sec - exiting
01:08:45 (5364): No heartbeat from core client for 30 sec - exiting
01:08:46 (5364): No heartbeat from core client for 30 sec - exiting
01:08:47 (5364): No heartbeat from core client for 30 sec - exiting
01:08:48 (5364): No heartbeat from core client for 30 sec - exiting
01:08:49 (5364): No heartbeat from core client for 30 sec - exiting
01:08:50 (5364): No heartbeat from core client for 30 sec - exiting
01:08:51 (5364): No heartbeat from core client for 30 sec - exiting
01:08:52 (5364): No heartbeat from core client for 30 sec - exiting
01:08:53 (5364): No heartbeat from core client for 30 sec - exiting
01:08:55 (5364): No heartbeat from core client for 30 sec - exiting
01:08:56 (5364): No heartbeat from core client for 30 sec - exiting
01:08:57 (5364): No heartbeat from core client for 30 sec - exiting
01:08:58 (5364): No heartbeat from core client for 30 sec - exiting
01:08:59 (5364): No heartbeat from core client for 30 sec - exiting
01:09:00 (5364): No heartbeat from core client for 30 sec - exiting
01:09:01 (5364): No heartbeat from core client for 30 sec - exiting
01:09:02 (5364): No heartbeat from core client for 30 sec - exiting
01:09:03 (5364): No heartbeat from core client for 30 sec - exiting
01:09:05 (5364): No heartbeat from core client for 30 sec - exiting
01:09:06 (5364): No heartbeat from core client for 30 sec - exiting
01:09:07 (5364): No heartbeat from core client for 30 sec - exiting
01:09:08 (5364): No heartbeat from core client for 30 sec - exiting
01:09:09 (5364): No heartbeat from core client for 30 sec - exiting
01:09:10 (5364): No heartbeat from core client for 30 sec - exiting
01:09:11 (5364): No heartbeat from core client for 30 sec - exiting
01:09:12 (5364): No heartbeat from core client for 30 sec - exiting
01:09:13 (5364): No heartbeat from core client for 30 sec - exiting
01:09:14 (5364): No heartbeat from core client for 30 sec - exiting
01:09:15 (5364): No heartbeat from core client for 30 sec - exiting
01:09:17 (5364): No heartbeat from core client for 30 sec - exiting
01:09:18 (5364): No heartbeat from core client for 30 sec - exiting
01:09:19 (5364): No heartbeat from core client for 30 sec - exiting
01:09:20 (5364): No heartbeat from core client for 30 sec - exiting
01:09:21 (5364): No heartbeat from core client for 30 sec - exiting
01:09:22 (5364): No heartbeat from core client for 30 sec - exiting
01:09:23 (5364): No heartbeat from core client for 30 sec - exiting
01:09:24 (5364): No heartbeat from core client for 30 sec - exiting
01:09:25 (5364): No heartbeat from core client for 30 sec - exiting
01:09:26 (5364): No heartbeat from core client for 30 sec - exiting
01:09:27 (5364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4992, selfPID=4968, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4992, selfPID=4992, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4476, selfPID=4416, iMonCtr=1
forrtl: Toegang geweigerd.
forrtl: severe (38): error during write, unit 8, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_q47i_2009_1_008328589\tmp\xaakm.pipe_dummy
Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  0043A39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  003E2CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  003E1E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  003BAA9D  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0035F27C  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  000D9BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0041E638  Unknown               Unknown  Unknown
kernel32.dll       76AAD2E9  Unknown               Unknown  Unknown
ntdll.dll          77B41603  Unknown               Unknown  Unknown
ntdll.dll          77B415D6  Unknown               Unknown  Unknown
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6820, selfPID=6820, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6820, selfPID=6796, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6812, iMonCtr=2
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6948, selfPID=6956, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7272, selfPID=7288, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7272, selfPID=7272, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7716, selfPID=7724, iMonCtr=1
forrtl: Toegang geweigerd.
forrtl: severe (38): error during write, unit 8, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_q47i_2009_1_008328589\tmp\xaakm.pipe_dummy
Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  0043A39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  003E2CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  003E1E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  003BAA9D  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0035F27C  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  000D9BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0041E638  Unknown               Unknown  Unknown
kernel32.dll       76AAD2E9  Unknown               Unknown  Unknown
ntdll.dll          77B41603  Unknown               Unknown  Unknown
ntdll.dll          77B415D6  Unknown               Unknown  Unknown
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4748, selfPID=4748, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4748, selfPID=3824, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5548, iMonCtr=2
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_q47i_2009_1_008328589/dataout/atmos_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_q47i_2009_1_008328589\tmp\xaakm.namelists
Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  0043A39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  003E2CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  003E1E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  003C2819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  002C2287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0035E7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0035F2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  000D9BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0041E638  Unknown               Unknown  Unknown
kernel32.dll       76AAD2E9  Unknown               Unknown  Unknown
ntdll.dll          77B41603  Unknown               Unknown  Unknown
ntdll.dll          77B415D6  Unknown               Unknown  Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_q47i_2009_1_008328589\tmp\xaakg.namelists
Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  0134C52A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  012F4460  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  012F362A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  012D2469  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  011D66EB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  01272AE2  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  012735AF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  01019860  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  01330893  Unknown               Unknown  Unknown
kernel32.dll       76AAD2E9  Unknown               Unknown  Unknown
ntdll.dll          77B41603  Unknown               Unknown  Unknown
ntdll.dll          77B415D6  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=812, selfPID=1068, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_q47i_2009_1_008328589_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_q47i_2009_1_008328589_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_q47i_2009_1_008328589_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_q47i_2009_1_008328589_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Feb 2014 18:12:08 957844 16292815 hadam3p_eu_q47i_2009_1_008328589_1 92,256 202,634 2.1964
27 Feb 2014 00:18:47 957844 16292815 hadam3p_eu_q47i_2009_1_008328589_1 80,736 176,706 2.1887
26 Feb 2014 11:00:12 957844 16292815 hadam3p_eu_q47i_2009_1_008328589_1 69,216 152,306 2.2004
26 Feb 2014 03:18:29 957844 16292815 hadam3p_eu_q47i_2009_1_008328589_1 57,696 125,809 2.1805
25 Feb 2014 13:55:02 957844 16292815 hadam3p_eu_q47i_2009_1_008328589_1 46,176 98,842 2.1405
25 Feb 2014 04:08:45 957844 16292815 hadam3p_eu_q47i_2009_1_008328589_1 34,656 75,232 2.1708
24 Feb 2014 21:02:48 957844 16292815 hadam3p_eu_q47i_2009_1_008328589_1 23,136 50,881 2.1992
23 Feb 2014 23:42:50 957844 16292815 hadam3p_eu_q47i_2009_1_008328589_1 11,616 28,033 2.4133


©2024 cpdn.org