climateprediction.net home page
Task 15208407

Task 15208407

Name hadam3p_saf_1dxc_1959_1_006945880_1
Workunit 7149196
Created 30 Aug 2012, 23:29:29 UTC
Sent 30 Aug 2012, 23:36:10 UTC
Report deadline 13 Aug 2013, 4:56:10 UTC
Received 8 Sep 2012, 0:34:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1224712
Run time 2 days 8 hours 25 min 55 sec
CPU time 1 days 23 hours 32 min 38 sec
Validate state Invalid
Credit 1,122.82
Device peak FLOPS 2.75 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>6.8.42</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:20:00 (4772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2160, selfPID=1600, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4188, selfPID=4188, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2948, selfPID=2948, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2148, selfPID=2148, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4632, selfPID=4632, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4260, selfPID=4260, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=532, selfPID=4820, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
RegionController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2296, selfPID=4676, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4212, selfPID=4212, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2688, selfPID=2688, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:05:29 (4104): No heartbeat from core client for 30 sec - exiting
11:05:30 (4104): No heartbeat from core client for 30 sec - exiting
11:05:31 (4104): No heartbeat from core client for 30 sec - exiting
11:05:32 (4104): No heartbeat from core client for 30 sec - exiting
11:05:33 (4104): No heartbeat from core client for 30 sec - exiting
11:05:34 (4104): No heartbeat from core client for 30 sec - exiting
11:05:35 (4104): No heartbeat from core client for 30 sec - exiting
11:05:37 (4104): No heartbeat from core client for 30 sec - exiting
11:05:38 (4104): No heartbeat from core client for 30 sec - exiting
11:05:39 (4104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4440, selfPID=3256, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_saf_1dxc_1959_1_006945880\tmp\xaakm.namelists
Image              PC        Routine            Line        Source             
hadam3p_saf_um_6.  004AA39A  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00452CD0  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00451E9A  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00432819  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00332287  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  003CE7B2  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  003CF2DA  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00149BD2  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  0048E638  Unknown               Unknown  Unknown
kernel32.dll       74BA339A  Unknown               Unknown  Unknown
ntdll.dll          77009EF2  Unknown               Unknown  Unknown
ntdll.dll          77009EC5  Unknown               Unknown  Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_saf_1dxc_1959_1_006945880\tmp\xaakg.namelists
Image              PC        Routine            Line        Source             
hadrm3p_saf_um_6.  005AC52A  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  00554460  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  0055362A  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  00532469  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  004366EB  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  004D2AE2  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  004D35AF  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  00279860  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  00590893  Unknown               Unknown  Unknown
kernel32.dll       74BA339A  Unknown               Unknown  Unknown
ntdll.dll          77009EF2  Unknown               Unknown  Unknown
ntdll.dll          77009EC5  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2736, selfPID=3752, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_1dxc_1959_1_006945880_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1dxc_1959_1_006945880_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1dxc_1959_1_006945880_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1dxc_1959_1_006945880_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1dxc_1959_1_006945880_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1dxc_1959_1_006945880_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Sep 2012 11:58:49 1224712 15208407 hadam3p_saf_1dxc_1959_1_006945880_1 69,216 162,647 2.3498
05 Sep 2012 15:11:56 1224712 15208407 hadam3p_saf_1dxc_1959_1_006945880_1 57,696 136,518 2.3662
04 Sep 2012 23:33:15 1224712 15208407 hadam3p_saf_1dxc_1959_1_006945880_1 46,176 110,275 2.3881
04 Sep 2012 00:18:53 1224712 15208407 hadam3p_saf_1dxc_1959_1_006945880_1 34,656 83,220 2.4013
03 Sep 2012 00:14:17 1224712 15208407 hadam3p_saf_1dxc_1959_1_006945880_1 23,136 55,876 2.4151
01 Sep 2012 16:22:25 1224712 15208407 hadam3p_saf_1dxc_1959_1_006945880_1 11,616 27,899 2.4018


©2024 cpdn.org