climateprediction.net home page
Task 15342225

Task 15342225

Name hadam3p_eu_82ex_2005_1_008210512_1
Workunit 8365636
Created 6 Oct 2012, 8:37:52 UTC
Sent 6 Oct 2012, 8:38:06 UTC
Report deadline 18 Sep 2013, 13:58:06 UTC
Received 20 Oct 2012, 17:15:20 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1170519
Run time 2 days 1 hours 21 min 29 sec
CPU time 1 days 22 hours 0 min 58 sec
Validate state Invalid
Credit 993.71
Device peak FLOPS 2.50 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
03:03:14 (2792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:03:16 (2792): No heartbeat from core client for 30 sec - exiting
06:24:25 (1436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:24:26 (1436): No heartbeat from core client for 30 sec - exiting
06:24:27 (1436): No heartbeat from core client for 30 sec - exiting
06:24:28 (1436): No heartbeat from core client for 30 sec - exiting
07:07:54 (4372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6284, selfPID=6284, iMonCtr=2
07:07:55 (4372): No heartbeat from core client for 30 sec - exiting
07:07:56 (4372): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6468, selfPID=5352, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4648, selfPID=3224, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2104, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3768, selfPID=2872, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Colobal Worerer:: CPDDN pNoceprocess is not ninnning, exiting, bRetVal1, 1, checkPID=0, selfPI3712, iM iMonCtr=
2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4756, selfPID=1692, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3596, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1108, selfPID=4556, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4200, selfPID=2648, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2540, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=128, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GRegional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1336, selfPID=4804, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4416, selfPID=3188, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3968, selfPID=5220, iMonCtr=1
Model crash detected, will try to restart...
15:39:14 (4688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:40:56 (5180): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4904, selfPID=4904, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
GCPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2676, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2828, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3060, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
19:04:14 (2904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4640, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1856, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4404, selfPID=4404, iMonCtr=2
10:41:06 (2020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:41:08 (2020): No heartbeat from core client for 30 sec - exiting
11:42:14 (124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:42:15 (124): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3460, selfPID=3460, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1652, selfPID=1652, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2852, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2880, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:08:25 (3500): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:08:30 (3500): No heartbeat from core client for 30 sec - exiting
13:08:31 (3500): No heartbeat from core client for 30 sec - exiting
13:08:32 (3500): No heartbeat from core client for 30 sec - exiting
07:54:12 (4436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2552, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_82ex_2005_1_008210512\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  00E0C52A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00DB4460  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00DB362A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00D92469  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00C966EB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00D32AE2  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00D335AF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00AD9860  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00DF0893  Unknown               Unknown  Unknown
kernel32.dll       75F033AA  Unknown               Unknown  Unknown
ntdll.dll          77369EF2  Unknown               Unknown  Unknown
ntdll.dll          77369EC5  Unknown               Unknown  Unknownforrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_82ex_2005_1_008210512\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  0150A39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  014B2CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  014B1E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01492819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01392287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0142E7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0142F2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  011A9BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  014EE638  Unknown               Unknown  Unknown
kernel32.dll       75F033AA  Unknown               Unknown  Unknown
ntdll.dll          77369EF2  Unknown               Unknown  Unknown
ntdll.dll          77369EC5  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2388, selfPID=916, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_82ex_2005_1_008210512_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_82ex_2005_1_008210512_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_82ex_2005_1_008210512_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_82ex_2005_1_008210512_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_82ex_2005_1_008210512_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_82ex_2005_1_008210512_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_82ex_2005_1_008210512_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Oct 2012 10:46:49 1170519 15342225 hadam3p_eu_82ex_2005_1_008210512_1 57,604 159,854 2.7751
20 Oct 2012 09:46:37 1170519 15342225 hadam3p_eu_82ex_2005_1_008210512_1 57,600 159,386 2.7671
17 Oct 2012 16:26:53 1170519 15342225 hadam3p_eu_82ex_2005_1_008210512_1 46,080 127,428 2.7654
14 Oct 2012 10:01:31 1170519 15342225 hadam3p_eu_82ex_2005_1_008210512_1 34,560 95,897 2.7748
10 Oct 2012 19:10:59 1170519 15342225 hadam3p_eu_82ex_2005_1_008210512_1 23,040 64,606 2.8041
08 Oct 2012 02:39:34 1170519 15342225 hadam3p_eu_82ex_2005_1_008210512_1 11,616 33,337 2.8699


©2024 cpdn.org