climateprediction.net home page
Task 16334805

Task 16334805

Name hadam3p_eu_p1w7_2013_1_008545192_0
Workunit 8692704
Created 5 Mar 2014, 15:50:00 UTC
Sent 9 Mar 2014, 11:01:51 UTC
Report deadline 19 Feb 2015, 16:21:51 UTC
Received 10 Apr 2014, 14:08:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1285351
Run time 5 days 2 hours 50 min 44 sec
CPU time 4 days 13 hours 57 min 29 sec
Validate state Invalid
Credit 2,187.67
Device peak FLOPS 2.78 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5788, selfPID=5576, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
10:59:37 (4216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7588, selfPID=6932, iMonCtr=1
Model crash detected, will try to restart...
11:36:29 (4580): No heartbeat from core client for 30 sec - exiting
11:36:30 (4580): No heartbeat from core client for 30 sec - exiting
11:36:31 (4580): No heartbeat from core client for 30 sec - exiting
11:36:32 (4580): No heartbeat from core client for 30 sec - exiting
11:36:33 (4580): No heartbeat from core client for 30 sec - exiting
11:36:34 (4580): No heartbeat from core client for 30 sec - exiting
11:36:35 (4580): No heartbeat from core client for 30 sec - exiting
11:36:36 (4580): No heartbeat from core client for 30 sec - exiting
11:36:37 (4580): No heartbeat from core client for 30 sec - exiting
11:36:38 (4580): No heartbeat from core client for 30 sec - exiting
11:36:39 (4580): No heartbeat from core client for 30 sec - exiting
11:36:40 (4580): No heartbeat from core client for 30 sec - exiting
11:36:41 (4580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=4508, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2552, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=428, selfPID=256, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4808, selfPID=5040, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:17:13 (5468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
G11:29:37 (3572): No heartbeat from core client for 30 sec - exiting
11:29:38 (3572): No heartbeat from core client for 30 sec - exiting
11:29:39 (3572): No heartbeat from core client for 30 sec - exiting
11:29:40 (3572): No heartbeat from core client for 30 sec - exiting
11:29:41 (3572): No heartbeat from core client for 30 sec - exiting
11:29:42 (3572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2360, selfPID=2360, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6140, selfPID=4996, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5536, selfPID=4700, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=192, selfPID=5484, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1856, selfPID=5524, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=248, selfPID=4388, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4800, iMonCtr=2
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6212, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
forrtl: severe (24): end-of-file during read, unit 9, file D:\BOINC\projects\climateprediction.net\hadam3p_eu_p1w7_2013_1_008545192\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  0160C52A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  015B4460  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  015B362A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  01592469  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  014966EB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  01532AE2  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  015335AF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  012D9860  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  015F0893  Unknown               Unknown  Unknown
kernel32.dll       7556ED5C  Unknown               Unknown  Unknown
ntdll.dll          76FC37EB  Unknown               Unknown  Unknown
ntdll.dll          76FC37BE  Unknown               Unknown  Unknownforrtl: severe (24): end-of-file during read, unit 9, file D:\BOINC\projects\climateprediction.net\hadam3p_eu_p1w7_2013_1_008545192\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  014DA39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01482CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01481E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01462819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01362287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  013FE7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  013FF2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01179BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  014BE638  Unknown               Unknown  Unknown
kernel32.dll       7556ED5C  Unknown               Unknown  Unknown
ntdll.dll          76FC37EB  Unknown               Unknown  Unknown
ntdll.dll          76FC37BE  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4776, selfPID=3316, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_p1w7_2013_1_008545192_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Apr 2014 06:13:46 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 126,816 379,219 2.9903
05 Apr 2014 11:55:50 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 115,296 342,359 2.9694
03 Apr 2014 14:18:32 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 103,800 305,727 2.9453
03 Apr 2014 03:54:50 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 103,776 305,153 2.9405
02 Apr 2014 16:14:48 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 92,256 268,834 2.9140
27 Mar 2014 20:26:05 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 80,736 231,594 2.8685
24 Mar 2014 14:53:29 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 69,228 195,414 2.8228
24 Mar 2014 13:53:46 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 69,216 194,961 2.8167
23 Mar 2014 02:42:59 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 57,696 162,640 2.8189
21 Mar 2014 15:11:42 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 46,176 130,460 2.8253
17 Mar 2014 11:55:54 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 34,656 97,187 2.8043
16 Mar 2014 09:12:06 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 23,136 64,305 2.7794
12 Mar 2014 16:11:05 1285351 16334805 hadam3p_eu_p1w7_2013_1_008545192_0 11,616 31,740 2.7324


©2024 climateprediction.net