Task 14137846

Name hadam3p_eu_9xc6_1974_1_007779174_0
Workunit 7934283
Created 20 Feb 2012, 20:27:53 UTC
Sent 6 Mar 2012, 15:30:08 UTC
Report deadline 16 Feb 2013, 20:50:08 UTC
Received 20 May 2012, 16:51:20 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1098998
Run time 3 days 19 hours 56 min 56 sec
CPU time 3 days 18 hours 51 min 15 sec
Validate state Invalid
Credit 2,187.67
Device peak FLOPS 2.67 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 (windows_intelx86)
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4812, selfPID=1732, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5796, iMonCtr=2
Colobantrorker:: Cllerp:o: CPDN process is not running, exiting, bRetVal = 1, checkPIDD=4860, iMonCtr=2
 iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3944, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4936, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4944, selfPID=2468, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3632, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
02:47:08 (3812): No heartbeat from core client for 30 sec - exiting
02:47:10 (3812): No heartbeat from core client for 30 sec - exiting
02:47:11 (3812): No heartbeat from core client for 30 sec - exiting
02:47:12 (3812): No heartbeat from core client for 30 sec - exiting
02:47:13 (3812): No heartbeat from core client for 30 sec - exiting
02:47:14 (3812): No heartbeat from core client for 30 sec - exiting
02:47:15 (3812): No heartbeat from core client for 30 sec - exiting
02:47:16 (3812): No heartbeat from core client for 30 sec - exiting
02:47:17 (3812): No heartbeat from core client for 30 sec - exiting
02:47:18 (3812): No heartbeat from core client for 30 sec - exiting
02:47:19 (3812): No heartbeat from core client for 30 sec - exiting
02:47:21 (3812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4324, selfPID=4496, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3640, iMonCtr=2
Leaving CPDN_Main::Monitor...
15:19:13 (4804): No heartbeat from core client for 30 sec - exiting
15:19:15 (4804): No heartbeat from core client for 30 sec - exiting
15:19:16 (4804): No heartbeat from core client for 30 sec - exiting
15:19:17 (4804): No heartbeat from core client for 30 sec - exiting
15:19:18 (4804): No heartbeat from core client for 30 sec - exiting
15:19:19 (4804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4036, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1180, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4064, selfPID=4936, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
23:25:17 (3176): No heartbeat from core client for 30 sec - exiting
23:25:19 (3176): No heartbeat from core client for 30 sec - exiting
23:25:20 (3176): No heartbeat from core client for 30 sec - exiting
23:25:21 (3176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4896, selfPID=4196, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5064, selfPID=2180, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Glontrbalololerr:: CPDN process is not running, exiting, bRetVal = 1, checkPIID=0, sefPID=3944, iMonCtr=2
2
odel crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2660, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4420, selfPID=4288, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4460, selfPID=2388, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2728, selfPID=3748, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Glontroobal W:rkePD:: CPDN priocess is not running, , bRetgVal = 1, checkPID=0, selfPIsD=3872,4 iMonCtr=2
Mode
l crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
01:21:13 (3872): No heartbeat from core client for 30 sec - exiting
01:21:14 (3872): No heartbeat from core client for 30 sec - exiting
01:21:15 (3872): No heartbeat from core client for 30 sec - exiting
01:21:16 (3872): No heartbeat from core client for 30 sec - exiting
01:21:18 (3872): No heartbeat from core client for 30 sec - exiting
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_9xc6_1974_1_007779174/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_9xc6_1974_1_007779174/dataout/region_restart.day after 11 attempts

Mode
Morashed: READd: ST: End of file in READ from history file for namelr namelist NLIHISTO                                                                                                                                                                                     tmp/xaakm.pipe_dummy                                                            2048     
2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_9xc6_1974_1_007779174_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
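The stderr output above repeats a few message types many times, so a quick tally makes the crash/restart pattern easier to read. The following is a minimal sketch, not a CPDN or BOINC tool: the function name `summarize` and the regular expression are assumptions written for this log's visible line format, and the sample lines are copied from the log above.

```python
import re
from collections import Counter

# Assumed pattern for lines such as
# "23:25:17 (3176): No heartbeat from core client for 30 sec - exiting"
HEARTBEAT = re.compile(r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client")
RESTART = "Model crash detected, will try to restart..."


def summarize(lines):
    """Count heartbeat-loss messages per PID and model restart attempts."""
    heartbeat_by_pid = Counter()
    restarts = 0
    for line in lines:
        match = HEARTBEAT.match(line)
        if match:
            heartbeat_by_pid[int(match.group(2))] += 1
        elif line.strip() == RESTART:
            restarts += 1
    return heartbeat_by_pid, restarts


# A short excerpt copied from the stderr text above.
sample = [
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4064, selfPID=4936, iMonCtr=1",
    "Model crash detected, will try to restart...",
    "Leaving CPDN_Main::Monitor...",
    "23:25:17 (3176): No heartbeat from core client for 30 sec - exiting",
    "23:25:19 (3176): No heartbeat from core client for 30 sec - exiting",
    "23:25:20 (3176): No heartbeat from core client for 30 sec - exiting",
    "23:25:21 (3176): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
]

heartbeats, restarts = summarize(sample)
print(dict(heartbeats), restarts)
```

Lines where two processes wrote at the same moment (the interleaved, unreadable entries in the log) match neither pattern and are simply skipped, so the counts are a lower bound.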
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
20 May 2012 14:49:40  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  126,816   320,893         2.5304
16 May 2012 16:47:37  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  115,296   292,728         2.5389
14 May 2012 15:26:48  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  103,776   264,210         2.5460
13 May 2012 04:58:02  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  92,256    236,106         2.5592
11 May 2012 16:16:08  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  80,736    207,673         2.5722
08 May 2012 17:15:31  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  69,216    178,227         2.5749
06 May 2012 14:45:52  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  57,696    148,803         2.5791
04 May 2012 17:57:46  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  46,176    117,721         2.5494
03 May 2012 08:19:44  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  34,656    88,500          2.5537
29 Apr 2012 16:32:18  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  23,136    59,517          2.5725
16 Apr 2012 14:51:08  1098998  14137846   hadam3p_eu_9xc6_1974_1_007779174_0  11,616    30,016          2.5840
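
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against two rows copied from the table above. The `Trickle` class and `parse_trickle` function are hypothetical names introduced for illustration, and the parser assumes whitespace-separated fields with the date occupying the first four tokens.

```python
from dataclasses import dataclass


@dataclass
class Trickle:
    time_sent: str
    host_id: int
    result_id: int
    result_name: str
    timestep: int
    cpu_time_sec: int
    avg_sec_per_ts: float


def parse_trickle(line: str) -> Trickle:
    """Parse one whitespace-separated trickle row (assumed layout)."""
    parts = line.split()
    time_sent = " ".join(parts[:4])  # day, month, year, clock time
    host_id, result_id, name, timestep, cpu, avg = parts[4:10]
    return Trickle(
        time_sent=time_sent,
        host_id=int(host_id),
        result_id=int(result_id),
        result_name=name,
        timestep=int(timestep.replace(",", "")),  # drop thousands separators
        cpu_time_sec=int(cpu.replace(",", "")),
        avg_sec_per_ts=float(avg),
    )


# Newest and oldest rows, copied from the table above.
rows = [
    "20 May 2012 14:49:40 1098998 14137846 hadam3p_eu_9xc6_1974_1_007779174_0 126,816 320,893 2.5304",
    "16 Apr 2012 14:51:08 1098998 14137846 hadam3p_eu_9xc6_1974_1_007779174_0 11,616 30,016 2.5840",
]

for row in rows:
    t = parse_trickle(row)
    recomputed = t.cpu_time_sec / t.timestep
    print(f"{t.time_sent}: reported {t.avg_sec_per_ts:.4f}, recomputed {recomputed:.4f}")
```

For both rows the recomputed value agrees with the reported average to four decimal places, which supports the interpretation but does not confirm how the server computes the column.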

