climateprediction.net home page
Task 16000815

Task 16000815

Name hadcm3n_o0ft_1980_40_008388500_1
Workunit 8539359
Created 2 Sep 2013, 21:20:55 UTC
Sent 2 Sep 2013, 21:26:03 UTC
Report deadline 3 Dec 2013, 4:53:14 UTC
Received 17 Oct 2013, 22:57:20 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1266258
Run time 7 days 1 hours 25 min 10 sec
CPU time 5 days 9 hours 41 min 39 sec
Validate state Invalid
Credit 2,177.28
Device peak FLOPS 1.98 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
10:15:24 (1276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1
Model crash detected, will try to restart...
08:13:34 (3056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:13:35 (3056): No heartbeat from core client for 30 sec - exiting
08:13:37 (3056): No heartbeat from core client for 30 sec - exiting
08:13:38 (3056): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4312, iMonCtr=1
Model crash detected, will try to restart...
CConController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3652, iMonCtr=1
Model crash detected, will try to restart...
22:04:07 (5752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3944, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3180, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4176, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2268, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4044, iMonCtr=1
Model crash detected, will try to restart...
14:35:46 (4272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:30:39 (5976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:46:34 (5528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CCPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2240, iMonCtr=1
Model crash detected, will try to restart...
20:45:02 (4248): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
19:42:53 (2484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:48:34 (2860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:08:22 (2252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:24:24 (5908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:40:20 (6452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2324, iMonCtr=1
Model crash detected, will try to restart...
17:38:53 (4596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4052, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
22:38:55 (4332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:11:57 (4424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:56:14 (3484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3292, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2836, iMonCtr=1
Model crash detected, will try to restart...
14:23:50 (3872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2080, iMonCtr=1
Model crash detected, will try to restart...
12:21:35 (3296): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not ruController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2416, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2576, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Oct 2013 07:13:42 1266258 16000815 hadcm3n_o0ft_1980_40_008388500_1 181,440 413,103 2.2768
08 Oct 2013 00:08:41 1266258 16000815 hadcm3n_o0ft_1980_40_008388500_1 155,520 352,360 2.2657
03 Oct 2013 04:19:07 1266258 16000815 hadcm3n_o0ft_1980_40_008388500_1 129,600 292,734 2.2588
01 Oct 2013 05:34:15 1266258 16000815 hadcm3n_o0ft_1980_40_008388500_1 103,680 233,549 2.2526
26 Sep 2013 23:59:32 1266258 16000815 hadcm3n_o0ft_1980_40_008388500_1 77,760 175,162 2.2526
19 Sep 2013 08:22:15 1266258 16000815 hadcm3n_o0ft_1980_40_008388500_1 51,840 116,520 2.2477
15 Sep 2013 00:21:46 1266258 16000815 hadcm3n_o0ft_1980_40_008388500_1 25,920 58,995 2.2760


©2024 cpdn.org