Task 15914906

Name hadcm3n_3dot_2020_40_008363278_1
Workunit 8514137
Created 14 Aug 2013, 11:39:25 UTC
Sent 14 Aug 2013, 18:09:32 UTC
Report deadline 14 Nov 2013, 1:36:43 UTC
Received 30 Oct 2013, 17:42:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 945967
Run time 9 days 5 hours 10 min 20 sec
CPU time 9 days 5 hours 10 min 20 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 1.91 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=1
Model crash detected, will try to restart...
10:36:45 (6852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3700, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7300, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3568, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2640, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6560, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7896, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7356, iMonCtr=1
Model crash detected, will try to restart...
10:29:42 (7372): No heartbeat from core client for 30 sec - exiting
10:29:43 (7372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:42:47 (7932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3580, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7896, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7596, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6880, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6880, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1340, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5732, iMonCtr=1
Model crash detected, will try to restart...
14:07:50 (7560): No heartbeat from core client for 30 sec - exiting
14:07:51 (7560): No heartbeat from core client for 30 sec - exiting
14:07:52 (7560): No heartbeat from core client for 30 sec - exiting
14:07:53 (7560): No heartbeat from core client for 30 sec - exiting
14:07:54 (7560): No heartbeat from core client for 30 sec - exiting
14:07:55 (7560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7564, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7564, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6232, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6232, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7604, iMonCtr=1
Model crash detected, will try to restart...
09:50:06 (6252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:50:07 (6252): No heartbeat from core client for 30 sec - exiting
09:50:08 (6252): No heartbeat from core client for 30 sec - exiting
09:50:09 (6252): No heartbeat from core client for 30 sec - exiting
09:50:10 (6252): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7180, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6864, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4664, iMonCtr=1
Model crash detected, will try to restart...
10:11:43 (7268): No heartbeat from core client for 30 sec - exiting
10:11:44 (7268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7928, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7732, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7668, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7668, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6736, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1812, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1812, iMonCtr=1
Model crash detected, will try to restart...
09:52:32 (4108): No heartbeat from core client for 30 sec - exiting
09:52:33 (4108): No heartbeat from core client for 30 sec - exiting
09:52:34 (4108): No heartbeat from core client for 30 sec - exiting
09:52:35 (4108): No heartbeat from core client for 30 sec - exiting
09:52:36 (4108): No heartbeat from core client for 30 sec - exiting
09:52:37 (4108): No heartbeat from core client for 30 sec - exiting
09:52:38 (4108): No heartbeat from core client for 30 sec - exiting
09:52:39 (4108): No heartbeat from core client for 30 sec - exiting
09:52:40 (4108): No heartbeat from core client for 30 sec - exiting
09:52:41 (4108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:54:23 (7388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:54:33 (7512): Can't acquire lockfile (32) - waiting 35s
09:57:51 (7512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7052, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7916, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7916, iMonCtr=1
Model crash detected, will try to restart...
10:52:49 (7408): No heartbeat from core client for 30 sec - exiting
10:52:50 (7408): No heartbeat from core client for 30 sec - exiting
10:52:51 (7408): No heartbeat from core client for 30 sec - exiting
10:52:52 (7408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=448, iMonCtr=1
Model crash detected, will try to restart...
10:05:00 (3184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5732, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5576, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7776, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7424, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7120, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7120, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6872, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
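The stderr output above repeats a handful of message types many times. A minimal sketch for tallying them by substring match follows; the category labels (`model_restart`, `heartbeat_lost`, and so on) and the function names are assumptions made for illustration, not part of BOINC or the CPDN application, and `SAMPLE` holds only a few lines copied from the log rather than the whole output.

```python
from collections import Counter

# Hypothetical labels mapped to substrings that appear verbatim in the log.
PATTERNS = {
    "model_restart": "Model crash detected",
    "heartbeat_lost": "No heartbeat from core client",
    "lockfile_wait": "Can't acquire lockfile",
    "signal": "Signal 11 received",
}


def classify(line):
    """Return the first label whose substring occurs in the line, else None."""
    for label, needle in PATTERNS.items():
        if needle in line:
            return label
    return None


def tally(lines):
    """Count how many lines fall under each label; unmatched lines are skipped."""
    return Counter(label for label in map(classify, lines) if label is not None)


# A few lines copied from the stderr output, not the full log.
SAMPLE = [
    "Model crash detected, will try to restart...",
    "10:36:45 (6852): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
    "09:54:33 (7512): Can't acquire lockfile (32) - waiting 35s",
    "Signal 11 received, exiting...",
]

if __name__ == "__main__":
    for label, count in tally(SAMPLE).items():
        print(label, count)
```

Feeding the full stderr text through `tally(text.splitlines())` would give the number of restart attempts and lost heartbeats before the final signal 11.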
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
30 Oct 2013 16:45:51  945967   15914906   hadcm3n_3dot_2020_40_008363278_1  259,200   796,202         3.0718
24 Oct 2013 14:29:58  945967   15914906   hadcm3n_3dot_2020_40_008363278_1  233,280   716,718         3.0724
18 Oct 2013 11:41:13  945967   15914906   hadcm3n_3dot_2020_40_008363278_1  207,360   635,659         3.0655
03 Oct 2013 11:22:27  945967   15914906   hadcm3n_3dot_2020_40_008363278_1  181,440   555,770         3.0631
26 Sep 2013 13:49:46  945967   15914906   hadcm3n_3dot_2020_40_008363278_1  155,520   476,159         3.0617
20 Sep 2013 11:30:10  945967   15914906   hadcm3n_3dot_2020_40_008363278_1  129,600   396,454         3.0591
12 Sep 2013 13:46:11  945967   15914906   hadcm3n_3dot_2020_40_008363278_1  103,680   317,486         3.0622
06 Sep 2013 10:07:32  945967   15914906   hadcm3n_3dot_2020_40_008363278_1  77,760    238,177         3.0630
30 Aug 2013 18:06:37  945967   15914906   hadcm3n_3dot_2020_40_008363278_1  51,840    158,926         3.0657
22 Aug 2013 15:16:13  945967   15914906   hadcm3n_3dot_2020_40_008363278_1  25,920    80,403          3.1020
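
The Average (sec/TS) column looks like cumulative CPU time divided by the timestep count. That relationship is an assumption inferred from the numbers, not something the page states. A minimal sketch that checks it against the rows above, with the function and variable names chosen for illustration:

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (259200, 796202, 3.0718),
    (233280, 716718, 3.0724),
    (207360, 635659, 3.0655),
    (181440, 555770, 3.0631),
    (155520, 476159, 3.0617),
    (129600, 396454, 3.0591),
    (103680, 317486, 3.0622),
    (77760, 238177, 3.0630),
    (51840, 158926, 3.0657),
    (25920, 80403, 3.1020),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds divided by timesteps completed."""
    return cpu_time_sec / timestep


def max_deviation(rows):
    """Largest absolute gap between the computed and the reported average."""
    return max(
        abs(average_sec_per_timestep(cpu, ts) - reported)
        for ts, cpu, reported in rows
    )


if __name__ == "__main__":
    for ts, cpu, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu, ts)
        print(f"{ts:>7} {computed:.4f} (reported {reported:.4f})")
```

Under this assumption every row agrees with the reported value to within the four-decimal rounding of the column.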

