climateprediction.net home page
Task 13790762

Task 13790762

Name hadcm3n_t4wd_1940_40_007446431_2
Workunit 7643934
Created 17 Dec 2011, 13:14:26 UTC
Sent 17 Dec 2011, 13:14:58 UTC
Report deadline 17 Mar 2012, 20:42:09 UTC
Received 17 Feb 2012, 13:03:21 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1169284
Run time 12 days 4 hours 23 min 50 sec
CPU time 11 days 22 hours 33 min 51 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.34 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4936, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4436, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4652, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4896, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4200, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4340, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4288, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
11:34:12 (3680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:37:03 (4756): No heartbeat from core client for 30 sec - exiting
11:37:04 (4756): No heartbeat from core client for 30 sec - exiting
11:37:05 (4756): No heartbeat from core client for 30 sec - exiting
11:37:06 (4756): No heartbeat from core client for 30 sec - exiting
11:37:08 (4756): No heartbeat from core client for 30 sec - exiting
11:37:09 (4756): No heartbeat from core client for 30 sec - exiting
11:37:10 (4756): No heartbeat from core client for 30 sec - exiting
11:37:11 (4756): No heartbeat from core client for 30 sec - exiting
11:37:12 (4756): No heartbeat from core client for 30 sec - exiting
11:38:21 (4756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4672, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Feb 2012 13:03:39 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 518,400 1,031,626 1.9900
12 Feb 2012 09:34:30 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 492,480 968,157 1.9659
11 Feb 2012 09:44:49 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 466,560 912,408 1.9556
07 Feb 2012 13:31:19 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 440,640 845,439 1.9187
05 Feb 2012 05:16:11 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 414,720 784,498 1.8916
04 Feb 2012 06:49:05 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 388,800 742,937 1.9108
31 Jan 2012 14:48:09 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 362,880 679,189 1.8717
29 Jan 2012 08:02:27 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 336,960 631,839 1.8751
16 Jan 2012 13:16:13 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 311,040 568,472 1.8276
15 Jan 2012 06:09:53 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 285,120 532,611 1.8680
14 Jan 2012 06:01:36 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 259,200 471,578 1.8194
08 Jan 2012 13:51:04 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 233,280 410,728 1.7607
08 Jan 2012 02:53:43 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 207,360 372,065 1.7943
07 Jan 2012 03:26:09 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 181,440 327,108 1.8028
04 Jan 2012 11:36:50 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 155,520 280,852 1.8059
01 Jan 2012 04:21:28 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 129,600 213,796 1.6497
30 Dec 2011 14:52:51 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 103,680 168,134 1.6217
29 Dec 2011 13:45:24 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 77,760 127,660 1.6417
22 Dec 2011 13:09:39 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 51,840 86,034 1.6596
18 Dec 2011 10:57:56 1169284 13790762 hadcm3n_t4wd_1940_40_007446431_2 25,920 42,922 1.6559


©2024 cpdn.org