Task 15200879

Name hadcm3n_o0tc_2100_40_008166583_4
Workunit 8321707
Created 29 Aug 2012, 17:42:25 UTC
Sent 29 Aug 2012, 17:42:30 UTC
Report deadline 29 Nov 2012, 1:09:41 UTC
Received 16 Oct 2012, 13:48:35 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1174717
Run time 28 days 21 hours 43 min 16 sec
CPU time 26 days 18 hours 36 min 59 sec
Validate state Invalid
Credit 12,130.56
Device peak FLOPS 3.33 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2452, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2452, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2452, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2452, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2452, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2452, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:05:28 (4696): No heartbeat from core client for 30 sec - exiting
07:05:29 (4696): No heartbeat from core client for 30 sec - exiting
07:05:30 (4696): No heartbeat from core client for 30 sec - exiting
07:05:31 (4696): No heartbeat from core client for 30 sec - exiting
07:05:32 (4696): No heartbeat from core client for 30 sec - exiting
07:05:33 (4696): No heartbeat from core client for 30 sec - exiting
07:05:34 (4696): No heartbeat from core client for 30 sec - exiting
07:05:35 (4696): No heartbeat from core client for 30 sec - exiting
07:05:37 (4696): No heartbeat from core client for 30 sec - exiting
07:05:38 (4696): No heartbeat from core client for 30 sec - exiting
07:05:39 (4696): No heartbeat from core client for 30 sec - exiting
07:05:40 (4696): No heartbeat from core client for 30 sec - exiting
07:05:41 (4696): No heartbeat from core client for 30 sec - exiting
07:05:42 (4696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:05:43 (4696): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
19:13:56 (4352): No heartbeat from core client for 30 sec - exiting
19:13:57 (4352): No heartbeat from core client for 30 sec - exiting
19:13:58 (4352): No heartbeat from core client for 30 sec - exiting
19:13:59 (4352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:15:16 (3220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:03:43 (3228): No heartbeat from core client for 30 sec - exiting
18:03:44 (3228): No heartbeat from core client for 30 sec - exiting
18:03:45 (3228): No heartbeat from core client for 30 sec - exiting
18:03:46 (3228): No heartbeat from core client for 30 sec - exiting
18:03:47 (3228): No heartbeat from core client for 30 sec - exiting
18:03:48 (3228): No heartbeat from core client for 30 sec - exiting
18:03:49 (3228): No heartbeat from core client for 30 sec - exiting
18:03:50 (3228): No heartbeat from core client for 30 sec - exiting
18:03:51 (3228): No heartbeat from core client for 30 sec - exiting
18:03:52 (3228): No heartbeat from core client for 30 sec - exiting
18:03:53 (3228): No heartbeat from core client for 30 sec - exiting
18:03:54 (3228): No heartbeat from core client for 30 sec - exiting
18:03:55 (3228): No heartbeat from core client for 30 sec - exiting
18:03:56 (3228): No heartbeat from core client for 30 sec - exiting
18:03:57 (3228): No heartbeat from core client for 30 sec - exiting
18:03:58 (3228): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep   CPU Time (sec)  Average (sec/TS)
16 Oct 2012 00:53:12  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4  1,010,880       2,312,639  2.2877
11 Oct 2012 16:27:15  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    984,960       2,275,686  2.3104
11 Oct 2012 04:49:55  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    959,040       2,238,022  2.3336
10 Oct 2012 16:38:06  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    933,120       2,200,803  2.3585
10 Oct 2012 05:10:33  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    907,200       2,163,541  2.3849
09 Oct 2012 17:59:27  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    881,280       2,126,289  2.4127
09 Oct 2012 06:30:01  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    855,360       2,089,258  2.4425
08 Oct 2012 19:39:06  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    829,440       2,052,097  2.4741
08 Oct 2012 08:10:56  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    803,520       2,014,832  2.5075
07 Oct 2012 20:42:39  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    777,600       1,977,470  2.5430
07 Oct 2012 09:35:05  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    751,680       1,940,072  2.5810
06 Oct 2012 22:52:07  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    725,760       1,902,713  2.6217
06 Oct 2012 11:18:11  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    699,840       1,865,668  2.6658
05 Oct 2012 23:40:11  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    673,920       1,828,490  2.7132
05 Oct 2012 13:33:28  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    648,000       1,791,192  2.7642
16 Sep 2012 11:35:12  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    622,080         872,073  1.4019
16 Sep 2012 01:37:48  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    596,160         835,409  1.4013
15 Sep 2012 13:58:45  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    570,240         798,585  1.4004
15 Sep 2012 02:46:12  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    544,320         761,543  1.3991
14 Sep 2012 16:35:52  1174717  15200879   hadcm3n_o0tc_2100_40_008166583_4    518,400         725,038  1.3986
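
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the timestep count; this is inferred from the numbers, not stated on the page. The Python sketch below checks that reading against a few rows copied from the table and also computes the CPU cost of the interval between two consecutive trickles. The function names are hypothetical and chosen only for illustration.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU seconds, reported average sec/TS)
TRICKLES = [
    (1_010_880, 2_312_639, 2.2877),  # 16 Oct 2012 00:53:12
    (648_000, 1_791_192, 2.7642),    # 05 Oct 2012 13:33:28
    (622_080, 872_073, 1.4019),      # 16 Sep 2012 11:35:12
    (596_160, 835_409, 1.4013),      # 16 Sep 2012 01:37:48
]


def cumulative_average(timestep, cpu_seconds):
    """Assumed definition of the Average column: total CPU seconds per timestep."""
    return cpu_seconds / timestep


def matches_reported(rows, tolerance=1e-4):
    """True for each row whose reported average agrees with the assumed definition."""
    return [abs(cumulative_average(ts, cpu) - avg) <= tolerance for ts, cpu, avg in rows]


def interval_rate(newer, older):
    """CPU seconds per timestep spent between two trickles (newer row first)."""
    return (newer[1] - older[1]) / (newer[0] - older[0])


if __name__ == "__main__":
    print(matches_reported(TRICKLES))
    # Interval spanning the gap between 16 Sep and 05 Oct.
    print(interval_rate(TRICKLES[1], TRICKLES[2]))
    # An ordinary interval before the gap, for comparison.
    print(interval_rate(TRICKLES[2], TRICKLES[3]))
```

Comparing the two interval rates shows that the single 25,920-timestep interval spanning 16 Sep to 05 Oct accounts for far more CPU time than a normal interval. That would explain why the cumulative average jumps from about 1.40 to 2.76 and then declines slowly, though the page itself does not say what happened during the gap.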

