climateprediction.net home page
Task 13412224

Task 13412224

Name hadcm3n_u1pi_1980_40_007459975_3
Workunit 7657478
Created 23 Sep 2011, 0:11:32 UTC
Sent 23 Sep 2011, 0:15:15 UTC
Report deadline 23 Dec 2011, 7:42:26 UTC
Received 4 Nov 2011, 4:35:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1150721
Run time 25 days 14 hours 18 min 50 sec
CPU time 23 days 10 hours 5 min 8 sec
Validate state Invalid
Credit 7,776.00
Device peak FLOPS 2.64 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3672, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4160, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4160, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4136, iMonCtr=1
Model crash detected, will try to restart...
15:42:51 (4136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3196, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3196, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3196, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3140, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Nov 2011 13:15:01 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 648,000 1,991,885 3.0739
02 Nov 2011 01:18:56 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 622,080 1,951,743 3.1374
09 Oct 2011 14:21:50 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 596,160 968,954 1.6253
09 Oct 2011 01:55:25 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 570,240 926,151 1.6241
08 Oct 2011 14:00:15 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 544,320 883,377 1.6229
08 Oct 2011 01:40:37 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 518,400 840,537 1.6214
07 Oct 2011 13:25:21 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 492,480 797,718 1.6198
07 Oct 2011 00:11:02 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 466,560 755,046 1.6183
06 Oct 2011 11:36:37 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 440,640 712,934 1.6180
05 Oct 2011 22:53:49 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 414,720 670,271 1.6162
04 Oct 2011 13:41:18 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 388,800 627,632 1.6143
02 Oct 2011 21:16:55 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 362,880 585,101 1.6124
02 Oct 2011 08:15:19 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 336,960 542,404 1.6097
01 Oct 2011 19:29:05 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 311,040 499,589 1.6062
01 Oct 2011 07:17:15 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 285,120 457,144 1.6033
30 Sep 2011 17:40:50 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 259,200 414,723 1.6000
30 Sep 2011 05:28:06 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 233,280 372,215 1.5956
29 Sep 2011 16:49:50 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 207,360 329,761 1.5903
29 Sep 2011 04:04:11 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 181,440 287,499 1.5845
28 Sep 2011 16:42:16 1150721 13412224 hadcm3n_u1pi_1980_40_007459975_3 155,520 245,761 1.5803


©2024 cpdn.org