climateprediction.net home page
Task 15401579

Task 15401579

Name hadcm3n_o5ts_2100_40_008239987_0
Workunit 8395111
Created 26 Oct 2012, 22:50:48 UTC
Sent 26 Oct 2012, 22:50:57 UTC
Report deadline 26 Jan 2013, 6:18:08 UTC
Received 25 Dec 2012, 15:15:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1215462
Run time 20 days 7 hours 0 min 3 sec
CPU time 20 days 6 hours 7 min 2 sec
Validate state Invalid
Credit 10,264.32
Device peak FLOPS 1.69 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1248, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1248, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2548, iMonCtr=1
Model crash detected, will try to restart...
01:00:14 (2548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2660, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Nov 2012 08:51:16 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 855,360 1,744,630 2.0396
19 Nov 2012 18:22:26 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 829,440 1,692,571 2.0406
19 Nov 2012 03:53:00 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 803,520 1,640,506 2.0416
18 Nov 2012 13:15:23 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 777,600 1,587,627 2.0417
17 Nov 2012 22:13:15 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 751,680 1,533,607 2.0402
14 Nov 2012 10:59:15 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 725,760 1,481,045 2.0407
13 Nov 2012 20:21:37 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 699,840 1,428,611 2.0413
13 Nov 2012 05:33:02 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 673,920 1,375,469 2.0410
12 Nov 2012 14:43:14 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 648,000 1,321,976 2.0401
12 Nov 2012 00:01:18 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 622,080 1,269,306 2.0404
11 Nov 2012 09:32:44 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 596,160 1,217,189 2.0417
10 Nov 2012 19:06:12 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 570,240 1,165,015 2.0430
09 Nov 2012 21:03:46 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 544,320 1,112,292 2.0435
09 Nov 2012 06:00:06 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 518,400 1,058,891 2.0426
08 Nov 2012 15:20:34 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 492,480 1,006,209 2.0431
07 Nov 2012 23:13:09 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 466,560 953,579 2.0439
07 Nov 2012 08:32:37 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 440,640 900,810 2.0443
06 Nov 2012 17:52:17 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 414,720 848,060 2.0449
06 Nov 2012 03:09:56 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 388,800 795,155 2.0452
05 Nov 2012 12:23:41 1215462 15401579 hadcm3n_o5ts_2100_40_008239987_0 362,880 742,064 2.0449


©2024 cpdn.org