Task 16028580

Name hadcm3n_857v_1980_40_008464719_1
Workunit 8615558
Created 20 Sep 2013, 16:43:19 UTC
Sent 20 Sep 2013, 16:59:31 UTC
Report deadline 21 Dec 2013, 0:26:42 UTC
Received 29 Oct 2013, 5:34:18 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1238397
Run time 8 days 12 hours 54 min 2 sec
CPU time 8 days 0 hours 46 min 41 sec
Validate state Invalid
Credit 7,464.96
Device peak FLOPS 3.23 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:53:42 (6964): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
21:53:43 (6964): No heartbeat from core client for 30 sec - exiting
18:03:38 (1072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:03:40 (1072): No heartbeat from core client for 30 sec - exiting
18:03:41 (1072): No heartbeat from core client for 30 sec - exiting
18:03:42 (1072): No heartbeat from core client for 30 sec - exiting
18:03:43 (1072): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:19:08 (17124): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
15:19:10 (17124): No heartbeat from core client for 30 sec - exiting
15:19:11 (17124): No heartbeat from core client for 30 sec - exiting
15:19:12 (17124): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:57:13 (11860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:57:15 (11860): No heartbeat from core client for 30 sec - exiting
19:57:55 (4296): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:18:22 (2268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:44:15 (6356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6972, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6412, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:26:53 (6848): No heartbeat from core client for 30 sec - exiting
18:26:54 (6848): No heartbeat from core client for 30 sec - exiting
18:26:55 (6848): No heartbeat from core client for 30 sec - exiting
18:26:56 (6848): No heartbeat from core client for 30 sec - exiting
18:26:57 (6848): No heartbeat from core client for 30 sec - exiting
18:26:58 (6848): No heartbeat from core client for 30 sec - exiting
18:26:59 (6848): No heartbeat from core client for 30 sec - exiting
18:27:00 (6848): No heartbeat from core client for 30 sec - exiting
18:27:01 (6848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4100, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
04:52:35 (4100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:52:29 (2456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:43:31 (2268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8264, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8264, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Oct 2013 14:44:50 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 622,080 685,448 1.1019
07 Oct 2013 05:53:12 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 596,160 655,865 1.1001
05 Oct 2013 20:38:37 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 570,240 625,174 1.0963
05 Oct 2013 11:39:58 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 544,320 594,832 1.0928
05 Oct 2013 02:51:32 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 518,400 564,439 1.0888
04 Oct 2013 17:56:43 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 492,480 533,933 1.0842
04 Oct 2013 09:47:08 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 466,560 505,307 1.0830
02 Oct 2013 20:25:08 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 440,640 478,609 1.0862
02 Oct 2013 12:43:52 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 414,720 451,285 1.0882
02 Oct 2013 05:12:26 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 388,800 424,750 1.0925
30 Sep 2013 18:08:24 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 362,880 397,414 1.0952
30 Sep 2013 10:46:07 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 336,960 370,983 1.1010
26 Sep 2013 19:58:02 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 311,040 343,259 1.1036
26 Sep 2013 11:08:15 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 285,120 312,787 1.0970
26 Sep 2013 02:59:50 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 259,200 283,686 1.0945
25 Sep 2013 13:37:42 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 233,280 256,009 1.0974
25 Sep 2013 09:28:43 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 207,360 229,176 1.1052
23 Sep 2013 12:44:17 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 181,440 200,243 1.1036
23 Sep 2013 10:19:21 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 155,520 172,561 1.1096
23 Sep 2013 10:19:21 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 129,600 142,826 1.1021
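
The "Average (sec/TS)" column in the rows above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against a few rows copied from the table. The function names `parse_trickle` and `average_sec_per_ts` are hypothetical helpers introduced for illustration, and the whitespace-based parsing assumes the flattened row layout shown here; none of this is part of BOINC or CPDN tooling.

```python
# Minimal sketch: recompute "Average (sec/TS)" from a trickle row.
# Assumption: the last three whitespace-separated fields of each row are
# timestep, CPU time in seconds, and the reported average, in that order.

SAMPLE_ROWS = [
    "07 Oct 2013 14:44:50 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 622,080 685,448 1.1019",
    "07 Oct 2013 05:53:12 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 596,160 655,865 1.1001",
    "23 Sep 2013 10:19:21 1238397 16028580 hadcm3n_857v_1980_40_008464719_1 129,600 142,826 1.1021",
]


def parse_trickle(line):
    """Return (timestep, cpu_seconds, reported_average) from one table row."""
    fields = line.split()
    timestep = int(fields[-3].replace(",", ""))
    cpu_seconds = int(fields[-2].replace(",", ""))
    reported_average = float(fields[-1])
    return timestep, cpu_seconds, reported_average


def average_sec_per_ts(timestep, cpu_seconds):
    """CPU seconds per model timestep, rounded like the table column."""
    return round(cpu_seconds / timestep, 4)


if __name__ == "__main__":
    for row in SAMPLE_ROWS:
        timestep, cpu_seconds, reported = parse_trickle(row)
        computed = average_sec_per_ts(timestep, cpu_seconds)
        print(timestep, cpu_seconds, reported, computed)
```

If the computed and reported values agree for every row, the column is a running average over the whole task rather than a per-interval rate.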