climateprediction.net home page
Task 14911180

Task 14911180

Name hadcm3n_y9sz_1980_40_008049647_2
Workunit 8204761
Created 13 Jul 2012, 7:21:16 UTC
Sent 13 Jul 2012, 7:31:24 UTC
Report deadline 12 Oct 2012, 14:58:35 UTC
Received 19 Aug 2012, 22:15:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1139859
Run time 15 days 11 hours 44 min 1 sec
CPU time 8 days 11 hours 8 min 51 sec
Validate state Invalid
Credit 6,531.84
Device peak FLOPS 2.52 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:49:56 (5860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

Model crashed: TEMPHIST: Failed in OPEN of history file                                                                                                                                                                                                                        tmp/pipe_dummy                                                                  2048    
07:11:52 (7968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:26:08 (6064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:26:09 (6064): No heartbeat from core client for 30 sec - exiting
06:26:10 (6064): No heartbeat from core client for 30 sec - exiting
06:28:11 (6404): No heartbeat from core client for 30 sec - exiting
06:28:12 (6404): No heartbeat from core client for 30 sec - exiting
06:28:13 (6404): No heartbeat from core client for 30 sec - exiting
06:28:14 (6404): No heartbeat from core client for 30 sec - exiting
06:28:15 (6404): No heartbeat from core client for 30 sec - exiting
06:28:16 (6404): No heartbeat from core client for 30 sec - exiting
06:28:17 (6404): No heartbeat from core client for 30 sec - exiting
06:28:18 (6404): No heartbeat from core client for 30 sec - exiting
06:28:19 (6404): No heartbeat from core client for 30 sec - exiting
06:28:20 (6404): No heartbeat from core client for 30 sec - exiting
06:28:21 (6404): No heartbeat from core client for 30 sec - exiting
06:28:22 (6404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:28:23 (6404): No heartbeat from core client for 30 sec - exiting
06:28:24 (6404): No heartbeat from core client for 30 sec - exiting
06:28:25 (6404): No heartbeat from core client for 30 sec - exiting
06:28:26 (6404): No heartbeat from core client for 30 sec - exiting
06:28:27 (6404): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:12:02 (3004): No heartbeat from core client for 30 sec - exiting
18:12:03 (3004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:12:04 (3004): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5348, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5348, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5348, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5348, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5204, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5204, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
19 Aug 2012 15:50:54 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 544,320 716,654 1.3166
18 Aug 2012 21:16:44 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 518,400 679,946 1.3116
16 Aug 2012 17:37:50 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 492,480 643,686 1.3070
15 Aug 2012 23:13:46 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 466,560 606,921 1.3008
14 Aug 2012 03:35:28 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 440,640 570,886 1.2956
13 Aug 2012 12:15:07 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 414,720 534,570 1.2890
12 Aug 2012 22:16:51 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 388,800 498,500 1.2822
12 Aug 2012 05:49:52 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 362,880 462,225 1.2738
11 Aug 2012 15:53:08 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 336,960 425,921 1.2640
10 Aug 2012 22:39:51 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 311,040 389,320 1.2517
10 Aug 2012 01:45:35 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 285,120 352,120 1.2350
09 Aug 2012 05:46:06 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 259,200 315,413 1.2169
08 Aug 2012 12:21:58 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 233,280 278,917 1.1956
07 Aug 2012 16:00:20 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 207,360 290,571 1.4013
07 Aug 2012 01:08:20 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 181,440 254,802 1.4043
06 Aug 2012 08:03:46 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 155,520 218,368 1.4041
04 Aug 2012 21:29:26 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 129,600 182,192 1.4058
04 Aug 2012 03:54:51 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 103,680 146,128 1.4094
29 Jul 2012 12:19:55 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 77,760 109,736 1.4112
27 Jul 2012 19:13:50 1139859 14911180 hadcm3n_y9sz_1980_40_008049647_2 51,840 71,978 1.3885


©2024 climateprediction.net