Task 17229558

Name hadcm3n_s0kv_1940_40_009093912_0
Workunit 9224248
Created 20 Oct 2014, 16:52:25 UTC
Sent 20 Oct 2014, 16:52:42 UTC
Report deadline 20 Jan 2015, 0:19:53 UTC
Received 14 Nov 2014, 11:01:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 740707
Run time 22 days 2 hours 21 min 48 sec
CPU time 17 days 22 hours 1 min 41 sec
Validate state Invalid
Credit 11,819.52
Device peak FLOPS 2.91 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:56:02 (4936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:56:03 (4936): No heartbeat from core client for 30 sec - exiting
02:56:04 (4936): No heartbeat from core client for 30 sec - exiting
02:56:05 (4936): No heartbeat from core client for 30 sec - exiting
02:56:06 (4936): No heartbeat from core client for 30 sec - exiting
02:56:07 (4936): No heartbeat from core client for 30 sec - exiting
02:56:08 (4936): No heartbeat from core client for 30 sec - exiting
02:56:09 (4936): No heartbeat from core client for 30 sec - exiting
02:56:10 (4936): No heartbeat from core client for 30 sec - exiting
02:56:11 (4936): No heartbeat from core client for 30 sec - exiting
02:56:12 (4936): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 Nov 2014 10:54:11 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 984,960 1,546,756 1.5704
12 Nov 2014 22:25:30 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 959,040 1,505,091 1.5694
12 Nov 2014 07:40:10 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 933,120 1,464,388 1.5693
11 Nov 2014 19:45:21 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 907,200 1,424,006 1.5697
11 Nov 2014 06:43:35 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 881,280 1,383,973 1.5704
10 Nov 2014 17:55:16 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 855,360 1,343,123 1.5702
10 Nov 2014 05:29:39 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 829,440 1,301,940 1.5697
09 Nov 2014 17:08:49 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 803,520 1,260,303 1.5685
09 Nov 2014 01:42:15 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 777,600 1,219,350 1.5681
08 Nov 2014 11:07:30 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 751,680 1,178,207 1.5674
07 Nov 2014 22:15:59 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 725,760 1,137,332 1.5671
07 Nov 2014 09:59:29 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 699,840 1,096,212 1.5664
06 Nov 2014 21:44:01 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 673,920 1,054,629 1.5649
06 Nov 2014 10:27:14 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 648,000 1,013,375 1.5639
05 Nov 2014 23:07:34 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 622,080 974,595 1.5667
05 Nov 2014 10:33:51 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 596,160 934,620 1.5677
04 Nov 2014 22:07:29 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 570,240 893,304 1.5665
04 Nov 2014 06:12:39 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 544,320 853,204 1.5675
03 Nov 2014 15:24:04 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 518,400 812,615 1.5675
03 Nov 2014 00:53:06 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 492,480 771,649 1.5669
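The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against two rows copied from the table. The names `Trickle`, `parse_trickle_row`, and `seconds_per_timestep` are hypothetical helpers introduced for illustration and are not part of BOINC or CPDN. It assumes the last three whitespace-separated fields of each row are the timestep, the CPU time in seconds, and the reported average.

```python
from typing import NamedTuple


class Trickle(NamedTuple):
    """One row of the trickle table (hypothetical helper type)."""
    timestep: int
    cpu_seconds: int
    reported_avg: float


def parse_trickle_row(row: str) -> Trickle:
    # Assumption: the final three fields are timestep, CPU time, and average.
    # Thousands separators are stripped before converting to int.
    *_, timestep, cpu, avg = row.split()
    return Trickle(
        int(timestep.replace(",", "")),
        int(cpu.replace(",", "")),
        float(avg),
    )


def seconds_per_timestep(t: Trickle) -> float:
    # Cumulative CPU seconds divided by the number of completed timesteps.
    return t.cpu_seconds / t.timestep


# Two rows copied verbatim from the table above (newest and oldest).
rows = [
    "13 Nov 2014 10:54:11 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 984,960 1,546,756 1.5704",
    "03 Nov 2014 00:53:06 740707 17229558 hadcm3n_s0kv_1940_40_009093912_0 492,480 771,649 1.5669",
]

for row in rows:
    t = parse_trickle_row(row)
    print(t.timestep, round(seconds_per_timestep(t), 4), t.reported_avg)
```

If the recomputed value agrees with the reported one to four decimal places, the column is consistent with this interpretation. A mismatch would suggest the average is computed some other way, for example per trickle interval.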