climateprediction.net home page
Task 15502770

Task 15502770

Name hadcm3n_o7fm_2140_40_008269875_0
Workunit 8424999
Created 24 Dec 2012, 0:25:02 UTC
Sent 24 Dec 2012, 6:52:45 UTC
Report deadline 25 Mar 2013, 14:19:56 UTC
Received 15 Jan 2013, 15:20:47 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1260087
Run time 21 days 8 hours 27 min 48 sec
CPU time 16 days 21 hours 58 min 42 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 1.98 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
02:07:46 (11428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	07:53:33 AM	No files match the supplied pattern.
MainError:	07:53:33 AM	No files match the supplied pattern.
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3676, iMonCtr=1
Model crash detected, will try to restart...
MainError:	12:17:55 AM	No files match the supplied pattern.
MainError:	12:17:55 AM	No files match the supplied pattern.
MainError:	06:55:04 PM	No files match the supplied pattern.
MainError:	06:55:04 PM	No files match the supplied pattern.
MainError:	11:54:05 AM	No files match the supplied pattern.
MainError:	11:54:05 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	06:28:36 AM	No files match the supplied pattern.
MainError:	06:28:36 AM	No files match the supplied pattern.
MainError:	12:32:48 AM	No files match the supplied pattern.
MainError:	12:32:48 AM	No files match the supplied pattern.
MainError:	07:10:20 PM	No files match the supplied pattern.
MainError:	07:10:20 PM	No files match the supplied pattern.
01:00:16 (5804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	12:06:00 AM	No files match the supplied pattern.
MainError:	12:06:00 AM	No files match the supplied pattern.
MainError:	05:18:28 AM	No files match the supplied pattern.
MainError:	05:18:28 AM	No files match the supplied pattern.
MainError:	10:35:42 PM	No files match the supplied pattern.
MainError:	10:35:42 PM	No files match the supplied pattern.
Error converting file to netcdf: dataout/o7fmka.ph11c10
Error converting file to netcdf: dataout/o7fmka.pg11c10
Error converting file to netcdf: dataout/o7fmka.pe11c10
MainError:	02:00:13 PM	No files match the supplied pattern.
MainError:	02:00:13 PM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Jan 2013 14:00:34 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 777,600 1,646,246 2.1171
14 Jan 2013 22:36:03 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 751,680 1,592,069 2.1180
14 Jan 2013 05:22:54 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 725,760 1,536,892 2.1176
13 Jan 2013 12:09:56 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 699,840 1,481,093 2.1163
12 Jan 2013 19:12:07 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 673,920 1,425,740 2.1156
12 Jan 2013 00:32:46 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 648,000 1,369,701 2.1137
11 Jan 2013 06:28:54 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 622,080 1,313,726 2.1118
10 Jan 2013 12:45:11 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 596,160 1,258,268 2.1106
09 Jan 2013 18:56:34 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 570,240 1,202,610 2.1090
09 Jan 2013 00:33:16 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 544,320 1,146,163 2.1057
08 Jan 2013 07:56:47 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 518,400 1,091,761 2.1060
07 Jan 2013 15:51:46 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 492,480 1,036,690 2.1050
07 Jan 2013 00:24:13 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 466,560 982,075 2.1049
06 Jan 2013 06:55:45 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 440,640 927,102 2.1040
05 Jan 2013 13:50:32 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 414,720 870,951 2.1001
04 Jan 2013 20:38:09 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 388,800 815,977 2.0987
04 Jan 2013 03:49:17 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 362,880 761,014 2.0972
03 Jan 2013 11:46:35 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 336,960 706,993 2.0982
02 Jan 2013 18:52:54 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 311,040 651,037 2.0931
02 Jan 2013 02:28:46 1260087 15502770 hadcm3n_o7fm_2140_40_008269875_0 285,120 596,036 2.0905


©2024 cpdn.org