Task 15644094

Name hadcm3n_o2rj_2140_40_008269636_3
Workunit 8424760
Created 1 Mar 2013, 8:45:43 UTC
Sent 1 Mar 2013, 8:45:51 UTC
Report deadline 31 May 2013, 16:13:02 UTC
Received 19 Apr 2013, 11:58:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1221441
Run time 17 days 19 hours 5 min 59 sec
CPU time 16 days 23 hours 24 min 35 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.73 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4768, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=316, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3396, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4748, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2588, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	04:18:40 AM	No files match the supplied pattern.
MainError:	04:18:40 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	06:18:03 PM	No files match the supplied pattern.
MainError:	06:18:03 PM	No files match the supplied pattern.
MainError:	09:05:20 AM	No files match the supplied pattern.
MainError:	09:05:20 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	06:57:11 AM	No files match the supplied pattern.
MainError:	06:57:11 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	08:07:02 AM	No files match the supplied pattern.
MainError:	08:07:02 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	11:32:30 PM	No files match the supplied pattern.
MainError:	11:32:30 PM	No files match the supplied pattern.
MainError:	12:43:33 AM	No files match the supplied pattern.
MainError:	12:43:33 AM	No files match the supplied pattern.
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5160, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=1
Model crash detected, will try to restart...
MainError:	08:45:00 AM	No files match the supplied pattern.
MainError:	08:45:00 AM	No files match the supplied pattern.
MainError:	03:15:27 PM	No files match the supplied pattern.
MainError:	03:15:27 PM	No files match the supplied pattern.
MainError:	01:40:55 PM	No files match the supplied pattern.
MainError:	01:40:55 PM	No files match the supplied pattern.
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4432, iMonCtr=1
Model crash detected, will try to restart...
Error converting file to netcdf: dataout/o2rjka.ph11c10
Error converting file to netcdf: dataout/o2rjka.pg11c10
Error converting file to netcdf: dataout/o2rjka.pe11c10
MainError:	12:14:14 AM	No files match the supplied pattern.
MainError:	12:14:14 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 Apr 2013 12:17:50 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 777,600 1,466,657 1.8861
16 Apr 2013 13:45:07 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 751,680 1,418,595 1.8872
11 Apr 2013 15:20:31 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 725,760 1,369,003 1.8863
10 Apr 2013 08:53:18 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 699,840 1,320,133 1.8863
05 Apr 2013 12:56:33 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 673,920 1,269,913 1.8844
03 Apr 2013 23:37:00 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 648,000 1,220,421 1.8834
03 Apr 2013 08:55:37 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 622,080 1,170,713 1.8819
29 Mar 2013 07:30:23 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 596,160 1,122,212 1.8824
25 Mar 2013 09:07:20 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 570,240 1,074,146 1.8837
24 Mar 2013 18:20:41 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 544,320 1,025,257 1.8836
24 Mar 2013 04:20:41 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 518,400 975,516 1.8818
23 Mar 2013 07:32:49 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 492,480 927,881 1.8841
21 Mar 2013 07:43:56 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 466,560 879,119 1.8843
20 Mar 2013 17:11:01 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 440,640 830,163 1.8840
20 Mar 2013 03:20:46 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 414,720 781,904 1.8854
18 Mar 2013 12:14:46 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 388,800 733,753 1.8872
16 Mar 2013 08:07:59 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 362,880 686,101 1.8907
15 Mar 2013 05:22:34 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 336,960 636,846 1.8900
14 Mar 2013 14:40:27 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 311,040 587,303 1.8882
14 Mar 2013 00:32:23 1221441 15644094 hadcm3n_o2rj_2140_40_008269636_3 285,120 537,595 1.8855
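
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. For the latest trickle, 1,466,657 / 777,600 ≈ 1.8861. The sketch below parses one trickle row and recomputes that figure. It assumes the rows are whitespace-separated exactly as shown above. The function names `parse_trickle` and `sec_per_timestep` are illustrative and not part of BOINC or CPDN.

```python
def parse_trickle(row: str) -> dict:
    """Split one trickle row into named fields (layout assumed from the table above)."""
    # The date occupies three tokens (day, month, year) and the time one token.
    day, month, year, clock, host_id, result_id, name, timestep, cpu_sec, avg = row.split()
    return {
        "sent_utc": f"{day} {month} {year} {clock}",
        "host_id": int(host_id),
        "result_id": int(result_id),
        "result_name": name,
        # Numeric columns use thousands separators, so strip the commas first.
        "timestep": int(timestep.replace(",", "")),
        "cpu_sec": int(cpu_sec.replace(",", "")),
        "avg_sec_per_ts": float(avg),
    }


def sec_per_timestep(row: str) -> float:
    """Recompute the average CPU seconds per timestep, rounded to 4 decimals."""
    t = parse_trickle(row)
    return round(t["cpu_sec"] / t["timestep"], 4)


if __name__ == "__main__":
    latest = ("18 Apr 2013 12:17:50 1221441 15644094 "
              "hadcm3n_o2rj_2140_40_008269636_3 777,600 1,466,657 1.8861")
    print(sec_per_timestep(latest))
```

Consecutive rows in the table differ by 25,920 timesteps, so the same parser can also be used to check that no trickle is missing between two rows.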

