Task 15682763

Name: hadcm3n_o7bq_2140_40_008269939_1
Workunit: 8425063
Created: 25 Mar 2013, 14:04:29 UTC
Sent: 25 Mar 2013, 14:05:06 UTC
Report deadline: 24 Jun 2013, 21:32:17 UTC
Received: 13 Apr 2013, 5:46:20 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 22 (0x00000016) Unknown error code
Computer ID: 1278384
Run time: 16 days 10 hours 22 min 58 sec
CPU time: 15 days 16 hours 50 min 24 sec
Validate state: Invalid
Credit: 9,331.20
Device peak FLOPS: 2.76 GFLOPS
Application version: UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4664, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
MainError:	02:57:14 PM	No files match the supplied pattern.
MainError:	02:57:14 PM	No files match the supplied pattern.
MainError:	04:08:16 AM	No files match the supplied pattern.
MainError:	04:08:16 AM	No files match the supplied pattern.
MainError:	05:07:41 PM	No files match the supplied pattern.
MainError:	05:07:41 PM	No files match the supplied pattern.
MainError:	06:09:40 AM	No files match the supplied pattern.
MainError:	06:09:40 AM	No files match the supplied pattern.
MainError:	07:21:11 PM	No files match the supplied pattern.
MainError:	07:21:11 PM	No files match the supplied pattern.
MainError:	08:19:00 AM	No files match the supplied pattern.
MainError:	08:19:00 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
MainError:	09:29:52 PM	No files match the supplied pattern.
MainError:	09:29:52 PM	No files match the supplied pattern.
MainError:	10:46:20 AM	No files match the supplied pattern.
MainError:	10:46:20 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	11:41:33 PM	No files match the supplied pattern.
MainError:	11:41:33 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
MainError:	01:53:11 PM	No files match the supplied pattern.
MainError:	01:53:11 PM	No files match the supplied pattern.
00:33:43 (3440): No heartbeat from core client for 30 sec - exiting
00:33:44 (3440): No heartbeat from core client for 30 sec - exiting
00:33:46 (3440): No heartbeat from core client for 30 sec - exiting
00:33:47 (3440): No heartbeat from core client for 30 sec - exiting
00:33:48 (3440): No heartbeat from core client for 30 sec - exiting
00:33:49 (3440): No heartbeat from core client for 30 sec - exiting
00:33:50 (3440): No heartbeat from core client for 30 sec - exiting
00:33:51 (3440): No heartbeat from core client for 30 sec - exiting
00:33:52 (3440): No heartbeat from core client for 30 sec - exiting
00:33:53 (3440): No heartbeat from core client for 30 sec - exiting
00:33:54 (3440): No heartbeat from core client for 30 sec - exiting
00:33:55 (3440): No heartbeat from core client for 30 sec - exiting
00:33:56 (3440): No heartbeat from core client for 30 sec - exiting
00:33:58 (3440): No heartbeat from core client for 30 sec - exiting
00:33:59 (3440): No heartbeat from core client for 30 sec - exiting
00:34:00 (3440): No heartbeat from core client for 30 sec - exiting
00:34:01 (3440): No heartbeat from core client for 30 sec - exiting
00:34:02 (3440): No heartbeat from core client for 30 sec - exiting
00:34:03 (3440): No heartbeat from core client for 30 sec - exiting
00:34:04 (3440): No heartbeat from core client for 30 sec - exiting
00:34:05 (3440): No heartbeat from core client for 30 sec - exiting
00:34:06 (3440): No heartbeat from core client for 30 sec - exiting
00:34:07 (3440): No heartbeat from core client for 30 sec - exiting
00:34:08 (3440): No heartbeat from core client for 30 sec - exiting
Error converting file to netcdf: dataout/o7bqko.pc20c10
Error converting file to netcdf: dataout/o7bqko.pb20c10
Error converting file to netcdf: dataout/o7bqko.pa20c10
MainError:	04:46:28 AM	No files match the supplied pattern.
MainError:	04:46:28 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: C I/O Error feof - Unit 62 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: C I/O Error feof - Unit 62 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: C I/O Error feof - Unit 62 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: C I/O Error feof - Unit 62 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: C I/O Error feof - Unit 62 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: C I/O Error feof - Unit 62 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
12 Apr 2013 03:07:21  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  777,600   1,354,503       1.7419
11 Apr 2013 14:15:20  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  751,680   1,308,866       1.7413
11 Apr 2013 00:36:26  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  725,760   1,263,376       1.7408
10 Apr 2013 14:45:27  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  699,840   1,217,513       1.7397
09 Apr 2013 21:58:54  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  673,920   1,172,062       1.7392
09 Apr 2013 08:34:54  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  648,000   1,126,727       1.7388
08 Apr 2013 19:22:57  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  622,080   1,081,444       1.7384
08 Apr 2013 15:40:39  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  596,160   1,036,592       1.7388
07 Apr 2013 17:11:24  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  570,240   991,156         1.7381
07 Apr 2013 04:46:36  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  544,320   945,516         1.7371
06 Apr 2013 15:00:34  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  518,400   900,513         1.7371
06 Apr 2013 01:55:49  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  492,480   854,906         1.7359
05 Apr 2013 12:16:16  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  466,560   808,950         1.7339
04 Apr 2013 23:10:58  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  440,640   762,584         1.7306
04 Apr 2013 08:05:22  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  414,720   717,226         1.7294
03 Apr 2013 19:07:27  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  388,800   671,957         1.7283
03 Apr 2013 05:14:08  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  362,880   627,181         1.7283
03 Apr 2013 05:14:08  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  336,960   582,257         1.7280
02 Apr 2013 01:46:00  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  311,040   537,530         1.7282
01 Apr 2013 17:26:41  1274770  15682763   hadcm3n_o7bq_2140_40_008269939_1  285,120   493,132         1.7296
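
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The page does not state this definition, so it is an assumption here. The sketch below checks it against three rows copied from the table. The function and variable names are illustrative only and are not part of BOINC or the CPDN application.

```python
# Sample rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
SAMPLE_TRICKLES = [
    (777_600, 1_354_503, 1.7419),
    (751_680, 1_308_866, 1.7413),
    (285_120, 493_132, 1.7296),
]


def average_sec_per_timestep(cpu_seconds, timestep):
    """Assumed definition: cumulative CPU seconds per completed timestep."""
    return round(cpu_seconds / timestep, 4)


def check_rows(rows):
    """Return one boolean per row: does the recomputed average match the table?"""
    return [
        average_sec_per_timestep(cpu, ts) == reported
        for ts, cpu, reported in rows
    ]


if __name__ == "__main__":
    for ts, cpu, reported in SAMPLE_TRICKLES:
        print(ts, average_sec_per_timestep(cpu, ts), reported)
```

For the sampled rows the recomputed value agrees with the reported one. Consecutive trickles in the table are also spaced 25,920 timesteps apart (for example, 777,600 - 751,680).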
