climateprediction.net home page
Task 15655409

Task 15655409

Name hadcm3n_o3sp_2140_40_008269406_2
Workunit 8424530
Created 9 Mar 2013, 20:26:17 UTC
Sent 9 Mar 2013, 20:26:28 UTC
Report deadline 9 Jun 2013, 3:53:39 UTC
Received 29 Mar 2013, 18:57:34 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 546879
Run time 13 days 19 hours 38 min 17 sec
CPU time 13 days 19 hours 38 min 17 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.82 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:20:02 (4632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:20:03 (4632): No heartbeat from core client for 30 sec - exiting
07:20:04 (4632): No heartbeat from core client for 30 sec - exiting
07:20:05 (4632): No heartbeat from core client for 30 sec - exiting
07:20:06 (4632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
07:25:59 (4440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
06:57:40 (4392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:28:29 (3612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:28:30 (3612): No heartbeat from core client for 30 sec - exiting
20:28:31 (3612): No heartbeat from core client for 30 sec - exiting
20:28:33 (3612): No heartbeat from core client for 30 sec - exiting
20:28:34 (3612): No heartbeat from core client for 30 sec - exiting
20:28:35 (3612): No heartbeat from core client for 30 sec - exiting
20:28:36 (3612): No heartbeat from core client for 30 sec - exiting
20:28:37 (3612): No heartbeat from core client for 30 sec - exiting
20:28:38 (3612): No heartbeat from core client for 30 sec - exiting
20:28:39 (3612): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
06:15:29 (4392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:15:30 (4392): No heartbeat from core client for 30 sec - exiting
06:15:31 (4392): No heartbeat from core client for 30 sec - exiting
06:15:32 (4392): No heartbeat from core client for 30 sec - exiting
06:15:33 (4392): No heartbeat from core client for 30 sec - exiting
06:15:35 (4392): No heartbeat from core client for 30 sec - exiting
06:15:36 (4392): No heartbeat from core client for 30 sec - exiting
06:15:37 (4392): No heartbeat from core client for 30 sec - exiting
06:15:38 (4392): No heartbeat from core client for 30 sec - exiting
06:15:39 (4392): No heartbeat from core client for 30 sec - exiting
06:15:40 (4392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:18:07 (4528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:26:17 (4284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	06:50:17 AM	No files match the supplied pattern.
MainError:	06:50:17 AM	No files match the supplied pattern.
MainError:	05:43:54 PM	No files match the supplied pattern.
MainError:	05:43:54 PM	No files match the supplied pattern.
MainError:	04:59:16 AM	No files match the supplied pattern.
MainError:	04:59:17 AM	No files match the supplied pattern.
MainError:	05:17:55 PM	No files match the supplied pattern.
MainError:	05:17:55 PM	No files match the supplied pattern.
06:12:08 (4972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:12:09 (4972): No heartbeat from core client for 30 sec - exiting
06:12:10 (4972): No heartbeat from core client for 30 sec - exiting
06:12:11 (4972): No heartbeat from core client for 30 sec - exiting
06:12:12 (4972): No heartbeat from core client for 30 sec - exiting
06:12:13 (4972): No heartbeat from core client for 30 sec - exiting
06:12:14 (4972): No heartbeat from core client for 30 sec - exiting
06:12:15 (4972): No heartbeat from core client for 30 sec - exiting
06:12:16 (4972): No heartbeat from core client for 30 sec - exiting
06:12:17 (4972): No heartbeat from core client for 30 sec - exiting
06:12:18 (4972): No heartbeat from core client for 30 sec - exiting
06:15:48 (404): No heartbeat from core client for 30 sec - exiting
06:15:49 (404): No heartbeat from core client for 30 sec - exiting
06:15:50 (404): No heartbeat from core client for 30 sec - exiting
06:15:51 (404): No heartbeat from core client for 30 sec - exiting
06:15:52 (404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:15:53 (404): No heartbeat from core client for 30 sec - exiting
MainError:	06:15:20 AM	No files match the supplied pattern.
MainError:	06:15:22 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
06:48:55 (4420): No heartbeat from core client for 30 sec - exiting
06:48:56 (4420): No heartbeat from core client for 30 sec - exiting
06:48:57 (4420): No heartbeat from core client for 30 sec - exiting
06:48:58 (4420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:48:59 (4420): No heartbeat from core client for 30 sec - exiting
06:49:00 (4420): No heartbeat from core client for 30 sec - exiting
06:49:02 (4420): No heartbeat from core client for 30 sec - exiting
06:49:03 (4420): No heartbeat from core client for 30 sec - exiting
06:49:04 (4420): No heartbeat from core client for 30 sec - exiting
06:49:05 (4420): No heartbeat from core client for 30 sec - exiting
06:49:06 (4420): No heartbeat from core client for 30 sec - exiting
06:49:07 (4420): No heartbeat from core client for 30 sec - exiting
06:49:07 (5248): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	01:51:06 AM	No files match the supplied pattern.
MainError:	01:51:06 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	03:42:53 AM	No files match the supplied pattern.
MainError:	03:42:53 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	02:05:46 AM	No files match the supplied pattern.
MainError:	02:05:46 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:07:40 (4704): No heartbeat from core client for 30 sec - exiting
07:07:41 (4704): No heartbeat from core client for 30 sec - exiting
07:07:42 (4704): No heartbeat from core client for 30 sec - exiting
07:07:44 (4704): No heartbeat from core client for 30 sec - exiting
07:07:45 (4704): No heartbeat from core client for 30 sec - exiting
07:07:46 (4704): No heartbeat from core client for 30 sec - exiting
07:07:47 (4704): No heartbeat from core client for 30 sec - exiting
07:07:48 (4704): No heartbeat from core client for 30 sec - exiting
07:07:49 (4704): No heartbeat from core client for 30 sec - exiting
07:07:50 (4704): No heartbeat from core client for 30 sec - exiting
07:07:51 (4704): No heartbeat from core client for 30 sec - exiting
07:07:52 (4704): No heartbeat from core client for 30 sec - exiting
07:07:53 (4704): No heartbeat from core client for 30 sec - exiting
07:07:54 (4704): No heartbeat from core client for 30 sec - exiting
07:07:56 (4704): No heartbeat from core client for 30 sec - exiting
07:07:57 (4704): No heartbeat from core client for 30 sec - exiting
07:07:58 (4704): No heartbeat from core client for 30 sec - exiting
07:07:59 (4704): No heartbeat from core client for 30 sec - exiting
07:08:00 (4704): No heartbeat from core client for 30 sec - exiting
07:08:01 (4704): No heartbeat from core client for 30 sec - exiting
07:08:02 (4704): No heartbeat from core client for 30 sec - exiting
07:08:03 (4704): No heartbeat from core client for 30 sec - exiting
07:08:04 (4704): No heartbeat from core client for 30 sec - exiting
07:08:05 (4704): No heartbeat from core client for 30 sec - exiting
07:08:06 (4704): No heartbeat from core client for 30 sec - exiting
07:08:08 (4704): No heartbeat from core client for 30 sec - exiting
07:08:09 (4704): No heartbeat from core client for 30 sec - exiting
07:08:10 (4704): No heartbeat from core client for 30 sec - exiting
07:08:11 (4704): No heartbeat from core client for 30 sec - exiting
07:08:12 (4704): No heartbeat from core client for 30 sec - exiting
07:08:13 (4704): No heartbeat from core client for 30 sec - exiting
07:08:14 (4704): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	10:42:53 PM	No files match the supplied pattern.
MainError:	10:42:53 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
06:07:25 (4404): No heartbeat from core client for 30 sec - exiting
06:07:26 (4404): No heartbeat from core client for 30 sec - exiting
06:07:27 (4404): No heartbeat from core client for 30 sec - exiting
06:07:28 (4404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:07:29 (4404): No heartbeat from core client for 30 sec - exiting
06:07:30 (4404): No heartbeat from core client for 30 sec - exiting
MainError:	08:29:00 PM	No files match the supplied pattern.
MainError:	08:29:01 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Error converting file to netcdf: dataout/o3spka.ph11c10
Error converting file to netcdf: dataout/o3spka.pg11c10
Error converting file to netcdf: dataout/o3spka.pe11c10
MainError:	11:05:17 AM	No files match the supplied pattern.
MainError:	11:05:17 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Mar 2013 11:06:47 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 777,600 1,201,677 1.5454
28 Mar 2013 20:29:31 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 751,680 1,160,637 1.5441
27 Mar 2013 22:44:56 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 725,760 1,118,372 1.5410
27 Mar 2013 02:09:47 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 699,840 1,076,946 1.5388
26 Mar 2013 03:44:13 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 673,920 1,035,353 1.5363
25 Mar 2013 01:55:33 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 648,000 992,503 1.5316
24 Mar 2013 06:16:23 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 622,080 950,522 1.5280
23 Mar 2013 17:21:11 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 596,160 906,967 1.5213
23 Mar 2013 05:01:51 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 570,240 864,826 1.5166
22 Mar 2013 17:48:20 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 544,320 825,180 1.5160
22 Mar 2013 06:53:21 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 518,400 786,254 1.5167
21 Mar 2013 13:20:22 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 492,480 747,329 1.5175
21 Mar 2013 02:17:32 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 466,560 707,978 1.5174
20 Mar 2013 08:01:54 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 440,640 669,026 1.5183
19 Mar 2013 13:30:19 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 414,720 629,237 1.5173
19 Mar 2013 02:32:33 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 388,800 590,142 1.5179
18 Mar 2013 09:03:47 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 362,880 550,651 1.5174
17 Mar 2013 21:29:49 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 336,960 511,752 1.5187
17 Mar 2013 02:57:42 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 311,040 472,249 1.5183
16 Mar 2013 14:40:31 546879 15655409 hadcm3n_o3sp_2140_40_008269406_2 285,120 432,710 1.5176


©2024 climateprediction.net