Task 15502611

Name hadcm3n_o1he_2140_40_008269730_0
Workunit 8424854
Created 23 Dec 2012, 23:57:10 UTC
Sent 24 Dec 2012, 7:51:03 UTC
Report deadline 25 Mar 2013, 15:18:14 UTC
Received 15 Jan 2013, 7:49:30 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1090056
Run time 19 days 0 hours 18 min 22 sec
CPU time 13 days 3 hours 22 min 54 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.35 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:05:41 (3164): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	07:31:57 PM	No files match the supplied pattern.
MainError:	07:31:57 PM	No files match the supplied pattern.
MainError:	10:56:19 AM	No files match the supplied pattern.
MainError:	10:56:19 AM	No files match the supplied pattern.
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5496, iMonCtr=1
Model crash detected, will try to restart...
03:39:07 (2976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	02:31:05 AM	No files match the supplied pattern.
MainError:	02:31:05 AM	No files match the supplied pattern.
MainError:	05:56:48 PM	No files match the supplied pattern.
MainError:	05:56:48 PM	No files match the supplied pattern.
MainError:	09:13:54 AM	No files match the supplied pattern.
MainError:	09:13:54 AM	No files match the supplied pattern.
MainError:	12:35:39 AM	No files match the supplied pattern.
MainError:	12:35:39 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	03:53:46 PM	No files match the supplied pattern.
MainError:	03:53:46 PM	No files match the supplied pattern.
MainError:	07:32:55 AM	No files match the supplied pattern.
MainError:	07:32:55 AM	No files match the supplied pattern.
MainError:	10:59:41 PM	No files match the supplied pattern.
MainError:	10:59:41 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	03:28:11 PM	No files match the supplied pattern.
MainError:	03:28:11 PM	No files match the supplied pattern.
Error converting file to netcdf: dataout/o1heka.ph11c10
Error converting file to netcdf: dataout/o1heka.pg11c10
Error converting file to netcdf: dataout/o1heka.pe11c10
MainError:	06:33:28 AM	No files match the supplied pattern.
MainError:	06:33:28 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
15 Jan 2013 06:38:01  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  777,600   1,593,371       2.0491
14 Jan 2013 15:30:57  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  751,680   1,539,746       2.0484
13 Jan 2013 23:05:06  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  725,760   1,486,735       2.0485
13 Jan 2013 07:34:15  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  699,840   1,433,164       2.0478
12 Jan 2013 15:55:45  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  673,920   1,379,976       2.0477
12 Jan 2013 00:37:47  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  648,000   1,327,089       2.0480
11 Jan 2013 09:19:22  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  622,080   1,274,015       2.0480
10 Jan 2013 18:01:21  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  596,160   1,221,016       2.0481
10 Jan 2013 02:32:36  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  570,240   1,168,066       2.0484
09 Jan 2013 11:00:35  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  544,320   1,114,946       2.0483
08 Jan 2013 19:39:25  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  518,400   1,061,337       2.0473
08 Jan 2013 04:51:14  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  492,480   1,008,284       2.0474
07 Jan 2013 13:05:38  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  466,560   954,441         2.0457
06 Jan 2013 21:27:54  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  440,640   901,112         2.0450
06 Jan 2013 06:20:38  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  414,720   847,016         2.0424
05 Jan 2013 15:15:48  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  388,800   792,973         2.0395
04 Jan 2013 23:37:23  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  362,880   738,559         2.0353
04 Jan 2013 08:25:17  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  336,960   684,367         2.0310
03 Jan 2013 17:16:22  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  311,040   630,192         2.0261
03 Jan 2013 02:04:26  1090056  15502611   hadcm3n_o1he_2140_40_008269730_0  285,120   575,988         2.0202
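
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is inferred from the numbers. A minimal Python sketch that checks the relationship on three rows copied from the table (the function name `average_sec_per_timestep` is hypothetical, chosen for illustration):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
# Assumption: Average (sec/TS) = CPU Time (sec) / Timestep, rounded to 4 places.
ROWS = [
    (777_600, 1_593_371, 2.0491),  # 15 Jan 2013 06:38:01
    (751_680, 1_539_746, 2.0484),  # 14 Jan 2013 15:30:57
    (285_120, 575_988, 2.0202),    # 03 Jan 2013 02:04:26
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return CPU seconds per model timestep, rounded like the table."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in ROWS:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    status = "match" if abs(computed - reported) < 1e-9 else "differs"
    print(f"timestep={timestep:>7}  computed={computed:.4f}  "
          f"reported={reported:.4f}  {status}")
```

The same division can be applied to the remaining rows to spot any trickle whose reported average does not follow from its CPU time and timestep.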

