climateprediction.net home page
Task 15554388

Task 15554388

Name hadcm3n_o39x_2140_40_008269891_1
Workunit 8425015
Created 22 Jan 2013, 3:25:28 UTC
Sent 22 Jan 2013, 3:25:30 UTC
Report deadline 23 Apr 2013, 10:52:41 UTC
Received 15 Feb 2013, 17:41:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1158503
Run time 16 days 13 hours 14 min 11 sec
CPU time 14 days 9 hours 40 min 4 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.63 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:43:38 (4236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:47:20 (32844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:47:21 (32844): No heartbeat from core client for 30 sec - exiting
11:47:22 (32844): No heartbeat from core client for 30 sec - exiting
11:47:23 (32844): No heartbeat from core client for 30 sec - exiting
11:47:24 (32844): No heartbeat from core client for 30 sec - exiting
11:47:25 (32844): No heartbeat from core client for 30 sec - exiting
11:47:26 (32844): No heartbeat from core client for 30 sec - exiting
11:47:27 (32844): No heartbeat from core client for 30 sec - exiting
11:47:28 (32844): No heartbeat from core client for 30 sec - exiting
11:47:29 (32844): No heartbeat from core client for 30 sec - exiting
11:47:30 (32844): No heartbeat from core client for 30 sec - exiting
14:01:07 (12672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:01:08 (12672): No heartbeat from core client for 30 sec - exiting
14:01:09 (12672): No heartbeat from core client for 30 sec - exiting
14:01:10 (12672): No heartbeat from core client for 30 sec - exiting
14:01:11 (12672): No heartbeat from core client for 30 sec - exiting
14:01:12 (12672): No heartbeat from core client for 30 sec - exiting
14:01:13 (12672): No heartbeat from core client for 30 sec - exiting
14:01:14 (12672): No heartbeat from core client for 30 sec - exiting
14:01:15 (12672): No heartbeat from core client for 30 sec - exiting
14:01:16 (12672): No heartbeat from core client for 30 sec - exiting
14:01:17 (12672): No heartbeat from core client for 30 sec - exiting
14:53:55 (12960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	04:16:32 AM	No files match the supplied pattern.
MainError:	04:16:32 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
forrtl: The requested operation cannot be performed on a file with a user-mapped section open.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4436, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	01:49:56 AM	No files match the supplied pattern.
MainError:	01:49:56 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
forrtl: The requested operation cannot be performed on a file with a user-mapped section open.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
MainError:	01:41:09 AM	No files match the supplied pattern.
MainError:	01:41:09 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	11:48:58 PM	No files match the supplied pattern.
MainError:	11:48:58 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	10:44:39 PM	No files match the supplied pattern.
MainError:	10:44:39 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	10:00:26 PM	No files match the supplied pattern.
MainError:	10:00:26 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	08:12:44 PM	No files match the supplied pattern.
MainError:	08:12:45 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	06:03:02 PM	No files match the supplied pattern.
MainError:	06:03:02 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	07:24:33 PM	No files match the supplied pattern.
MainError:	07:24:33 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	06:56:32 PM	No files match the supplied pattern.
MainError:	06:56:32 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Error converting file to netcdf: dataout/o39xka.ph11c10
Error converting file to netcdf: dataout/o39xka.pg11c10
Error converting file to netcdf: dataout/o39xka.pe11c10
MainError:	04:41:12 PM	No files match the supplied pattern.
MainError:	04:41:12 PM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Feb 2013 16:45:52 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 777,600 1,245,752 1.6020
13 Feb 2013 19:01:59 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 751,680 1,202,273 1.5994
12 Feb 2013 19:27:06 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 725,760 1,159,856 1.5981
11 Feb 2013 18:06:42 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 699,840 1,116,797 1.5958
09 Feb 2013 20:16:37 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 673,920 1,073,903 1.5935
08 Feb 2013 22:03:44 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 648,000 1,032,455 1.5933
07 Feb 2013 22:47:57 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 622,080 990,638 1.5925
06 Feb 2013 23:49:55 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 596,160 948,387 1.5908
06 Feb 2013 01:41:35 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 570,240 904,547 1.5863
05 Feb 2013 01:50:27 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 544,320 860,833 1.5815
03 Feb 2013 04:21:39 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 518,400 817,917 1.5778
02 Feb 2013 02:39:15 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 492,480 775,403 1.5745
01 Feb 2013 05:16:06 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 466,560 734,552 1.5744
31 Jan 2013 15:35:42 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 440,640 693,733 1.5744
30 Jan 2013 18:06:05 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 414,720 652,522 1.5734
30 Jan 2013 05:02:40 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 388,800 611,367 1.5724
29 Jan 2013 16:20:43 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 362,880 570,408 1.5719
29 Jan 2013 03:18:40 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 336,960 529,199 1.5705
28 Jan 2013 13:45:18 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 311,040 487,677 1.5679
28 Jan 2013 00:38:09 1158503 15554388 hadcm3n_o39x_2140_40_008269891_1 285,120 446,311 1.5653


©2024 cpdn.org