Task 15752591

Name hadcm3n_o3h8_2140_40_008269745_4
Workunit 8424869
Created 26 Apr 2013, 21:30:21 UTC
Sent 26 Apr 2013, 21:30:24 UTC
Report deadline 27 Jul 2013, 4:57:35 UTC
Received 19 May 2013, 12:40:47 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1209810
Run time 14 days 6 hours 50 min 53 sec
CPU time 12 days 2 hours 27 min 38 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.90 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2152, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
15:56:45 (3404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5964, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5964, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5964, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	05:21:05 AM	No files match the supplied pattern.
MainError:	05:21:05 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	12:37:18 AM	No files match the supplied pattern.
MainError:	12:37:18 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	10:00:38 PM	No files match the supplied pattern.
MainError:	10:00:38 PM	No files match the supplied pattern.
MainError:	09:32:02 AM	No files match the supplied pattern.
MainError:	09:32:02 AM	No files match the supplied pattern.
MainError:	10:00:11 PM	No files match the supplied pattern.
MainError:	10:00:11 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
MainError:	02:24:58 PM	No files match the supplied pattern.
MainError:	02:24:58 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	02:38:53 AM	No files match the supplied pattern.
MainError:	02:38:53 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3988, iMonCtr=1
Model crash detected, will try to restart...
MainError:	02:00:23 AM	No files match the supplied pattern.
MainError:	02:00:23 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	01:42:22 PM	No files match the supplied pattern.
MainError:	01:42:22 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	12:39:56 AM	No files match the supplied pattern.
MainError:	12:39:56 AM	No files match the supplied pattern.
Error converting file to netcdf: dataout/o3h8ka.ph11c10
Error converting file to netcdf: dataout/o3h8ka.pg11c10
Error converting file to netcdf: dataout/o3h8ka.pe11c10
MainError:	11:39:33 AM	No files match the supplied pattern.
MainError:	11:39:33 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
19 May 2013 11:43:11  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  777,600   1,136,107       1.4610
19 May 2013 00:44:16  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  751,680   1,100,192       1.4636
18 May 2013 13:45:32  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  725,760   1,063,497       1.4654
15 May 2013 02:03:12  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  699,840   1,028,040       1.4690
14 May 2013 02:41:23  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  673,920   989,688         1.4686
13 May 2013 14:27:10  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  648,000   950,345         1.4666
12 May 2013 22:03:27  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  622,080   916,585         1.4734
12 May 2013 09:36:20  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  596,160   875,855         1.4692
11 May 2013 22:02:04  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  570,240   839,398         1.4720
11 May 2013 00:38:55  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  544,320   800,269         1.4702
10 May 2013 05:25:12  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  518,400   760,392         1.4668
09 May 2013 18:43:50  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  492,480   724,677         1.4715
08 May 2013 20:56:59  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  466,560   686,661         1.4718
08 May 2013 00:06:37  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  440,640   644,745         1.4632
06 May 2013 22:28:41  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  414,720   599,802         1.4463
05 May 2013 23:10:42  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  388,800   558,744         1.4371
05 May 2013 10:10:15  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  362,880   514,975         1.4191
04 May 2013 10:06:24  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  336,960   475,715         1.4118
03 May 2013 21:53:07  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  311,040   435,729         1.4009
03 May 2013 08:13:25  1209810  15752591   hadcm3n_o3h8_2140_40_008269745_4  285,120   393,965         1.3818
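
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The page does not state this, so treat it as an assumption. The minimal sketch below checks that reading against three rows copied from the table. The function names are hypothetical and are not part of BOINC or the CPDN software.

```python
def parse_count(text: str) -> int:
    """Turn a comma-grouped figure such as '1,136,107' into an int."""
    return int(text.replace(",", ""))


def average_sec_per_timestep(cpu_time_sec: str, timestep: str) -> float:
    """Assumed derivation: cumulative CPU seconds / timesteps, 4 decimals."""
    return round(parse_count(cpu_time_sec) / parse_count(timestep), 4)


# (Timestep, CPU Time (sec), listed Average) taken from the table above.
rows = [
    ("777,600", "1,136,107", 1.4610),
    ("751,680", "1,100,192", 1.4636),
    ("285,120", "393,965", 1.3818),
]

for timestep, cpu_time, listed in rows:
    computed = average_sec_per_timestep(cpu_time, timestep)
    status = "matches" if abs(computed - listed) < 1e-9 else "differs"
    print(f"{timestep:>8} TS: computed {computed:.4f}, listed {listed:.4f} ({status})")
```

Under this reading the average rose from about 1.38 to about 1.46 seconds per timestep over the run, so later timesteps cost slightly more CPU time than earlier ones on this host.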