Task 15278143

Name: hadcm3n_zaj7_1880_40_008199956_1
Workunit: 8355080
Created: 13 Sep 2012, 3:56:42 UTC
Sent: 13 Sep 2012, 4:10:18 UTC
Report deadline: 13 Dec 2012, 11:37:29 UTC
Received: 5 Oct 2012, 5:42:58 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 22 (0x00000016) Unknown error code
Computer ID: 1229791
Run time: 20 days 10 hours 44 min 17 sec
CPU time: 20 days 3 hours 12 min 59 sec
Validate state: Invalid
Credit: 12,441.60
Device peak FLOPS: 1.44 GFLOPS
Application version: UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu
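
The exit status is shown here as 22 (0x00000016), and the stderr message reports the same code as "22 (0x16, -234)". The sketch below shows one way the three numbers relate. The function name `describe_exit` is hypothetical, and reading the third number as the code OR-ed with all higher bits set is an assumption that happens to reproduce -234, not a confirmed description of the client's logic.

```python
def describe_exit(code: int) -> str:
    """Render an exit code in decimal, hex, and a sign-extended form."""
    # Assumption: the third number is the code with every bit above the
    # low byte set, i.e. (-1 << 8) | code. For 22 this is -256 + 22.
    sign_extended = (-1 << 8) | code
    return f"process exited with code {code} (0x{code:x}, {sign_extended})"


def padded_hex(code: int) -> str:
    """Zero-padded 32-bit hex form, as used in the Exit status field."""
    return f"0x{code:08x}"


print(describe_exit(22))  # process exited with code 22 (0x16, -234)
print(padded_hex(22))     # 0x00000016
```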
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:44:59 (1907): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:45:05 (1907): No heartbeat from core client for 30 sec - exiting
21:45:06 (1907): No heartbeat from core client for 30 sec - exiting
21:45:07 (1907): No heartbeat from core client for 30 sec - exiting
21:45:08 (1907): No heartbeat from core client for 30 sec - exiting
21:45:09 (1907): No heartbeat from core client for 30 sec - exiting
21:45:10 (1907): No heartbeat from core client for 30 sec - exiting
21:45:13 (1907): No heartbeat from core client for 30 sec - exiting
21:45:14 (1907): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:57:23 (1896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:41:22 (18492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:43:10 (28651): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:46:03 (29075): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:46:44 (32741): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:48:17 (32762): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:50:32 (311): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:53:12 (326): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:55:50 (348): No heartbeat from core client for 30 sec - exiting
00:55:51 (348): No heartbeat from core client for 30 sec - exiting
00:55:52 (348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 62 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Error: Input file: dataout/zaj7ko.pjc0c10 is not a valid UM file.
Error converting file to netcdf: dataout/zaj7ko.pjc0c10
Error: Input file: dataout/zaj7ko.pic0c10 is not a valid UM file.
Error converting file to netcdf: dataout/zaj7ko.pic0c10
Error: Input file: dataout/zaj7ko.pfc0c10 is not a valid UM file.
Error converting file to netcdf: dataout/zaj7ko.pfc0c10
Error: Input file: dataout/zaj7ko.pcc0c10 is not a valid UM file.
Error converting file to netcdf: dataout/zaj7ko.pcc0c10
Error: Input file: dataout/zaj7ko.pbc0c10 is not a valid UM file.
Error converting file to netcdf: dataout/zaj7ko.pbc0c10
Error: Input file: dataout/zaj7ko.pac0c10 is not a valid UM file.
Error converting file to netcdf: dataout/zaj7ko.pac0c10
Error: Input file: dataout/zaj7ka.phc0c10 is not a valid UM file.
Error converting file to netcdf: dataout/zaj7ka.phc0c10
Error: Input file: dataout/zaj7ka.pgc0c10 is not a valid UM file.
Error converting file to netcdf: dataout/zaj7ka.pgc0c10
Error: Input file: dataout/zaj7ka.pec0c10 is not a valid UM file.
Error converting file to netcdf: dataout/zaj7ka.pec0c10
Error: Input file: dataout/zaj7ka.pdc0c10 is not a valid UM file.
Error converting file to netcdf: dataout/zaj7ka.pdc0c10
00:57:56 (366): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
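
The stderr text above consists of a small number of message types repeated many times. As a reading aid, the sketch below tallies lines by type. The labels in `PATTERNS`, the function name `tally`, and the short `sample` string are illustrative assumptions and are not part of any BOINC or CPDN tooling; the matched substrings are copied from the log.

```python
import re
from collections import Counter

# Hypothetical labels mapped to substrings that appear in the stderr text.
PATTERNS = {
    "suspend": re.compile(r"Suspend request from BOINC"),
    "quit": re.compile(r"Quit request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "buffin_read_failed": re.compile(r"BUFFIN: Read Failed"),
    "invalid_um_file": re.compile(r"is not a valid UM file"),
    "restart_open_failed": re.compile(r"cannot open input file"),
    "model_crash": re.compile(r"Model crashed"),
}


def tally(stderr_text: str) -> Counter:
    """Count lines per message type; each line is counted at most once."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
    return counts


# A few lines copied from the log, used only to exercise the function.
sample = """Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:44:59 (1907): No heartbeat from core client for 30 sec - exiting
BUFFIN: Read Failed: No such file or directory
Error: Input file: dataout/zaj7ko.pjc0c10 is not a valid UM file.
"""

for label, count in tally(sample).items():
    print(label, count)
```

Running the same function over the full `<stderr_txt>` body would give the per-type totals for this task.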
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                        Timestep  CPU Time (sec)  Average (sec/TS)
05 Oct 2012 05:46:51  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1  1,036,800       1,739,612            1.6779
04 Oct 2012 14:29:25  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1  1,010,880       1,694,001            1.6758
04 Oct 2012 02:01:38  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    984,960       1,648,308            1.6735
03 Oct 2012 12:27:12  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    959,040       1,606,604            1.6752
03 Oct 2012 00:49:53  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    933,120       1,565,823            1.6781
02 Oct 2012 13:38:06  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    907,200       1,525,626            1.6817
01 Oct 2012 23:49:39  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    881,280       1,484,671            1.6847
01 Oct 2012 12:06:05  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    855,360       1,442,750            1.6867
30 Sep 2012 23:58:31  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    829,440       1,400,819            1.6889
30 Sep 2012 12:22:14  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    803,520       1,357,319            1.6892
29 Sep 2012 23:24:37  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    777,600       1,313,310            1.6889
29 Sep 2012 10:51:15  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    751,680       1,268,602            1.6877
28 Sep 2012 22:38:36  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    725,760       1,224,418            1.6871
28 Sep 2012 09:26:28  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    699,840       1,179,849            1.6859
27 Sep 2012 20:12:24  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    673,920       1,135,993            1.6856
27 Sep 2012 07:13:47  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    648,000       1,091,885            1.6850
26 Sep 2012 19:44:04  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    622,080       1,048,878            1.6861
26 Sep 2012 06:32:09  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    596,160       1,004,411            1.6848
25 Sep 2012 17:55:32  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    570,240         959,541            1.6827
25 Sep 2012 05:22:15  1229791  15278143   hadcm3n_zaj7_1880_40_008199956_1    544,320         915,466            1.6819
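
The last column of the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against the three most recent rows. The function name `average_sec_per_ts` and the `rows` list are hypothetical; the numbers are copied from the table.

```python
def average_sec_per_ts(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, reported average) from the table above.
rows = [
    (1_036_800, 1_739_612, 1.6779),
    (1_010_880, 1_694_001, 1.6758),
    (984_960, 1_648_308, 1.6735),
]

for timestep, cpu_time, reported in rows:
    computed = average_sec_per_ts(cpu_time, timestep)
    print(timestep, computed, computed == reported)

# Consecutive trickles in these rows are 25,920 timesteps apart.
print(rows[0][0] - rows[1][0])  # 25920
```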

