|
Name | hadcm3n_zaj7_1880_40_008199956_1 |
Workunit | 8355080 |
Created | 13 Sep 2012, 3:56:42 UTC |
Sent | 13 Sep 2012, 4:10:18 UTC |
Report deadline | 13 Dec 2012, 11:37:29 UTC |
Received | 5 Oct 2012, 5:42:58 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1229791 |
Run time | 20 days 10 hours 44 min 17 sec |
CPU time | 20 days 3 hours 12 min 59 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 1.44 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:44:59 (1907): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:45:05 (1907): No heartbeat from core client for 30 sec - exiting 21:45:06 (1907): No heartbeat from core client for 30 sec - exiting 21:45:07 (1907): No heartbeat from core client for 30 sec - exiting 21:45:08 (1907): No heartbeat from core client for 30 sec - exiting 21:45:09 (1907): No heartbeat from core client for 30 sec - exiting 21:45:10 (1907): No heartbeat from core client for 30 sec - exiting 21:45:13 (1907): No heartbeat from core client for 30 sec - exiting 21:45:14 (1907): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:57:23 (1896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:41:22 (18492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:43:10 (28651): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:46:03 (29075): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:46:44 (32741): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:48:17 (32762): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:50:32 (311): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:53:12 (326): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:55:50 (348): No heartbeat from core client for 30 sec - exiting 00:55:51 (348): No heartbeat from core client for 30 sec - exiting 00:55:52 (348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 62 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Error: Input file: dataout/zaj7ko.pjc0c10 is not a valid UM file. Error converting file to netcdf: dataout/zaj7ko.pjc0c10 Error: Input file: dataout/zaj7ko.pic0c10 is not a valid UM file. Error converting file to netcdf: dataout/zaj7ko.pic0c10 Error: Input file: dataout/zaj7ko.pfc0c10 is not a valid UM file. Error converting file to netcdf: dataout/zaj7ko.pfc0c10 Error: Input file: dataout/zaj7ko.pcc0c10 is not a valid UM file. Error converting file to netcdf: dataout/zaj7ko.pcc0c10 Error: Input file: dataout/zaj7ko.pbc0c10 is not a valid UM file. Error converting file to netcdf: dataout/zaj7ko.pbc0c10 Error: Input file: dataout/zaj7ko.pac0c10 is not a valid UM file. Error converting file to netcdf: dataout/zaj7ko.pac0c10 Error: Input file: dataout/zaj7ka.phc0c10 is not a valid UM file. Error converting file to netcdf: dataout/zaj7ka.phc0c10 Error: Input file: dataout/zaj7ka.pgc0c10 is not a valid UM file. Error converting file to netcdf: dataout/zaj7ka.pgc0c10 Error: Input file: dataout/zaj7ka.pec0c10 is not a valid UM file. Error converting file to netcdf: dataout/zaj7ka.pec0c10 Error: Input file: dataout/zaj7ka.pdc0c10 is not a valid UM file. Error converting file to netcdf: dataout/zaj7ka.pdc0c10 00:57:56 (366): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zaj7_1880_40_008199956/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
05 Oct 2012 05:46:51 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 1,036,800 | 1,739,612 | 1.6779 |
04 Oct 2012 14:29:25 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 1,010,880 | 1,694,001 | 1.6758 |
04 Oct 2012 02:01:38 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 984,960 | 1,648,308 | 1.6735 |
03 Oct 2012 12:27:12 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 959,040 | 1,606,604 | 1.6752 |
03 Oct 2012 00:49:53 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 933,120 | 1,565,823 | 1.6781 |
02 Oct 2012 13:38:06 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 907,200 | 1,525,626 | 1.6817 |
01 Oct 2012 23:49:39 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 881,280 | 1,484,671 | 1.6847 |
01 Oct 2012 12:06:05 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 855,360 | 1,442,750 | 1.6867 |
30 Sep 2012 23:58:31 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 829,440 | 1,400,819 | 1.6889 |
30 Sep 2012 12:22:14 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 803,520 | 1,357,319 | 1.6892 |
29 Sep 2012 23:24:37 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 777,600 | 1,313,310 | 1.6889 |
29 Sep 2012 10:51:15 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 751,680 | 1,268,602 | 1.6877 |
28 Sep 2012 22:38:36 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 725,760 | 1,224,418 | 1.6871 |
28 Sep 2012 09:26:28 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 699,840 | 1,179,849 | 1.6859 |
27 Sep 2012 20:12:24 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 673,920 | 1,135,993 | 1.6856 |
27 Sep 2012 07:13:47 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 648,000 | 1,091,885 | 1.6850 |
26 Sep 2012 19:44:04 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 622,080 | 1,048,878 | 1.6861 |
26 Sep 2012 06:32:09 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 596,160 | 1,004,411 | 1.6848 |
25 Sep 2012 17:55:32 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 570,240 | 959,541 | 1.6827 |
25 Sep 2012 05:22:15 | 1229791 | 15278143 | hadcm3n_zaj7_1880_40_008199956_1 | 544,320 | 915,466 | 1.6819 |
©2024 climateprediction.net