Name | hadcm3n_n05o_1880_40_008410411_2 |
Workunit | 8561267 |
Created | 5 Sep 2013, 18:31:18 UTC |
Sent | 5 Sep 2013, 18:48:11 UTC |
Report deadline | 6 Dec 2013, 2:15:22 UTC |
Received | 18 Dec 2013, 11:09:36 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1168753 |
Run time | 25 days 5 hours 52 min 56 sec |
CPU time | 21 days 12 hours 39 min 47 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 3.15 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.0.65</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:50:11 (14895): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x681f604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x701f604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x703ac04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x703ac00: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2828a04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2828a04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2844004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2844000: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x501fa04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x501fa00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x483a604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x601b604: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x601b600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x701f604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x701f600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802204: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x901b604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1028604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x4020604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x4020600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1001c00: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x804e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug System version: Macintosh OS 10.6.8 build 10K549 Mon Oct 21 04:18:41 2013 Thread 0 Crashed: atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin for architecture i386. 
0 libSystem.B.dylib 0x9a300b03 small_free_list_remove_ptr + 234 1 libSystem.B.dylib 0x9a2fd5cc szone_free_definite_size + 3457 2 libSystem.B.dylib 0x9a2fc5e8 free + 244 3 hadcm3n_6.07_i686-apple-darwin 0x0000ba58 annual_cycle(std::vector<std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::basic_string<char, std::char_traits<char>, std::allocator<char> > > > const*, char const*, int, int) + 3482 4 hadcm3n_6.07_i686-apple-darwin 0x0000d36b decadalMeans(int, char const*) + 957 5 hadcm3n_6.07_i686-apple-darwin 0x000067ff doCM3Proc() + 185 6 hadcm3n_6.07_i686-apple-darwin 0x0000876a worker() + 2896 7 hadcm3n_6.07_i686-apple-darwin 0x00008aa9 main + 491 8 hadcm3n_6.07_i686-apple-darwin 0x00002676 start + 54 Thread 1: hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x1030604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x701f604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x701f600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c00: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x3801c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x3801c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c00: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file 
/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
21 Oct 2013 03:31:56 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 1,036,800 | 1,860,013 | 1.7940 |
20 Oct 2013 13:22:10 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 1,010,880 | 1,812,994 | 1.7935 |
19 Oct 2013 13:52:58 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 984,960 | 1,766,204 | 1.7932 |
18 Oct 2013 22:06:14 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 959,040 | 1,721,345 | 1.7949 |
18 Oct 2013 05:44:31 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 933,120 | 1,674,890 | 1.7949 |
17 Oct 2013 05:21:39 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 907,200 | 1,628,127 | 1.7947 |
15 Oct 2013 23:14:45 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 881,280 | 1,581,687 | 1.7948 |
15 Oct 2013 08:15:37 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 855,360 | 1,535,794 | 1.7955 |
13 Oct 2013 23:02:07 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 829,440 | 1,488,936 | 1.7951 |
13 Oct 2013 08:29:11 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 803,520 | 1,442,775 | 1.7956 |
12 Oct 2013 17:22:18 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 777,600 | 1,397,830 | 1.7976 |
12 Oct 2013 02:21:55 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 751,680 | 1,350,790 | 1.7970 |
11 Oct 2013 10:00:01 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 725,760 | 1,304,178 | 1.7970 |
10 Oct 2013 15:49:46 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 699,840 | 1,257,261 | 1.7965 |
09 Oct 2013 22:14:41 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 673,920 | 1,209,871 | 1.7953 |
09 Oct 2013 05:31:10 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 648,000 | 1,162,478 | 1.7939 |
08 Oct 2013 15:12:05 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 622,080 | 1,115,588 | 1.7933 |
08 Oct 2013 00:23:49 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 596,160 | 1,069,616 | 1.7942 |
07 Oct 2013 09:35:38 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 570,240 | 1,022,631 | 1.7933 |
06 Oct 2013 09:10:05 | 1168753 | 16004805 | hadcm3n_n05o_1880_40_008410411_2 | 544,320 | 976,059 | 1.7932 |
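
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against three rows copied from the table. The function name `average_sec_per_ts` and the four-decimal rounding are assumptions made for illustration; they are not taken from the CPDN server code.

```python
# Sketch: reproduce the "Average (sec/TS)" column from the trickle table.
# Assumption: average = cumulative CPU seconds / cumulative timesteps,
# rounded to four decimals for display.

def average_sec_per_ts(cpu_time_sec: int, timestep: int) -> float:
    """Return CPU seconds spent per model timestep (hypothetical helper)."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)

# (time sent, timestep, CPU time in seconds, reported average) from the table.
ROWS = [
    ("21 Oct 2013 03:31:56", 1_036_800, 1_860_013, 1.7940),
    ("20 Oct 2013 13:22:10", 1_010_880, 1_812_994, 1.7935),
    ("06 Oct 2013 09:10:05", 544_320, 976_059, 1.7932),
]

def check_rows(rows=ROWS, tolerance=5e-5):
    """Return True if every computed average matches the reported one."""
    return all(
        abs(average_sec_per_ts(cpu, ts) - reported) < tolerance
        for _, ts, cpu, reported in rows
    )

if __name__ == "__main__":
    for sent, ts, cpu, reported in ROWS:
        computed = average_sec_per_ts(cpu, ts)
        print(f"{sent}: computed {computed:.4f}, reported {reported:.4f}")
    print("all rows consistent:", check_rows())
```

On the rows shown, the last trickle (timestep 1,036,800) was sent on 21 Oct 2013 at 03:31:56 UTC, shortly before the crash report in the stderr output, which is timestamped Mon Oct 21 04:18:41 2013.
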
©2024 cpdn.org