Task 16004805

Name hadcm3n_n05o_1880_40_008410411_2
Workunit 8561267
Created 5 Sep 2013, 18:31:18 UTC
Sent 5 Sep 2013, 18:48:11 UTC
Report deadline 6 Dec 2013, 2:15:22 UTC
Received 18 Dec 2013, 11:09:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1168753
Run time 25 days 5 hours 52 min 56 sec
CPU time 21 days 12 hours 39 min 47 sec
Validate state Invalid
Credit 12,441.60
Device peak FLOPS 3.15 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin
Stderr
<core_client_version>7.0.65</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:50:11 (14895): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x681f604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x701f604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x703ac04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x703ac00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2828a04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2828a04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2844004: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2844000: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x501fa04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x501fa00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x483a604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x601b604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x601b600: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x701f604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x701f600: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(70051,0xa0c2d540) malloc: *** error for object 0x2802204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation
hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x901b604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1028604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x4020604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x4020600: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1001c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1001c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x1001c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation

Crashed executable name: hadcm3n_6.07_i686-apple-darwin
built using BOINC library version 6.13.0
Machine type Intel 80486 (32-bit executable)
hadcm3n_6.07_i686-apple-darwin(3094,0xa0c2d540) malloc: *** error for object 0x804e04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
System version: Macintosh OS 10.6.8 build 10K549
Mon Oct 21 04:18:41 2013

Thread 0 Crashed:
atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin for architecture i386.
0   libSystem.B.dylib                   0x9a300b03 small_free_list_remove_ptr + 234
1   libSystem.B.dylib                   0x9a2fd5cc szone_free_definite_size + 3457
2   libSystem.B.dylib                   0x9a2fc5e8 free + 244
3   hadcm3n_6.07_i686-apple-darwin      0x0000ba58 annual_cycle(std::vector<std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::basic_string<char, std::char_traits<char>, std::allocator<char> > > > const*, char const*, int, int) + 3482
4   hadcm3n_6.07_i686-apple-darwin      0x0000d36b decadalMeans(int, char const*) + 957
5   hadcm3n_6.07_i686-apple-darwin      0x000067ff doCM3Proc() + 185
6   hadcm3n_6.07_i686-apple-darwin      0x0000876a worker() + 2896
7   hadcm3n_6.07_i686-apple-darwin      0x00008aa9 main + 491
8   hadcm3n_6.07_i686-apple-darwin      0x00002676 start + 54

Thread 1:
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x1030604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x701f604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x701f600: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x3801c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x3801c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(90356,0xa0c2d540) malloc: *** error for object 0x4001c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_n05o_1880_40_008410411/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep   CPU Time (sec)  Average (sec/TS)
21 Oct 2013 03:31:56  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  1,036,800  1,860,013       1.7940
20 Oct 2013 13:22:10  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  1,010,880  1,812,994       1.7935
19 Oct 2013 13:52:58  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  984,960    1,766,204       1.7932
18 Oct 2013 22:06:14  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  959,040    1,721,345       1.7949
18 Oct 2013 05:44:31  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  933,120    1,674,890       1.7949
17 Oct 2013 05:21:39  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  907,200    1,628,127       1.7947
15 Oct 2013 23:14:45  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  881,280    1,581,687       1.7948
15 Oct 2013 08:15:37  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  855,360    1,535,794       1.7955
13 Oct 2013 23:02:07  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  829,440    1,488,936       1.7951
13 Oct 2013 08:29:11  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  803,520    1,442,775       1.7956
12 Oct 2013 17:22:18  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  777,600    1,397,830       1.7976
12 Oct 2013 02:21:55  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  751,680    1,350,790       1.7970
11 Oct 2013 10:00:01  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  725,760    1,304,178       1.7970
10 Oct 2013 15:49:46  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  699,840    1,257,261       1.7965
09 Oct 2013 22:14:41  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  673,920    1,209,871       1.7953
09 Oct 2013 05:31:10  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  648,000    1,162,478       1.7939
08 Oct 2013 15:12:05  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  622,080    1,115,588       1.7933
08 Oct 2013 00:23:49  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  596,160    1,069,616       1.7942
07 Oct 2013 09:35:38  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  570,240    1,022,631       1.7933
06 Oct 2013 09:10:05  1168753  16004805   hadcm3n_n05o_1880_40_008410411_2  544,320    976,059         1.7932
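
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count; that reading is an inference from the numbers, not something the page states. A minimal Python sketch for checking it against a few rows follows. The helper names `parse_trickle_row` and `seconds_per_timestep` are hypothetical, chosen only for this illustration.

```python
def parse_trickle_row(row):
    """Return (timestep, cpu_seconds, reported_average) from one trickle row.

    Assumes the last three whitespace-separated fields are
    Timestep, CPU Time (sec) and Average (sec/TS), as in the table above.
    """
    fields = row.split()
    timestep = int(fields[-3].replace(",", ""))      # e.g. "1,036,800" -> 1036800
    cpu_seconds = int(fields[-2].replace(",", ""))   # e.g. "1,860,013" -> 1860013
    reported_average = float(fields[-1])
    return timestep, cpu_seconds, reported_average


def seconds_per_timestep(cpu_seconds, timestep):
    """Assumed definition of the Average column: CPU seconds per model timestep."""
    return cpu_seconds / timestep


# Three rows copied from the trickle table.
rows = [
    "21 Oct 2013 03:31:56 1168753 16004805 hadcm3n_n05o_1880_40_008410411_2 1,036,800 1,860,013 1.7940",
    "20 Oct 2013 13:22:10 1168753 16004805 hadcm3n_n05o_1880_40_008410411_2 1,010,880 1,812,994 1.7935",
    "06 Oct 2013 09:10:05 1168753 16004805 hadcm3n_n05o_1880_40_008410411_2 544,320 976,059 1.7932",
]

for row in rows:
    timestep, cpu_seconds, reported = parse_trickle_row(row)
    computed = seconds_per_timestep(cpu_seconds, timestep)
    # The table reports four decimal places, so allow half a unit in the last place.
    matches = abs(computed - reported) < 5e-5
    print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f} match={matches}")
```

The successive trickles are 25,920 timesteps apart, so the same helper can be applied to differences between rows to estimate the per-interval rate rather than the cumulative one.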

