Task 13029058

Name hadcm3n_o6hp_1940_40_007266595_2
Workunit 7464835
Created 29 Jun 2011, 8:20:47 UTC
Sent 29 Jun 2011, 8:34:39 UTC
Report deadline 28 Sep 2011, 16:01:50 UTC
Received 16 Aug 2011, 7:41:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1129441
Run time 15 days 9 hours 37 min 20 sec
CPU time 14 days 10 hours 33 min 33 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.59 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-apple-darwin
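
The exit status appears on this page in three forms: 193, 0x000000C1, and the -63 shown in the stderr message. These are the same single-byte value read as unsigned decimal, as hex, and as a signed 8-bit integer. A minimal sketch of that conversion follows; the helper name `describe_exit_code` is hypothetical and is not part of BOINC.

```python
def describe_exit_code(code: int) -> str:
    """Render a 0..255 process exit code as 'unsigned (hex, signed-8-bit)'.

    Hypothetical helper for illustration; it only mirrors the formatting
    seen in the stderr message on this page.
    """
    if not 0 <= code <= 255:
        raise ValueError("exit code must fit in one byte")
    # Two's-complement reinterpretation of a single byte.
    signed = code - 256 if code > 127 else code
    return f"{code} (0x{code:02x}, {signed})"


if __name__ == "__main__":
    print("process exited with code", describe_exit_code(193))
```

Run as a script, this prints `process exited with code 193 (0xc1, -63)`, matching the message in the stderr section.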
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:18:14 (56040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
hadcm3n_6.07_i686-apple-darwin(37625,0xa0506540) malloc: *** error for object 0x2009c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(37625,0xa0506540) malloc: *** error for object 0x2009c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(37625,0xa0506540) malloc: *** error for object 0x6001c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(37625,0xa0506540) malloc: *** error for object 0x6001c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(37625,0xa0506540) malloc: *** error for object 0x6001c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(37625,0xa0506540) malloc: *** error for object 0x6001c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(37625,0xa0506540) malloc: *** error for object 0x6001c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation

Crashed executable name: hadcm3n_6.07_i686-apple-darwin
built using BOINC library version 6.13.0
Machine type Intel 80486 (32-bit executable)
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=37634, selfPID=37634, iMonCtr=1
hadcm3n_6.07_i686-apple-darwin(20588,0xa0506540) malloc: *** error for object 0x801c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(20588,0xa0506540) malloc: *** error for object 0x801c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(20588,0xa0506540) malloc: *** error for object 0x801c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation
hadcm3n_6.07_i686-apple-darwin(363,0xa0abc540) malloc: *** error for object 0x881d200: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation

Crashed executable name: hadcm3n_6.07_i686-apple-darwin
built using BOINC library version 6.13.0
Machine type Intel 80486 (32-bit executable)
hadcm3n_6.07_i686-apple-darwin(363,0xa0abc540) malloc: *** error for object 0x100d204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
System version: Macintosh OS 10.6.8 build 10K549
hadcm3n_6.07_i686-apple-darwin(363,0xa0abc540) malloc: *** error for object 0x100e204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
Tue Aug 16 08:16:08 2011

hadcm3n_6.07_i686-apple-darwin(363,0xa0abc540) malloc: *** error for object 0x1010204: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(363,0xa0abc540) malloc: *** error for object 0x1010c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(363,0xa0abc540) malloc: *** error for object 0x1010c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
Thread 0 Crashed:
atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin.
0   libSystem.B.dylib                   0x98939b0f small_free_list_remove_ptr + 246
1   libSystem.B.dylib                   0x989365cc szone_free_definite_size + 3457
2   libSystem.B.dylib                   0x989355e8 free + 244
3   hadcm3n_6.07_i686-apple-darwin      0x0000ba58 annual_cycle(std::vector<std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::basic_string<char, std::char_traits<char>, std::allocator<char> > > > const*, char const*, int, int) + 3482
4   hadcm3n_6.07_i686-apple-darwin      0x0000d36b decadalMeans(int, char const*) + 957
5   hadcm3n_6.07_i686-apple-darwin      0x000067ff doCM3Proc() + 185
6   hadcm3n_6.07_i686-apple-darwin      0x0000876a worker() + 2896
7   hadcm3n_6.07_i686-apple-darwin      0x00008aa9 main + 491
8   hadcm3n_6.07_i686-apple-darwin      0x00002676 start + 54

Thread 1:
0   libSystem.B.dylib                   0x9892ec0e mach_wait_until + 10
1   libSystem.B.dylib                   0x989b6429 nanosleep + 345
2   libSystem.B.dylib                   0x989b62ca usleep + 61
3   hadcm3n_6.07_i686-apple-darwin      0x00071a7c boinc_sleep(double) + 188
4   hadcm3n_6.07_i686-apple-darwin      0x00067282 timer_thread(void*) + 78
5   libSystem.B.dylib                   0x9895c259 _pthread_start + 345
6   libSystem.B.dylib                   0x9895c0de thread_start + 34

Thread 0 crashed with X86 Thread State (32-bit):
  eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000
  edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8f978 esp: 0x00000000
   ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e  cs: 0x00000000
   ds: 0x00000000  es: 0x00000000  fs: 0x00000000  gs: 0x00000000

Binary Images Description:
    0x1000 -    0x93fff /Library/Application Support/BOINC Data/slots/20/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin
0x91b03000 - 0x91b6dfff /usr/lib/libstdc++.6.dylib
0x91b6e000 - 0x91b7cfff /usr/lib/libz.1.dylib
0x955a8000 - 0x955abfff /usr/lib/system/libmathCommon.A.dylib
0x9892e000 - 0x98ad5fff /usr/lib/libSystem.B.dylib


Exiting...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
08 Aug 2011 17:29:32  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   777,600       1,227,863            1.5790
06 Aug 2011 05:28:32  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   751,680       1,195,188            1.5900
04 Aug 2011 12:50:31  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   725,760       1,162,582            1.6019
02 Aug 2011 15:49:55  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   699,840       1,125,311            1.6080
02 Aug 2011 05:46:11  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   673,920       1,088,746            1.6155
01 Aug 2011 14:36:36  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   648,000       1,042,587            1.6089
01 Aug 2011 00:51:20  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   622,080         998,173            1.6046
31 Jul 2011 09:40:53  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   596,160         950,454            1.5943
30 Jul 2011 18:16:33  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   570,240         901,080            1.5802
26 Jul 2011 16:31:07  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   544,320         853,438            1.5679
25 Jul 2011 22:53:55  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   518,400         814,091            1.5704
25 Jul 2011 22:25:06  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   492,480         781,568            1.5870
25 Jul 2011 22:05:42  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   466,560         748,966            1.6053
25 Jul 2011 21:44:46  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   440,640         716,474            1.6260
25 Jul 2011 20:49:30  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   414,720         668,136            1.6111
25 Jul 2011 20:20:58  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   388,800         618,550            1.5909
25 Jul 2011 19:08:09  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   362,880         574,978            1.5845
25 Jul 2011 19:08:09  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   336,960         535,948            1.5905
25 Jul 2011 18:52:40  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   311,040         492,443            1.5832
25 Jul 2011 17:28:18  1129441  13029058   hadcm3n_o6hp_1940_40_007266595_2   285,120         449,113            1.5752
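
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. That relationship is inferred from the figures in the table, not stated by the page. A minimal sketch that recomputes it for three of the rows above; the function name `seconds_per_timestep` is hypothetical.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by model timesteps completed.

    Assumption: this is how the 'Average (sec/TS)' column is derived;
    the page itself does not document the formula.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, cumulative CPU seconds) copied from the trickle table.
rows = [
    (777_600, 1_227_863),
    (751_680, 1_195_188),
    (285_120, 449_113),
]

if __name__ == "__main__":
    for timestep, cpu_sec in rows:
        print(f"{timestep:>9,} {cpu_sec:>11,} "
              f"{seconds_per_timestep(cpu_sec, timestep):.4f}")
```

For these three rows the recomputed values are 1.5790, 1.5900 and 1.5752, which agree with the table.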