climateprediction.net home page
Task 15422863

Task 15422863

Name hadcm3n_o1a3_2060_40_008243811_1
Workunit 8398935
Created 31 Oct 2012, 0:25:24 UTC
Sent 31 Oct 2012, 0:25:48 UTC
Report deadline 30 Jan 2013, 7:52:59 UTC
Received 17 Nov 2012, 5:37:43 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1166508
Run time 12 days 12 hours 15 min 24 sec
CPU time 11 days 10 hours 27 min 44 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.38 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-apple-darwin
Stderr
<core_client_version>7.0.31</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
23:37:08 (17094): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:40:28 (74165): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:44:10 (74281): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:48:02 (74352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:51:50 (74483): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:54:57 (74557): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:58:11 (74664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:58:12 (74664): No heartbeat from core client for 30 sec - exiting
23:58:13 (74664): No heartbeat from core client for 30 sec - exiting
23:58:14 (74664): No heartbeat from core client for 30 sec - exiting
23:58:15 (74664): No heartbeat from core client for 30 sec - exiting
00:01:25 (74730): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:05:09 (74841): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:08:24 (74914): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x401fa04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x781fe04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x783b404: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x783b400: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x781fe04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x1837604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGSEGV: segmentation violation
hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x3820004: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug

Crashed executable name: hadcm3n_6.07_i686-apple-darwin
built using BOINC library version 6.13.0
Machine type Intel 80486 (32-bit executable)
System version: Macintosh OS 10.6.8 build 10K549
hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x301f604: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x301f600: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
Sat Nov 17 06:12:13 2012

Thread 0 Crashed:
atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin.
0   libSystem.B.dylib                   0x95ec3b03 small_free_list_remove_ptr + 234
1   libSystem.B.dylib                   0x95ec05cc szone_free_definite_size + 3457
2   libSystem.B.dylib                   0x95ebf5e8 free + 244
3   hadcm3n_6.07_i686-apple-darwin      0x0000ba58 annual_cycle(std::vector<std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::basic_string<char, std::char_traits<char>, std::allocator<char> > > > const*, char const*, int, int) + 3482
4   hadcm3n_6.07_i686-apple-darwin      0x0000d36b decadalMeans(int, char const*) + 957
5   hadcm3n_6.07_i686-apple-darwin      0x000067ff doCM3Proc() + 185
6   hadcm3n_6.07_i686-apple-darwin      0x0000791c mainLoop() + 410
7   hadcm3n_6.07_i686-apple-darwin      0x000087c7 worker() + 2989
8   hadcm3n_6.07_i686-apple-darwin      0x00008aa9 main + 491
9   hadcm3n_6.07_i686-apple-darwin      0x00002676 start + 54

Thread 1:
0   libSystem.B.dylib                   0x95eb8c0e mach_wait_until + 10
1   libSystem.B.dylib                   0x95f40429 nanosleep + 345
2   libSystem.B.dylib                   0x95f402ca usleep + 61
3   hadcm3n_6.07_i686-apple-darwin      0x00071a7c boinc_sleep(double) + 188
4   hadcm3n_6.07_i686-apple-darwin      0x00067282 timer_thread(void*) + 78
5   libSystem.B.dylib                   0x95ee6259 _pthread_start + 345
6   libSystem.B.dylib                   0x95ee60de thread_start + 34

Thread 0 crashed with X86 Thread State (32-bit):
  eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000
  edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8f698 esp: 0x00000000
   ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e  cs: 0x00000000
   ds: 0x00000000  es: 0x00000000  fs: 0x00000000  gs: 0x00000000

Binary Images Description:
    0x1000 -    0x93fff /Library/Application Support/BOINC Data/slots/13/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin
0x95eb8000 - 0x9605ffff /usr/lib/libSystem.B.dylib
0x97633000 - 0x97641fff /usr/lib/libz.1.dylib
0x99b6a000 - 0x99b6dfff /usr/lib/system/libmathCommon.A.dylib
0x9a312000 - 0x9a37cfff /usr/lib/libstdc++.6.dylib


Exiting...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Nov 2012 05:39:56 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 518,400 988,058 1.9060
16 Nov 2012 15:51:42 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 492,480 941,080 1.9109
16 Nov 2012 01:14:49 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 466,560 894,911 1.9181
15 Nov 2012 09:29:28 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 440,640 848,405 1.9254
14 Nov 2012 18:58:11 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 414,720 799,229 1.9272
14 Nov 2012 02:38:20 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 388,800 752,039 1.9343
13 Nov 2012 12:23:53 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 362,880 704,892 1.9425
12 Nov 2012 22:06:39 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 336,960 657,806 1.9522
08 Nov 2012 00:28:20 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 311,040 609,834 1.9606
07 Nov 2012 05:32:16 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 285,120 560,602 1.9662
06 Nov 2012 14:06:22 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 259,200 511,330 1.9727
05 Nov 2012 23:39:22 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 233,280 459,346 1.9691
05 Nov 2012 08:18:03 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 207,360 408,115 1.9681
04 Nov 2012 17:06:46 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 181,440 357,866 1.9724
04 Nov 2012 01:49:39 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 155,520 307,719 1.9786
03 Nov 2012 10:45:01 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 129,600 257,454 1.9865
02 Nov 2012 19:41:48 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 103,680 207,210 1.9986
02 Nov 2012 04:34:23 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 77,760 156,948 2.0184
01 Nov 2012 12:37:07 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 51,840 106,472 2.0539
31 Oct 2012 19:09:52 1166508 15422863 hadcm3n_o1a3_2060_40_008243811_1 25,920 52,895 2.0407


©2024 cpdn.org