Name | hadcm3n_o1a3_2060_40_008243811_1 |
Workunit | 8398935 |
Created | 31 Oct 2012, 0:25:24 UTC |
Sent | 31 Oct 2012, 0:25:48 UTC |
Report deadline | 30 Jan 2013, 7:52:59 UTC |
Received | 17 Nov 2012, 5:37:43 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1166508 |
Run time | 12 days 12 hours 15 min 24 sec |
CPU time | 11 days 10 hours 27 min 44 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 2.38 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.0.31</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> 23:37:08 (17094): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:40:28 (74165): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:44:10 (74281): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:48:02 (74352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:51:50 (74483): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:54:57 (74557): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:58:11 (74664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:58:12 (74664): No heartbeat from core client for 30 sec - exiting 23:58:13 (74664): No heartbeat from core client for 30 sec - exiting 23:58:14 (74664): No heartbeat from core client for 30 sec - exiting 23:58:15 (74664): No heartbeat from core client for 30 sec - exiting 00:01:25 (74730): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:05:09 (74841): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:08:24 (74914): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x401fa04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x781fe04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x783b404: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x783b400: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x781fe04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x1837604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x3820004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.6.8 build 10K549 hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x301f604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(384,0xa084c540) malloc: *** error for object 0x301f600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug Sat Nov 17 06:12:13 2012 Thread 0 Crashed: atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin. 
0 libSystem.B.dylib 0x95ec3b03 small_free_list_remove_ptr + 234 1 libSystem.B.dylib 0x95ec05cc szone_free_definite_size + 3457 2 libSystem.B.dylib 0x95ebf5e8 free + 244 3 hadcm3n_6.07_i686-apple-darwin 0x0000ba58 annual_cycle(std::vector<std::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::basic_string<char, std::char_traits<char>, std::allocator<char> > > > const*, char const*, int, int) + 3482 4 hadcm3n_6.07_i686-apple-darwin 0x0000d36b decadalMeans(int, char const*) + 957 5 hadcm3n_6.07_i686-apple-darwin 0x000067ff doCM3Proc() + 185 6 hadcm3n_6.07_i686-apple-darwin 0x0000791c mainLoop() + 410 7 hadcm3n_6.07_i686-apple-darwin 0x000087c7 worker() + 2989 8 hadcm3n_6.07_i686-apple-darwin 0x00008aa9 main + 491 9 hadcm3n_6.07_i686-apple-darwin 0x00002676 start + 54 Thread 1: 0 libSystem.B.dylib 0x95eb8c0e mach_wait_until + 10 1 libSystem.B.dylib 0x95f40429 nanosleep + 345 2 libSystem.B.dylib 0x95f402ca usleep + 61 3 hadcm3n_6.07_i686-apple-darwin 0x00071a7c boinc_sleep(double) + 188 4 hadcm3n_6.07_i686-apple-darwin 0x00067282 timer_thread(void*) + 78 5 libSystem.B.dylib 0x95ee6259 _pthread_start + 345 6 libSystem.B.dylib 0x95ee60de thread_start + 34 Thread 0 crashed with X86 Thread State (32-bit): eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000 edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8f698 esp: 0x00000000 ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e cs: 0x00000000 ds: 0x00000000 es: 0x00000000 fs: 0x00000000 gs: 0x00000000 Binary Images Description: 0x1000 - 0x93fff /Library/Application Support/BOINC Data/slots/13/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin 0x95eb8000 - 0x9605ffff /usr/lib/libSystem.B.dylib 0x97633000 - 0x97641fff /usr/lib/libz.1.dylib 0x99b6a000 - 0x99b6dfff /usr/lib/system/libmathCommon.A.dylib 0x9a312000 - 0x9a37cfff /usr/lib/libstdc++.6.dylib Exiting... </stderr_txt> ]]> |
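The exit status above is shown in three forms: 193, 0xC1, and -63. A minimal sketch of how they relate, assuming the -63 is the same low byte read as a signed 8-bit value (the helper name `exit_code_views` is hypothetical and not part of BOINC):

```python
def exit_code_views(code: int) -> tuple[int, str, int]:
    """Return the unsigned byte, its hex form, and its signed 8-bit reading."""
    byte = code & 0xFF                             # keep only the low 8 bits
    signed = byte - 256 if byte >= 128 else byte   # two's-complement view (assumption)
    return byte, f"0x{byte:02x}", signed


print(exit_code_views(193))  # (193, '0xc1', -63)
```

This only reproduces the numbers in the message line; the stack trace in the stderr output, not the exit code, is what points to where the crash occurred.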
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
17 Nov 2012 05:39:56 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 518,400 | 988,058 | 1.9060 |
16 Nov 2012 15:51:42 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 492,480 | 941,080 | 1.9109 |
16 Nov 2012 01:14:49 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 466,560 | 894,911 | 1.9181 |
15 Nov 2012 09:29:28 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 440,640 | 848,405 | 1.9254 |
14 Nov 2012 18:58:11 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 414,720 | 799,229 | 1.9272 |
14 Nov 2012 02:38:20 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 388,800 | 752,039 | 1.9343 |
13 Nov 2012 12:23:53 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 362,880 | 704,892 | 1.9425 |
12 Nov 2012 22:06:39 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 336,960 | 657,806 | 1.9522 |
08 Nov 2012 00:28:20 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 311,040 | 609,834 | 1.9606 |
07 Nov 2012 05:32:16 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 285,120 | 560,602 | 1.9662 |
06 Nov 2012 14:06:22 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 259,200 | 511,330 | 1.9727 |
05 Nov 2012 23:39:22 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 233,280 | 459,346 | 1.9691 |
05 Nov 2012 08:18:03 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 207,360 | 408,115 | 1.9681 |
04 Nov 2012 17:06:46 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 181,440 | 357,866 | 1.9724 |
04 Nov 2012 01:49:39 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 155,520 | 307,719 | 1.9786 |
03 Nov 2012 10:45:01 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 129,600 | 257,454 | 1.9865 |
02 Nov 2012 19:41:48 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 103,680 | 207,210 | 1.9986 |
02 Nov 2012 04:34:23 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 77,760 | 156,948 | 2.0184 |
01 Nov 2012 12:37:07 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 51,840 | 106,472 | 2.0539 |
31 Oct 2012 19:09:52 | 1166508 | 15422863 | hadcm3n_o1a3_2060_40_008243811_1 | 25,920 | 52,895 | 2.0407 |
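
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count; the rows above are consistent with that reading. A small sketch under that assumption (the function name `avg_sec_per_timestep` is hypothetical), using the newest and oldest trickles from the table:

```python
def avg_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places as in the table."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) copied from the first and last table rows
rows = [(518_400, 988_058), (25_920, 52_895)]
for timestep, cpu_time in rows:
    print(timestep, avg_sec_per_timestep(cpu_time, timestep))
```

Because both inputs are cumulative, the figure is a running average over the whole task rather than the rate for the most recent 25,920 timesteps.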