Name | hadcm3n_odg2_1900_40_008472549_0 |
Workunit | 8623388 |
Created | 27 Sep 2013, 10:14:13 UTC |
Sent | 29 Sep 2013, 21:32:09 UTC |
Report deadline | 30 Dec 2013, 4:59:20 UTC |
Received | 13 Oct 2013, 21:09:20 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1172598 |
Run time | 8 days 1 hours 42 min 53 sec |
CPU time | 5 days 21 hours 34 min 9 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 3.14 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:09:51 (66925): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:02:07 (87487): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:02:08 (87487): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:26:16 (93424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:38:32 (93747): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:12:05 (94952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:02:34 (96255): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:37:08 (11234): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:09:11 (20753): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:18:23 (23885): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:11:51 (26911): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 00:13:10 (28174): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:08:24 (33529): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:08:25 (33529): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x7001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x7001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x7001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x7001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x7001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x7001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x7001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x7001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x7001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x7001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x3800e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x3800e00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(68988,0xa09ad540) malloc: *** error for object 0x4000e00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation hadcm3n_6.07_i686-apple-darwin(70921,0xa09ad540) malloc: *** error for object 0x82de04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70921,0xa09ad540) malloc: *** error for object 0x82de00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70921,0xa09ad540) malloc: *** error for object 0x82de04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70921,0xa09ad540) malloc: *** error for object 0x82de00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70921,0xa09ad540) malloc: *** error for object 0x82de04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70921,0xa09ad540) malloc: *** error for object 0x82de00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70921,0xa09ad540) malloc: *** error for object 0x82de04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70921,0xa09ad540) malloc: *** error for object 0x82de00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(70921,0xa09ad540) malloc: *** error for object 0x82de04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.6.8 build 10K549 Sun Oct 13 16:47:02 2013 Thread 0 Crashed: 0 libSystem.B.dylib 0x97e84b03 _small_free_list_remove_ptr + 234 1 libSystem.B.dylib 0x97e815cc _szone_free_definite_size + 3457 2 libSystem.B.dylib 0x97e805e8 _free + 244 3 hadcm3n_6.07_i686-apple-darwin 0x0000ba58 __Z12annual_cyclePKSt6vectorISsSaISsEEPKcii + 3482 4 hadcm3n_6.07_i686-apple-darwin 0x0000d36b __Z12decadalMeansiPKc + 957 5 hadcm3n_6.07_i686-apple-darwin 0x000067ff __Z9doCM3Procv + 185 6 hadcm3n_6.07_i686-apple-darwin 0x0000876a __Z6workerv + 2896 7 hadcm3n_6.07_i686-apple-darwin 0x00008aa9 _main + 491 8 hadcm3n_6.07_i686-apple-darwin 0x00002676 start + 54 Thread 1: 0 libSystem.B.dylib 0x97e79c0e _mach_wait_until + 10 1 libSystem.B.dylib 0x97f01429 _nanosleep + 345 2 libSystem.B.dylib 0x97f012ca _usleep + 61 3 hadcm3n_6.07_i686-apple-darwin 0x00071a7c __Z11boinc_sleepd + 188 4 hadcm3n_6.07_i686-apple-darwin 0x00067282 __Z12timer_threadPv + 78 5 libSystem.B.dylib 0x97ea7259 __pthread_start + 345 6 libSystem.B.dylib 0x97ea70de _thread_start + 34 Thread 0 crashed with X86 Thread State (32-bit): eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000 edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8f9b8 esp: 0x00000000 ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e cs: 0x00000000 ds: 0x00000000 es: 0x00000000 fs: 0x00000000 gs: 0x00000000 Binary Images Description: 0x1000 - 0x93fff /Library/Application Support/BOINC Data/slots/27/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin 0x93c0e000 - 0x93c1cfff /usr/lib/libz.1.dylib 0x9414e000 - 0x941b8fff /usr/lib/libstdc++.6.dylib 0x96bd6000 - 0x96bd9fff /usr/lib/system/libmathCommon.A.dylib 0x97e79000 - 0x98020fff /usr/lib/libSystem.B.dylib Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
13 Oct 2013 18:24:19 | 1172598 | 16043242 | hadcm3n_odg2_1900_40_008472549_0 | 259,200 | 502,829 | 1.9399 |
12 Oct 2013 07:19:14 | 1172598 | 16043242 | hadcm3n_odg2_1900_40_008472549_0 | 233,280 | 451,628 | 1.9360 |
11 Oct 2013 03:07:14 | 1172598 | 16043242 | hadcm3n_odg2_1900_40_008472549_0 | 207,360 | 401,109 | 1.9344 |
10 Oct 2013 05:54:26 | 1172598 | 16043242 | hadcm3n_odg2_1900_40_008472549_0 | 181,440 | 350,067 | 1.9294 |
08 Oct 2013 14:59:27 | 1172598 | 16043242 | hadcm3n_odg2_1900_40_008472549_0 | 155,520 | 299,387 | 1.9251 |
07 Oct 2013 13:19:04 | 1172598 | 16043242 | hadcm3n_odg2_1900_40_008472549_0 | 129,600 | 250,280 | 1.9312 |
06 Oct 2013 13:07:01 | 1172598 | 16043242 | hadcm3n_odg2_1900_40_008472549_0 | 103,680 | 197,953 | 1.9093 |
05 Oct 2013 11:34:56 | 1172598 | 16043242 | hadcm3n_odg2_1900_40_008472549_0 | 77,760 | 147,817 | 1.9009 |
03 Oct 2013 11:52:39 | 1172598 | 16043242 | hadcm3n_odg2_1900_40_008472549_0 | 51,840 | 98,549 | 1.9010 |
02 Oct 2013 10:21:16 | 1172598 | 16043242 | hadcm3n_odg2_1900_40_008472549_0 | 25,920 | 49,418 | 1.9066 |
©2024 cpdn.org