Name | hadcm3n_ocep_1900_40_008471204_0 |
Workunit | 8622043 |
Created | 27 Sep 2013, 10:03:29 UTC |
Sent | 1 Oct 2013, 7:55:38 UTC |
Report deadline | 31 Dec 2013, 15:22:49 UTC |
Received | 14 Oct 2013, 10:54:34 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1172598 |
Run time | 7 days 19 hours 20 min 18 sec |
CPU time | 5 days 17 hours 6 min 1 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 3.14 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:09:51 (66927): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:02:07 (87489): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:26:16 (93426): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:38:32 (93743): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:38:33 (93743): No heartbeat from core client for 30 sec - exiting 03:38:34 (93743): No heartbeat from core client for 30 sec - exiting 03:38:35 (93743): No heartbeat from core client for 30 sec - exiting 06:12:05 (94956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:02:34 (96251): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 13:37:08 (11588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:37:10 (11588): No heartbeat from core client for 30 sec - exiting 13:37:11 (11588): No heartbeat from core client for 30 sec - exiting 13:37:12 (11588): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:09:11 (21126): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:18:23 (23887): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:11:51 (26914): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 00:13:10 (28176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:13:11 (28176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 22:08:24 (33531): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... hadcm3n_6.07_i686-apple-darwin(76600,0xa09ad540) malloc: *** error for object 0x8800e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(76600,0xa09ad540) malloc: *** error for object 0x8800e00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(76600,0xa09ad540) malloc: *** error for object 0x8801c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(76600,0xa09ad540) malloc: *** error for object 0x8801c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(76600,0xa09ad540) malloc: *** error for object 0x8801c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(76600,0xa09ad540) malloc: *** error for object 0x8801c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(76600,0xa09ad540) malloc: *** error for object 0x8801c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.6.8 build 10K549 Mon Oct 14 05:53:53 2013 Thread 0 Crashed: 0 libSystem.B.dylib 0x97e84b0f _small_free_list_remove_ptr + 246 1 libSystem.B.dylib 0x97e815cc _szone_free_definite_size + 3457 2 libSystem.B.dylib 0x97e805e8 _free + 244 3 hadcm3n_6.07_i686-apple-darwin 0x0000ba58 __Z12annual_cyclePKSt6vectorISsSaISsEEPKcii + 3482 4 hadcm3n_6.07_i686-apple-darwin 0x0000d36b __Z12decadalMeansiPKc + 957 5 hadcm3n_6.07_i686-apple-darwin 0x000067ff __Z9doCM3Procv + 185 6 hadcm3n_6.07_i686-apple-darwin 0x0000791c __Z8mainLoopv + 410 7 hadcm3n_6.07_i686-apple-darwin 0x000087c7 __Z6workerv + 2989 8 hadcm3n_6.07_i686-apple-darwin 0x00008aa9 _main + 491 9 hadcm3n_6.07_i686-apple-darwin 0x00002676 start + 54 Thread 1: 0 libSystem.B.dylib 0x97e79c0e _mach_wait_until + 10 1 libSystem.B.dylib 0x97f01429 _nanosleep + 345 2 libSystem.B.dylib 0x97f012ca _usleep + 61 3 hadcm3n_6.07_i686-apple-darwin 0x00071a7c __Z11boinc_sleepd + 188 4 hadcm3n_6.07_i686-apple-darwin 0x00067282 __Z12timer_threadPv + 78 5 libSystem.B.dylib 0x97ea7259 __pthread_start + 345 6 libSystem.B.dylib 0x97ea70de _thread_start + 34 Thread 0 crashed with X86 Thread State (32-bit): eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000 edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8f6c8 esp: 0x00000000 ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e cs: 0x00000000 ds: 0x00000000 es: 0x00000000 fs: 0x00000000 gs: 0x00000000 Binary Images Description: 0x1000 - 0x93fff /Library/Application Support/BOINC Data/slots/1/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin 0x93c0e000 - 0x93c1cfff /usr/lib/libz.1.dylib 0x9414e000 - 0x941b8fff /usr/lib/libstdc++.6.dylib 0x96bd6000 - 0x96bd9fff /usr/lib/system/libmathCommon.A.dylib 0x97e79000 - 0x98020fff /usr/lib/libSystem.B.dylib Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Oct 2013 09:57:27 | 1172598 | 16041890 | hadcm3n_ocep_1900_40_008471204_0 | 259,200 | 493,555 | 1.9041 |
12 Oct 2013 16:31:30 | 1172598 | 16041890 | hadcm3n_ocep_1900_40_008471204_0 | 233,280 | 443,321 | 1.9004 |
11 Oct 2013 13:46:28 | 1172598 | 16041890 | hadcm3n_ocep_1900_40_008471204_0 | 207,360 | 394,406 | 1.9020 |
10 Oct 2013 15:49:45 | 1172598 | 16041890 | hadcm3n_ocep_1900_40_008471204_0 | 181,440 | 345,666 | 1.9051 |
09 Oct 2013 00:43:57 | 1172598 | 16041890 | hadcm3n_ocep_1900_40_008471204_0 | 155,520 | 295,604 | 1.9007 |
07 Oct 2013 19:55:47 | 1172598 | 16041890 | hadcm3n_ocep_1900_40_008471204_0 | 129,600 | 245,804 | 1.8966 |
06 Oct 2013 23:59:03 | 1172598 | 16041890 | hadcm3n_ocep_1900_40_008471204_0 | 103,680 | 196,734 | 1.8975 |
06 Oct 2013 00:15:02 | 1172598 | 16041890 | hadcm3n_ocep_1900_40_008471204_0 | 77,760 | 146,529 | 1.8844 |
04 Oct 2013 14:21:39 | 1172598 | 16041890 | hadcm3n_ocep_1900_40_008471204_0 | 51,840 | 97,876 | 1.8880 |
02 Oct 2013 05:02:18 | 1172598 | 16041890 | hadcm3n_ocep_1900_40_008471204_0 | 25,920 | 48,300 | 1.8634 |
©2024 cpdn.org