Name | hadcm3n_o18h_2060_40_008365932_1 |
Workunit | 8516791 |
Created | 11 May 2013, 4:16:43 UTC |
Sent | 11 May 2013, 4:40:55 UTC |
Report deadline | 10 Aug 2013, 12:08:06 UTC |
Received | 29 May 2013, 13:57:55 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1172598 |
Run time | 7 days 11 hours 40 min 59 sec |
CPU time | 6 days 2 hours 0 min 48 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 3.14 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> 07:14:29 (56220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:26:47 (63314): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:36:58 (64749): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:35:42 (75076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:33:35 (5186): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
hadcm3n_6.07_i686-apple-darwin(22343,0xa09ad540) malloc: *** error for object 0x3001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22343,0xa09ad540) malloc: *** error for object 0x3001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22343,0xa09ad540) malloc: *** error for object 0x3001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22343,0xa09ad540) malloc: *** error for object 0x3001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22343,0xa09ad540) malloc: *** error for object 0x3001c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22343,0xa09ad540) malloc: *** error for object 0x3001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22343,0xa09ad540) malloc: *** error for object 0x3001c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation hadcm3n_6.07_i686-apple-darwin(33395,0xa09ad540) malloc: *** error for object 0x9036204: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(33395,0xa09ad540) malloc: *** error for object 0x9036200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(33395,0xa09ad540) malloc: *** error for object 0x9036204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.6.8 build 10K549 Wed May 29 07:44:12 2013 Thread 0 Crashed: 0 libSystem.B.dylib 0x97e84b03 _small_free_list_remove_ptr + 234 1 libSystem.B.dylib 0x97e815cc _szone_free_definite_size + 3457 2 libSystem.B.dylib 0x97e805e8 _free + 244 3 hadcm3n_6.07_i686-apple-darwin 0x0000ba58 __Z12annual_cyclePKSt6vectorISsSaISsEEPKcii + 3482 4 hadcm3n_6.07_i686-apple-darwin 0x0000d36b __Z12decadalMeansiPKc + 957 5 hadcm3n_6.07_i686-apple-darwin 0x000067ff __Z9doCM3Procv + 185 6 hadcm3n_6.07_i686-apple-darwin 0x0000876a __Z6workerv + 2896 7 hadcm3n_6.07_i686-apple-darwin 0x00008aa9 _main + 491 8 hadcm3n_6.07_i686-apple-darwin 0x00002676 start + 54 Thread 1: 0 libSystem.B.dylib 0x97e79c0e _mach_wait_until + 10 1 libSystem.B.dylib 0x97f01429 _nanosleep + 345 2 libSystem.B.dylib 0x97f012ca _usleep + 61 3 hadcm3n_6.07_i686-apple-darwin 0x00071a7c __Z11boinc_sleepd + 188 4 hadcm3n_6.07_i686-apple-darwin 0x00067282 __Z12timer_threadPv + 78 5 libSystem.B.dylib 0x97ea7259 __pthread_start + 345 6 libSystem.B.dylib 0x97ea70de _thread_start + 34 Thread 0 crashed with X86 Thread State (32-bit): eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000 edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8f978 esp: 0x00000000 ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e cs: 0x00000000 ds: 0x00000000 
es: 0x00000000 fs: 0x00000000 gs: 0x00000000 Binary Images Description: 0x1000 - 0x93fff /Library/Application Support/BOINC Data/slots/15/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin 0x93c0e000 - 0x93c1cfff /usr/lib/libz.1.dylib 0x9414e000 - 0x941b8fff /usr/lib/libstdc++.6.dylib 0x96bd6000 - 0x96bd9fff /usr/lib/system/libmathCommon.A.dylib 0x97e79000 - 0x98020fff /usr/lib/libSystem.B.dylib Exiting... </stderr_txt> ]]> |
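
The exit status above appears in three forms: 193, 0x000000C1, and -63 (in the stderr message). The following is a minimal sketch of how those values relate, assuming the -63 is simply the same low byte read as a signed 8-bit integer. The helper name `describe_exit_status` is hypothetical and is not part of BOINC or the CPDN application.

```python
def describe_exit_status(code: int) -> dict:
    """Show one exit status as decimal, zero-padded hex, and signed byte.

    Assumption: the negative value in the log is the low byte of the
    status reinterpreted as a signed 8-bit (two's complement) integer.
    """
    low_byte = code & 0xFF
    # Values above 127 wrap around to negative numbers in a signed byte.
    signed = low_byte - 256 if low_byte > 127 else low_byte
    return {
        "decimal": code,
        "hex": f"0x{code:08X}",
        "signed_byte": signed,
    }


info = describe_exit_status(193)
print(info)
```

Under that assumption, 193 - 256 gives -63, which matches the `(0xc1, -63)` pair in the stderr message.
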
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
29 May 2013 07:47:38 | 1172598 | 15776414 | hadcm3n_o18h_2060_40_008365932_1 | 259,200 | 517,601 | 1.9969 |
28 May 2013 17:17:26 | 1172598 | 15776414 | hadcm3n_o18h_2060_40_008365932_1 | 233,280 | 468,862 | 2.0099 |
24 May 2013 18:43:38 | 1172598 | 15776414 | hadcm3n_o18h_2060_40_008365932_1 | 207,360 | 421,347 | 2.0320 |
22 May 2013 16:06:20 | 1172598 | 15776414 | hadcm3n_o18h_2060_40_008365932_1 | 181,440 | 368,422 | 2.0305 |
21 May 2013 17:47:00 | 1172598 | 15776414 | hadcm3n_o18h_2060_40_008365932_1 | 155,520 | 313,687 | 2.0170 |
18 May 2013 09:28:58 | 1172598 | 15776414 | hadcm3n_o18h_2060_40_008365932_1 | 129,600 | 262,316 | 2.0240 |
17 May 2013 07:39:52 | 1172598 | 15776414 | hadcm3n_o18h_2060_40_008365932_1 | 103,680 | 210,537 | 2.0306 |
15 May 2013 22:53:20 | 1172598 | 15776414 | hadcm3n_o18h_2060_40_008365932_1 | 77,760 | 158,820 | 2.0424 |
13 May 2013 12:26:16 | 1172598 | 15776414 | hadcm3n_o18h_2060_40_008365932_1 | 51,840 | 106,055 | 2.0458 |
12 May 2013 09:36:21 | 1172598 | 15776414 | hadcm3n_o18h_2060_40_008365932_1 | 25,920 | 52,998 | 2.0447 |
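
The Average (sec/TS) column is consistent with cumulative CPU time divided by the timestep count. The short sketch below checks that reading against the rows above. The values are copied from the table, while the variable and function names are illustrative and do not come from the CPDN site.

```python
# (timestep, cumulative CPU time in seconds, reported average sec/TS),
# copied from the trickle table, newest first.
trickles = [
    (259200, 517601, 1.9969),
    (233280, 468862, 2.0099),
    (207360, 421347, 2.0320),
    (181440, 368422, 2.0305),
    (155520, 313687, 2.0170),
    (129600, 262316, 2.0240),
    (103680, 210537, 2.0306),
    (77760, 158820, 2.0424),
    (51840, 106055, 2.0458),
    (25920, 52998, 2.0447),
]


def sec_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by timesteps completed so far."""
    return cpu_seconds / timestep


# Largest gap between the recomputed and the reported average.
max_diff = max(
    abs(sec_per_timestep(cpu, ts) - reported)
    for ts, cpu, reported in trickles
)
print(f"largest difference from reported averages: {max_diff:.6f}")
```

Each recomputed value agrees with the reported one to within rounding at four decimal places, so the column is probably a running average rather than a per-interval rate.
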
©2024 cpdn.org