Name | hadcm3n_zgat_1920_40_008365430_4 |
Workunit | 8516289 |
Created | 14 Aug 2013, 11:40:47 UTC |
Sent | 14 Aug 2013, 17:55:31 UTC |
Report deadline | 14 Nov 2013, 1:22:42 UTC |
Received | 24 Aug 2013, 13:54:51 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1190785 |
Run time | 9 days 12 hours 24 min 2 sec |
CPU time | 8 days 2 hours 27 min 12 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 3.32 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.0.65</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:21:57 (25686): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:23:46 (37570): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:30:22 (37588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:34:40 (37628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:37:14 (37658): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:49:04 (37677): No heartbeat from core client for 30 sec - exiting 15:49:05 (37677): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Error: Input file: dataout/zgatko.pjd2c10 is not a valid UM file. Error converting file to netcdf: dataout/zgatko.pjd2c10 Error: Input file: dataout/zgatko.pid2c10 is not a valid UM file. Error converting file to netcdf: dataout/zgatko.pid2c10 Error: Input file: dataout/zgatko.pfd2c10 is not a valid UM file. Error converting file to netcdf: dataout/zgatko.pfd2c10 Error: Input file: dataout/zgatka.phd2c10 is not a valid UM file. Error converting file to netcdf: dataout/zgatka.phd2c10 Error: Input file: dataout/zgatka.pgd2c10 is not a valid UM file. Error converting file to netcdf: dataout/zgatka.pgd2c10 Error: Input file: dataout/zgatka.ped2c10 is not a valid UM file. Error converting file to netcdf: dataout/zgatka.ped2c10 Error: Input file: dataout/zgatka.pdd2c10 is not a valid UM file. Error converting file to netcdf: dataout/zgatka.pdd2c10 15:53:11 (37740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:18:12 (37782): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b8b204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b8b200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b8b204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b8b200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x138a804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x138a800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b6fc04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383000: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383000: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383004: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2383000: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0xb59c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x136f204: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x136f200: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b45604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b45600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b3b804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b3b800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(48182,0xac17a2c0) malloc: *** error for object 0x2b3b804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.7.5 build 11G63 Sat Aug 24 14:10:02 2013 Thread 0 Crashed: Thread 1: atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin for architecture i386. Thread 0 crashed with X86 Thread State (32-bit): eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000 edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8faa8 esp: 0x00000000 ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e cs: 0x00000000 ds: 0x00000000 es: 0x00000000 fs: 0x00000000 gs: 0x00000000 Binary Images Description: 0x1000 - 0x93fff /Library/Application Support/BOINC Data/slots/2/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin 0x9082c000 - 0x9082cfff /usr/lib/system/libkeymgr.dylib 0x91556000 - 0x91621fff /usr/lib/system/libsystem_c.dylib 0x92d8c000 - 0x92d93fff /usr/lib/system/libsystem_notify.dylib 0x92ffa000 - 0x92ffffff /usr/lib/system/libmacho.dylib 0x93091000 - 0x930bffff /usr/lib/libSystem.B.dylib 0x931cc000 - 0x931eafff /usr/lib/system/libsystem_kernel.dylib 0x94316000 - 0x94324fff /usr/lib/system/libdispatch.dylib 0x94325000 - 0x9432efff /usr/lib/libc++abi.dylib 0x94815000 - 0x94818fff /usr/lib/system/libcompiler_rt.dylib 0x95961000 - 0x95965fff /usr/lib/system/libsystem_network.dylib 0x95f17000 - 0x95f25fff /usr/lib/libz.1.dylib 0x95f8c000 - 0x95fa2fff /usr/lib/system/libxpc.dylib 0x9747e000 - 0x97486fff /usr/lib/system/liblaunch.dylib 0x9754a000 - 0x9754bfff /usr/lib/system/libunc.dylib 0x978c1000 - 0x978c5fff /usr/lib/system/libcache.dylib 0x991bd000 - 0x9921ffff /usr/lib/libstdc++.6.dylib 0x99221000 - 0x99221fff /usr/lib/system/libdnsinfo.dylib 0x99742000 - 0x99743fff /usr/lib/system/libquarantine.dylib 0x99ed1000 - 0x99ed8fff /usr/lib/system/libsystem_dnssd.dylib 0x99ed9000 - 0x99f1cfff /usr/lib/system/libcommonCrypto.dylib 0x9a010000 - 0x9a03ffff /usr/lib/system/libsystem_info.dylib 0x9a4f3000 - 0x9a4f4fff /usr/lib/system/libsystem_blocks.dylib 0x9bcdd000 - 0x9bce5fff /usr/lib/system/libcopyfile.dylib 0x9c196000 - 0x9c197fff /usr/lib/system/libsystem_sandbox.dylib 0x9c77b000 - 0x9c77efff /usr/lib/system/libmathCommon.A.dylib 0x9c7dd000 - 0x9c7defff /usr/lib/system/libremovefile.dylib 0x9ca05000 - 0x9ca07fff /usr/lib/system/libdyld.dylib 0x9ca08000 - 0x9ca10fff /usr/lib/system/libunwind.dylib Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
24 Aug 2013 13:59:24 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 777,600 | 700,027 | 0.9002 |
24 Aug 2013 05:27:18 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 751,680 | 676,616 | 0.9001 |
23 Aug 2013 21:54:50 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 725,760 | 653,157 | 0.9000 |
23 Aug 2013 13:17:17 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 699,840 | 629,741 | 0.8998 |
23 Aug 2013 10:48:49 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 673,920 | 606,356 | 0.8997 |
23 Aug 2013 10:48:49 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 648,000 | 582,926 | 0.8996 |
22 Aug 2013 15:42:13 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 622,080 | 559,450 | 0.8993 |
22 Aug 2013 07:03:59 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 596,160 | 536,046 | 0.8992 |
21 Aug 2013 23:25:37 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 570,240 | 512,682 | 0.8991 |
21 Aug 2013 15:33:17 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 544,320 | 489,306 | 0.8989 |
21 Aug 2013 08:01:36 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 518,400 | 465,989 | 0.8989 |
21 Aug 2013 00:18:28 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 492,480 | 442,596 | 0.8987 |
20 Aug 2013 16:50:34 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 466,560 | 419,272 | 0.8986 |
20 Aug 2013 09:54:06 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 440,640 | 395,882 | 0.8984 |
20 Aug 2013 01:46:07 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 414,720 | 372,479 | 0.8981 |
19 Aug 2013 15:26:01 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 388,800 | 349,108 | 0.8979 |
19 Aug 2013 06:39:15 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 362,880 | 325,774 | 0.8977 |
18 Aug 2013 23:07:32 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 336,960 | 302,399 | 0.8974 |
18 Aug 2013 15:36:10 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 311,040 | 278,989 | 0.8970 |
18 Aug 2013 07:54:35 | 1190785 | 15915888 | hadcm3n_zgat_1920_40_008365430_4 | 285,120 | 255,840 | 0.8973 |
©2024 climateprediction.net