Name | hadcm3n_ldk4_1940_40_010020926_1 |
Workunit | 10018997 |
Created | 26 Aug 2015, 12:15:31 UTC |
Sent | 26 Aug 2015, 12:15:55 UTC |
Report deadline | 25 Nov 2015, 19:43:06 UTC |
Received | 18 Sep 2015, 12:02:04 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 4 (0x00000004) Unknown error code |
Computer ID | 1303560 |
Run time | 12 days 9 hours 15 min 33 sec |
CPU time | 11 days 1 hours 24 min 17 sec |
Validate state | Invalid |
Credit | 9,020.16 |
Device peak FLOPS | 3.84 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.6.10</core_client_version> <![CDATA[ <message> process got signal 4 </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 14:38:00 (16742): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:28:18 (19459): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:28:19 (19459): No heartbeat from core client for 30 sec - exiting 17:28:20 (19459): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 14:40:58 (11612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:40:59 (11612): No heartbeat from core client for 30 sec - exiting 15:47:13 (17861): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:47:15 (17861): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:57:21 (23687): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:57:23 (23687): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 15:46:01 (35576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:46:03 (35576): No heartbeat from core client for 30 sec - exiting 15:46:04 (35576): No heartbeat from core client for 30 sec - exiting 15:46:05 (35576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 15:30:13 (49299): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 15:25:08 (59644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:25:09 (59644): No heartbeat from core client for 30 sec - exiting 15:25:10 (59644): No heartbeat from core client for 30 sec - exiting SIGSEGV: segmentation violation 13:21:51 (70680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:21:52 (70680): No heartbeat from core client for 30 sec - exiting 13:21:53 (70680): No heartbeat from core client for 30 sec - exiting 13:21:54 (70680): No heartbeat from core client for 30 sec - exiting 13:21:55 (70680): No heartbeat from core client for 30 sec - exiting 13:21:56 (70680): No heartbeat from core client for 30 sec - exiting 13:21:57 (70680): No heartbeat from core client for 30 sec - exiting 13:21:58 (70680): No heartbeat from core client for 30 sec - exiting hadcm3n_6.07_i686-apple-darwin(82118,0xa04f11d4) malloc: *** error for object 0x867a04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(82118,0xa04f11d4) malloc: *** error for object 0x1004e04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(82118,0xa04f11d4) malloc: *** error for object 0x1005c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(82118,0xa04f11d4) malloc: *** error for object 0x1005c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(82118,0xa04f11d4) malloc: *** error for object 0x1005c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(82118,0xa04f11d4) malloc: *** error for object 0x1005c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(82118,0xa04f11d4) malloc: *** error for object 0x1005c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(82118,0xa04f11d4) malloc: *** error for object 0x1005c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(82118,0xa04f11d4) malloc: *** error for object 0x1005c04: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(82118,0xa04f11d4) malloc: *** error for object 0x1005c00: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:26:12 (7534): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:02:48 (7591): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:02:50 (7591): No heartbeat from core client for 30 sec - exiting 14:02:51 (7591): No heartbeat from core client for 30 sec - exiting 14:02:52 (7591): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:51:24 (24031): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:34:53 (24118): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:05:18 (31238): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:10:33 (42632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:43:00 (44144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:43:01 (44144): No heartbeat from core client for 30 sec - exiting 20:43:02 (44144): No heartbeat from core client for 30 sec - exiting 20:43:03 (44144): No heartbeat from core client for 30 sec - exiting 20:43:04 (44144): No heartbeat from core client for 30 sec - exiting 20:43:05 (44144): No heartbeat from core client for 30 sec - exiting 21:00:56 (46060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... SIGSEGV: segmentation violation Crashed executable name: hadcm3n_6.07_i686-apple-darwin built using BOINC library version 6.13.0 Machine type Intel 80486 (32-bit executable) System version: Macintosh OS 10.10.5 build 14F27 Fri Sep 18 21:01:03 2015 </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
18 Sep 2015 03:06:40 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 751,680 | 929,251 | 1.2362 |
17 Sep 2015 08:15:21 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 725,760 | 896,918 | 1.2358 |
16 Sep 2015 11:30:56 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 699,840 | 864,679 | 1.2355 |
15 Sep 2015 18:09:35 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 673,920 | 832,834 | 1.2358 |
15 Sep 2015 07:56:20 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 648,000 | 800,129 | 1.2348 |
13 Sep 2015 23:53:25 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 622,080 | 768,053 | 1.2347 |
13 Sep 2015 07:19:02 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 596,160 | 735,546 | 1.2338 |
11 Sep 2015 17:17:54 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 570,240 | 703,285 | 1.2333 |
11 Sep 2015 07:28:26 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 544,320 | 671,242 | 1.2332 |
10 Sep 2015 12:13:23 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 518,400 | 639,276 | 1.2332 |
09 Sep 2015 17:26:04 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 492,480 | 606,584 | 1.2317 |
09 Sep 2015 06:42:53 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 466,560 | 573,863 | 1.2300 |
08 Sep 2015 09:50:11 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 440,640 | 541,641 | 1.2292 |
07 Sep 2015 11:10:59 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 414,720 | 509,062 | 1.2275 |
06 Sep 2015 13:22:30 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 388,800 | 477,368 | 1.2278 |
05 Sep 2015 17:35:17 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 362,880 | 445,587 | 1.2279 |
05 Sep 2015 07:37:25 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 336,960 | 413,401 | 1.2269 |
04 Sep 2015 10:10:45 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 311,040 | 381,466 | 1.2264 |
03 Sep 2015 12:55:15 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 285,120 | 349,311 | 1.2251 |
02 Sep 2015 15:11:37 | 1303560 | 18859482 | hadcm3n_ldk4_1940_40_010020926_1 | 259,200 | 317,015 | 1.2231 |
©2024 cpdn.org