Name | hadcm3n_o0a6_2140_40_008270200_4 |
Workunit | 8425324 |
Created | 14 Aug 2013, 19:13:39 UTC |
Sent | 14 Aug 2013, 19:14:02 UTC |
Report deadline | 14 Nov 2013, 2:41:13 UTC |
Received | 30 Aug 2013, 19:20:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1197259 |
Run time | 7 days 21 hours 5 min 29 sec |
CPU time | 4 days 22 hours 36 min 41 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 3.19 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>7.0.65</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 01:28:29 (80120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... SIGSEGV: segmentation violation CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:29:24 (55001): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:30:39 (14113): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:30:40 (14113): No heartbeat from core client for 30 sec - exiting 02:30:41 (14113): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x11f8804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x11f8800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x79f4804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x79f4800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x79f4804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x79f4800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x79f4804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x79f4800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x79f4804: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x79f4800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x79f4804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(60500,0xac96e2c0) malloc: *** error for object 0x79f4800: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x7090604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x7090600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x308c604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x308c600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x308c604: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x308c600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x308c604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x308c600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x308c604: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x308c600: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x88b804: incorrect checksum for freed object - object was probably modified after being freed. *** set a breakpoint in malloc_error_break to debug hadcm3n_6.07_i686-apple-darwin(22520,0xac96e2c0) malloc: *** error for object 0x88b800: incorrect checksum for freed object - object was probably modified after being freed. 
*** set a breakpoint in malloc_error_break to debug cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC 
Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_o0a6_2140_40_008270200/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
30 Aug 2013 16:09:30 | 1197259 | 15920490 | hadcm3n_o0a6_2140_40_008270200_4 | 259,200 | 427,043 | 1.6475 |
30 Aug 2013 01:23:14 | 1197259 | 15920490 | hadcm3n_o0a6_2140_40_008270200_4 | 233,280 | 386,705 | 1.6577 |
24 Aug 2013 14:09:26 | 1197259 | 15920490 | hadcm3n_o0a6_2140_40_008270200_4 | 207,360 | 346,553 | 1.6713 |
22 Aug 2013 05:53:35 | 1197259 | 15920490 | hadcm3n_o0a6_2140_40_008270200_4 | 181,440 | 306,358 | 1.6885 |
20 Aug 2013 20:21:38 | 1197259 | 15920490 | hadcm3n_o0a6_2140_40_008270200_4 | 155,520 | 262,993 | 1.6911 |
19 Aug 2013 22:59:16 | 1197259 | 15920490 | hadcm3n_o0a6_2140_40_008270200_4 | 129,600 | 219,580 | 1.6943 |
19 Aug 2013 00:38:12 | 1197259 | 15920490 | hadcm3n_o0a6_2140_40_008270200_4 | 103,680 | 175,856 | 1.6961 |
18 Aug 2013 02:38:22 | 1197259 | 15920490 | hadcm3n_o0a6_2140_40_008270200_4 | 77,760 | 132,045 | 1.6981 |
16 Aug 2013 14:35:44 | 1197259 | 15920490 | hadcm3n_o0a6_2140_40_008270200_4 | 51,840 | 89,359 | 1.7237 |
15 Aug 2013 16:58:46 | 1197259 | 15920490 | hadcm3n_o0a6_2140_40_008270200_4 | 25,920 | 44,703 | 1.7247 |