Name | hadcm3n_t6j5_1940_40_007448485_3 |
Workunit | 7645988 |
Created | 25 Nov 2011, 16:10:30 UTC |
Sent | 25 Nov 2011, 16:21:41 UTC |
Report deadline | 24 Feb 2012, 23:48:52 UTC |
Received | 1 Dec 2011, 14:07:59 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 785487 |
Run time | 1 day 4 hours 29 min 36 sec |
CPU time | 1 day 2 hours 26 min 29 sec |
Validate state | Invalid |
Credit | 622.08 |
Device peak FLOPS | 2.86 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 17:26:51 (15707): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:38:03 (15776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:49:37 (15796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:55:02 (15857): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:58:51 (15877): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:10:24 (15936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:51:37 (15964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:11:01 (16169): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:11:35 (16262): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:26:45 (16282): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 20:34:14 (16938): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:34:15 (16938): No heartbeat from core client for 30 sec - exiting 20:36:52 (17015): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
20:36:53 (17015): No heartbeat from core client for 30 sec - exiting 20:36:54 (17015): No heartbeat from core client for 30 sec - exiting 20:36:55 (17015): No heartbeat from core client for 30 sec - exiting 20:36:56 (17015): No heartbeat from core client for 30 sec - exiting 20:36:57 (17015): No heartbeat from core client for 30 sec - exiting 20:36:58 (17015): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Suspended CPDN Monitor - Suspend request from BOINC... 21:30:16 (17037): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:30:48 (17270): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:33:48 (17289): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:48:46 (17313): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
22:40:41 (17388): No heartbeat from core client for 30 sec - exiting 22:40:42 (17388): No heartbeat from core client for 30 sec - exiting 22:40:43 (17388): No heartbeat from core client for 30 sec - exiting 22:40:44 (17388): No heartbeat from core client for 30 sec - exiting 22:40:45 (17388): No heartbeat from core client for 30 sec - exiting 22:40:46 (17388): No heartbeat from core client for 30 sec - exiting 22:40:47 (17388): No heartbeat from core client for 30 sec - exiting 22:40:48 (17388): No heartbeat from core client for 30 sec - exiting 22:40:49 (17388): No heartbeat from core client for 30 sec - exiting 22:40:50 (17388): No heartbeat from core client for 30 sec - exiting 22:40:51 (17388): No heartbeat from core client for 30 sec - exiting 22:40:52 (17388): No heartbeat from core client for 30 sec - exiting 22:40:53 (17388): No heartbeat from core client for 30 sec - exiting 22:40:54 (17388): No heartbeat from core client for 30 sec - exiting 22:40:55 (17388): No heartbeat from core client for 30 sec - exiting 22:40:56 (17388): No heartbeat from core client for 30 sec - exiting 22:40:57 (17388): No heartbeat from core client for 30 sec - exiting 22:40:58 (17388): No heartbeat from core client for 30 sec - exiting 22:40:59 (17388): No heartbeat from core client for 30 sec - exiting 22:41:00 (17388): No heartbeat from core client for 30 sec - exiting 22:41:01 (17388): No heartbeat from core client for 30 sec - exiting 22:41:02 (17388): No heartbeat from core client for 30 sec - exiting 22:41:03 (17388): No heartbeat from core client for 30 sec - exiting 22:41:04 (17388): No heartbeat from core client for 30 sec - exiting 22:41:05 (17388): No heartbeat from core client for 30 sec - exiting 22:41:06 (17388): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 23:56:47 (14047): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:33:52 (14419): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:27:46 (15372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:29:35 (15733): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 13:02:34 (17826): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:03:31 (18799): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:32:48 (19419): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:48:38 (19732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 07:51:41 (11517): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:52:28 (11733): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:54:45 (11755): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:56:34 (11775): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
08:37:52 (11792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:40:33 (12396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:41:38 (12413): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:42:52 (12718): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:26:21 (12733): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:37:30 (13748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:18:37 (13816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:21:28 (13978): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:22:31 (13998): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:33:19 (14019): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 12:27:13 (14096): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:30:00 (14491): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
13:22:40 (14546): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:50:12 (14727): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:57:42 (14854): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 13:59:20 (14913): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:01:21 (14970): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:03:39 (14985): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:05:20 (14999): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:06:34 (15012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:08:02 (15026): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:09:32 (15041): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:10:58 (15058): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:12:55 (15077): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
14:13:27 (15077): No heartbeat from core client for 30 sec - exiting 14:13:29 (15077): No heartbeat from core client for 30 sec - exiting 14:13:30 (15077): No heartbeat from core client for 30 sec - exiting 14:15:19 (15097): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:15:34 (15097): No heartbeat from core client for 30 sec - exiting 14:15:35 (15097): No heartbeat from core client for 30 sec - exiting 14:15:36 (15097): No heartbeat from core client for 30 sec - exiting 14:15:37 (15097): No heartbeat from core client for 30 sec - exiting 14:17:47 (15139): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:18:16 (15139): No heartbeat from core client for 30 sec - exiting 14:18:17 (15139): No heartbeat from core client for 30 sec - exiting 14:18:18 (15139): No heartbeat from core client for 30 sec - exiting 14:18:19 (15139): No heartbeat from core client for 30 sec - exiting 14:19:38 (15156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
01 Dec 2011 08:35:46 | 785487 | 13661545 | hadcm3n_t6j5_1940_40_007448485_3 | 51,840 | 82,570 | 1.5928 |
28 Nov 2011 06:29:53 | 785487 | 13661545 | hadcm3n_t6j5_1940_40_007448485_3 | 25,920 | 41,327 | 1.5944 |
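
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. The page does not state the formula, so this is an assumption; the short sketch below checks it against the two trickle rows above. The function name `seconds_per_timestep` is hypothetical and used only for illustration.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
trickles = [
    (51_840, 82_570, 1.5928),
    (25_920, 41_327, 1.5944),
]


def seconds_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Assumed formula: cumulative CPU seconds divided by timesteps completed."""
    return cpu_time_sec / timestep


for timestep, cpu_sec, reported in trickles:
    computed = seconds_per_timestep(cpu_sec, timestep)
    # Compare at the four decimal places shown in the table.
    match = round(computed, 4) == reported
    print(f"{timestep:>6} TS: computed {computed:.4f} s/TS, "
          f"reported {reported:.4f}, match={match}")
```

Under that assumption both rows agree with the reported values to four decimal places, which suggests the host ran at a steady rate of about 1.59 CPU seconds per timestep between the two trickles.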