Name | hadcm3n_4dra_2020_40_008406789_2 |
Workunit | 8557645 |
Created | 27 Sep 2013, 6:45:31 UTC |
Sent | 27 Sep 2013, 6:45:53 UTC |
Report deadline | 27 Dec 2013, 14:13:04 UTC |
Received | 13 Oct 2013, 12:43:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1158872 |
Run time | 7 days 11 hours 25 min 28 sec |
CPU time | 7 days 5 hours 4 min 17 sec |
Validate state | Invalid |
Credit | 2,177.28 |
Device peak FLOPS | 1.52 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 21:41:18 (6568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:49:11 (5712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:12:01 (1900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:36:42 (9500): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 09:36:44 (9500): No heartbeat from core client for 30 sec - exiting 09:36:45 (9500): No heartbeat from core client for 30 sec - exiting 21:13:04 (11124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:13:06 (11124): No heartbeat from core client for 30 sec - exiting 21:13:07 (11124): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 15:38:08 (5880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 23:41:18 (2968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
23:41:38 (2968): No heartbeat from core client for 30 sec - exiting 23:41:40 (2968): No heartbeat from core client for 30 sec - exiting 23:41:41 (2968): No heartbeat from core client for 30 sec - exiting 23:41:42 (2968): No heartbeat from core client for 30 sec - exiting 23:41:43 (2968): No heartbeat from core client for 30 sec - exiting 23:41:44 (2968): No heartbeat from core client for 30 sec - exiting 23:41:45 (2968): No heartbeat from core client for 30 sec - exiting 23:41:46 (2968): No heartbeat from core client for 30 sec - exiting 23:41:47 (2968): No heartbeat from core client for 30 sec - exiting 23:41:48 (2968): No heartbeat from core client for 30 sec - exiting 23:41:49 (2968): No heartbeat from core client for 30 sec - exiting 23:41:50 (2968): No heartbeat from core client for 30 sec - exiting 23:41:52 (2968): No heartbeat from core client for 30 sec - exiting 23:41:53 (2968): No heartbeat from core client for 30 sec - exiting 23:41:54 (2968): No heartbeat from core client for 30 sec - exiting 23:41:55 (2968): No heartbeat from core client for 30 sec - exiting 23:41:56 (2968): No heartbeat from core client for 30 sec - exiting 23:41:57 (2968): No heartbeat from core client for 30 sec - exiting 23:41:58 (2968): No heartbeat from core client for 30 sec - exiting 23:41:59 (2968): No heartbeat from core client for 30 sec - exiting 23:42:00 (2968): No heartbeat from core client for 30 sec - exiting 23:42:01 (2968): No heartbeat from core client for 30 sec - exiting 23:42:02 (2968): No heartbeat from core client for 30 sec - exiting 23:42:04 (2968): No heartbeat from core client for 30 sec - exiting 23:42:05 (2968): No heartbeat from core client for 30 sec - exiting 23:42:06 (2968): No heartbeat from core client for 30 sec - exiting 23:42:07 (2968): No heartbeat from core client for 30 sec - exiting 23:42:08 (2968): No heartbeat from core client for 30 sec - exiting 23:42:09 (2968): No heartbeat from core client for 30 sec - exiting 23:42:10 (2968): No 
heartbeat from core client for 30 sec - exiting 23:42:11 (2968): No heartbeat from core client for 30 sec - exiting 23:42:12 (2968): No heartbeat from core client for 30 sec - exiting 23:42:13 (2968): No heartbeat from core client for 30 sec - exiting 23:42:15 (2968): No heartbeat from core client for 30 sec - exiting 23:42:16 (2968): No heartbeat from core client for 30 sec - exiting 23:42:17 (2968): No heartbeat from core client for 30 sec - exiting 23:52:51 (6572): No heartbeat from core client for 30 sec - exiting 23:53:29 (6572): No heartbeat from core client for 30 sec - exiting 23:53:30 (6572): No heartbeat from core client for 30 sec - exiting 23:53:31 (6572): No heartbeat from core client for 30 sec - exiting 23:53:32 (6572): No heartbeat from core client for 30 sec - exiting 23:56:45 (6572): No heartbeat from core client for 30 sec - exiting 23:56:47 (6572): No heartbeat from core client for 30 sec - exiting 23:56:48 (6572): No heartbeat from core client for 30 sec - exiting 23:56:49 (6572): No heartbeat from core client for 30 sec - exiting 23:56:50 (6572): No heartbeat from core client for 30 sec - exiting 23:56:51 (6572): No heartbeat from core client for 30 sec - exiting 23:56:52 (6572): No heartbeat from core client for 30 sec - exiting 23:56:53 (6572): No heartbeat from core client for 30 sec - exiting 23:56:54 (6572): No heartbeat from core client for 30 sec - exiting 23:56:55 (6572): No heartbeat from core client for 30 sec - exiting 23:56:56 (6572): No heartbeat from core client for 30 sec - exiting 23:56:57 (6572): No heartbeat from core client for 30 sec - exiting 23:56:59 (6572): No heartbeat from core client for 30 sec - exiting 23:57:00 (6572): No heartbeat from core client for 30 sec - exiting 23:57:01 (6572): No heartbeat from core client for 30 sec - exiting 23:57:02 (6572): No heartbeat from core client for 30 sec - exiting 23:57:03 (6572): No heartbeat from core client for 30 sec - exiting 23:57:04 (6572): No heartbeat from core client 
for 30 sec - exiting 23:57:05 (6572): No heartbeat from core client for 30 sec - exiting 23:57:06 (6572): No heartbeat from core client for 30 sec - exiting 23:57:07 (6572): No heartbeat from core client for 30 sec - exiting 23:57:08 (6572): No heartbeat from core client for 30 sec - exiting 23:57:09 (6572): No heartbeat from core client for 30 sec - exiting 23:57:11 (6572): No heartbeat from core client for 30 sec - exiting 23:57:12 (6572): No heartbeat from core client for 30 sec - exiting 23:57:13 (6572): No heartbeat from core client for 30 sec - exiting 23:57:14 (6572): No heartbeat from core client for 30 sec - exiting 23:57:15 (6572): No heartbeat from core client for 30 sec - exiting 23:57:16 (6572): No heartbeat from core client for 30 sec - exiting 23:57:17 (6572): No heartbeat from core client for 30 sec - exiting 23:57:18 (6572): No heartbeat from core client for 30 sec - exiting 23:57:19 (6572): No heartbeat from core client for 30 sec - exiting 23:57:20 (6572): No heartbeat from core client for 30 sec - exiting 23:57:21 (6572): No heartbeat from core client for 30 sec - exiting 23:57:23 (6572): No heartbeat from core client for 30 sec - exiting 23:57:24 (6572): No heartbeat from core client for 30 sec - exiting 23:57:25 (6572): No heartbeat from core client for 30 sec - exiting 23:57:26 (6572): No heartbeat from core client for 30 sec - exiting 23:57:27 (6572): No heartbeat from core client for 30 sec - exiting 23:57:28 (6572): No heartbeat from core client for 30 sec - exiting 23:57:29 (6572): No heartbeat from core client for 30 sec - exiting 23:57:30 (6572): No heartbeat from core client for 30 sec - exiting 23:57:31 (6572): No heartbeat from core client for 30 sec - exiting 23:57:32 (6572): No heartbeat from core client for 30 sec - exiting 23:57:33 (6572): No heartbeat from core client for 30 sec - exiting 23:58:07 (6572): No heartbeat from core client for 30 sec - exiting 23:58:08 (6572): No heartbeat from core client for 30 sec - exiting 
23:58:09 (6572): No heartbeat from core client for 30 sec - exiting 23:58:11 (6572): No heartbeat from core client for 30 sec - exiting 23:58:12 (6572): No heartbeat from core client for 30 sec - exiting 23:58:13 (6572): No heartbeat from core client for 30 sec - exiting 23:58:14 (6572): No heartbeat from core client for 30 sec - exiting 23:58:15 (6572): No heartbeat from core client for 30 sec - exiting 23:58:16 (6572): No heartbeat from core client for 30 sec - exiting 23:58:17 (6572): No heartbeat from core client for 30 sec - exiting 23:58:18 (6572): No heartbeat from core client for 30 sec - exiting 23:58:19 (6572): No heartbeat from core client for 30 sec - exiting 23:58:20 (6572): No heartbeat from core client for 30 sec - exiting 23:58:22 (6572): No heartbeat from core client for 30 sec - exiting 23:58:23 (6572): No heartbeat from core client for 30 sec - exiting 23:58:24 (6572): No heartbeat from core client for 30 sec - exiting 23:58:25 (6572): No heartbeat from core client for 30 sec - exiting 23:58:26 (6572): No heartbeat from core client for 30 sec - exiting 23:58:27 (6572): No heartbeat from core client for 30 sec - exiting 23:58:28 (6572): No heartbeat from core client for 30 sec - exiting 23:58:29 (6572): No heartbeat from core client for 30 sec - exiting 23:58:30 (6572): No heartbeat from core client for 30 sec - exiting 23:58:31 (6572): No heartbeat from core client for 30 sec - exiting 23:58:32 (6572): No heartbeat from core client for 30 sec - exiting 23:58:34 (6572): No heartbeat from core client for 30 sec - exiting 23:58:35 (6572): No heartbeat from core client for 30 sec - exiting 23:58:36 (6572): No heartbeat from core client for 30 sec - exiting 23:58:37 (6572): No heartbeat from core client for 30 sec - exiting 23:58:38 (6572): No heartbeat from core client for 30 sec - exiting 23:58:39 (6572): No heartbeat from core client for 30 sec - exiting 23:58:40 (6572): No heartbeat from core client for 30 sec - exiting 23:58:41 (6572): No 
heartbeat from core client for 30 sec - exiting 23:58:42 (6572): No heartbeat from core client for 30 sec - exiting 23:58:43 (6572): No heartbeat from core client for 30 sec - exiting 23:58:44 (6572): No heartbeat from core client for 30 sec - exiting 23:58:46 (6572): No heartbeat from core client for 30 sec - exiting 23:58:47 (6572): No heartbeat from core client for 30 sec - exiting 23:58:48 (6572): No heartbeat from core client for 30 sec - exiting 23:58:49 (6572): No heartbeat from core client for 30 sec - exiting 23:58:50 (6572): No heartbeat from core client for 30 sec - exiting 23:58:51 (6572): No heartbeat from core client for 30 sec - exiting 23:58:52 (6572): No heartbeat from core client for 30 sec - exiting 23:58:53 (6572): No heartbeat from core client for 30 sec - exiting 23:58:54 (6572): No heartbeat from core client for 30 sec - exiting 23:58:55 (6572): No heartbeat from core client for 30 sec - exiting 23:58:56 (6572): No heartbeat from core client for 30 sec - exiting 23:58:58 (6572): No heartbeat from core client for 30 sec - exiting 23:58:59 (6572): No heartbeat from core client for 30 sec - exiting 23:59:00 (6572): No heartbeat from core client for 30 sec - exiting 23:59:01 (6572): No heartbeat from core client for 30 sec - exiting 23:59:02 (6572): No heartbeat from core client for 30 sec - exiting 23:59:03 (6572): No heartbeat from core client for 30 sec - exiting 23:59:04 (6572): No heartbeat from core client for 30 sec - exiting 23:59:05 (6572): No heartbeat from core client for 30 sec - exiting 23:59:06 (6572): No heartbeat from core client for 30 sec - exiting 23:59:07 (6572): No heartbeat from core client for 30 sec - exiting 23:59:08 (6572): No heartbeat from core client for 30 sec - exiting 23:59:10 (6572): No heartbeat from core client for 30 sec - exiting 23:59:11 (6572): No heartbeat from core client for 30 sec - exiting 23:59:12 (6572): No heartbeat from core client for 30 sec - exiting 23:59:13 (6572): No heartbeat from core client 
for 30 sec - exiting 23:59:14 (6572): No heartbeat from core client for 30 sec - exiting 23:59:15 (6572): No heartbeat from core client for 30 sec - exiting 23:59:16 (6572): No heartbeat from core client for 30 sec - exiting 23:59:17 (6572): No heartbeat from core client for 30 sec - exiting 23:59:18 (6572): No heartbeat from core client for 30 sec - exiting 23:59:19 (6572): No heartbeat from core client for 30 sec - exiting 23:59:20 (6572): No heartbeat from core client for 30 sec - exiting 23:59:22 (6572): No heartbeat from core client for 30 sec - exiting 23:59:23 (6572): No heartbeat from core client for 30 sec - exiting 23:59:24 (6572): No heartbeat from core client for 30 sec - exiting 23:59:25 (6572): No heartbeat from core client for 30 sec - exiting 23:59:26 (6572): No heartbeat from core client for 30 sec - exiting 23:59:27 (6572): No heartbeat from core client for 30 sec - exiting 23:59:28 (6572): No heartbeat from core client for 30 sec - exiting 23:59:29 (6572): No heartbeat from core client for 30 sec - exiting 23:59:30 (6572): No heartbeat from core client for 30 sec - exiting 23:59:31 (6572): No heartbeat from core client for 30 sec - exiting 23:59:32 (6572): No heartbeat from core client for 30 sec - exiting 23:59:34 (6572): No heartbeat from core client for 30 sec - exiting 23:59:35 (6572): No heartbeat from core client for 30 sec - exiting 23:59:36 (6572): No heartbeat from core client for 30 sec - exiting 23:59:37 (6572): No heartbeat from core client for 30 sec - exiting 23:59:38 (6572): No heartbeat from core client for 30 sec - exiting 23:59:39 (6572): No heartbeat from core client for 30 sec - exiting 23:59:40 (6572): No heartbeat from core client for 30 sec - exiting 23:59:41 (6572): No heartbeat from core client for 30 sec - exiting 23:59:42 (6572): No heartbeat from core client for 30 sec - exiting 23:59:43 (6572): No heartbeat from core client for 30 sec - exiting 23:59:44 (6572): No heartbeat from core client for 30 sec - exiting 
23:59:46 (6572): No heartbeat from core client for 30 sec - exiting 23:59:47 (6572): No heartbeat from core client for 30 sec - exiting 23:59:48 (6572): No heartbeat from core client for 30 sec - exiting 23:59:49 (6572): No heartbeat from core client for 30 sec - exiting 23:59:50 (6572): No heartbeat from core client for 30 sec - exiting 23:59:51 (6572): No heartbeat from core client for 30 sec - exiting 23:59:52 (6572): No heartbeat from core client for 30 sec - exiting 23:59:53 (6572): No heartbeat from core client for 30 sec - exiting 23:59:54 (6572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:22:31 (6432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:35:33 (7068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6456, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6456, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6456, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6456, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6456, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... 
Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6456, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
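The stderr text above is dominated by repeated "No heartbeat" lines, so a per-process tally is easier to read than the raw log. The following is a minimal sketch, assuming each such entry has the form `HH:MM:SS (PID): No heartbeat from core client for 30 sec - exiting` as it appears above. The names `HEARTBEAT_RE`, `tally_heartbeat_losses`, and `sample` are illustrative, not part of BOINC or CPDN, and `sample` is a short excerpt copied from the log rather than the whole text.

```python
import re
from collections import Counter

# Pattern for one heartbeat-loss entry: wall-clock time, then the PID in parentheses.
HEARTBEAT_RE = re.compile(
    r"(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client for 30 sec - exiting"
)

def tally_heartbeat_losses(stderr_text):
    """Count heartbeat-loss messages per process ID (hypothetical helper)."""
    return Counter(pid for _time, pid in HEARTBEAT_RE.findall(stderr_text))

# Short excerpt copied verbatim from the stderr text above.
sample = (
    "09:36:42 (9500): No heartbeat from core client for 30 sec - exiting "
    "Suspended CPDN Monitor - No 'heartbeat' from BOINC... "
    "09:36:44 (9500): No heartbeat from core client for 30 sec - exiting "
    "09:36:45 (9500): No heartbeat from core client for 30 sec - exiting "
    "21:13:04 (11124): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "21:13:06 (11124): No heartbeat from core client for 30 sec - exiting "
    "21:13:07 (11124): No heartbeat from core client for 30 sec - exiting"
)

counts = tally_heartbeat_losses(sample)
for pid, n in counts.items():
    print(f"PID {pid}: {n} heartbeat-loss messages")
```

Passing the full stderr text instead of `sample` would give the tally for the whole task.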
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
13 Oct 2013 12:51:58 | 1158872 | 16036019 | hadcm3n_4dra_2020_40_008406789_2 | 181,440 | 564,049 | 3.1087 |
10 Oct 2013 10:52:28 | 1158872 | 16036019 | hadcm3n_4dra_2020_40_008406789_2 | 155,520 | 484,258 | 3.1138 |
10 Oct 2013 09:51:47 | 1158872 | 16036019 | hadcm3n_4dra_2020_40_008406789_2 | 129,600 | 402,714 | 3.1074 |
10 Oct 2013 09:51:47 | 1158872 | 16036019 | hadcm3n_4dra_2020_40_008406789_2 | 103,680 | 321,979 | 3.1055 |
03 Oct 2013 18:13:13 | 1158872 | 16036019 | hadcm3n_4dra_2020_40_008406789_2 | 77,760 | 241,314 | 3.1033 |
02 Oct 2013 18:44:13 | 1158872 | 16036019 | hadcm3n_4dra_2020_40_008406789_2 | 51,840 | 160,725 | 3.1004 |
01 Oct 2013 19:57:59 | 1158872 | 16036019 | hadcm3n_4dra_2020_40_008406789_2 | 25,920 | 80,533 | 3.1070 |
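The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. The following is a minimal sketch that checks this against the rows above, assuming that interpretation; the names `trickles` and `sec_per_timestep` are illustrative, and the tuples are copied from the table.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
trickles = [
    (181440, 564049, 3.1087),
    (155520, 484258, 3.1138),
    (129600, 402714, 3.1074),
    (103680, 321979, 3.1055),
    (77760, 241314, 3.1033),
    (51840, 160725, 3.1004),
    (25920, 80533, 3.1070),
]

def sec_per_timestep(cpu_time_sec, timestep):
    """Average CPU seconds per model timestep, rounded to 4 decimals like the table."""
    return round(cpu_time_sec / timestep, 4)

# Collect any row whose recomputed average differs from the reported one.
mismatches = []
for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    if abs(computed - reported) > 1e-4:
        mismatches.append((timestep, computed, reported))
    print(f"{timestep:>7} timesteps: computed {computed:.4f}, reported {reported:.4f}")

print("all rows consistent" if not mismatches else f"mismatches: {mismatches}")
```

Under this assumption every row reproduces the reported average, and consecutive trickles are 25,920 timesteps apart.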
©2025 cpdn.org