Name | hadcm3n_o6iz_1940_40_007432999_0 |
Workunit | 7630502 |
Created | 31 Aug 2011, 20:59:54 UTC |
Sent | 1 Sep 2011, 2:09:02 UTC |
Report deadline | 1 Dec 2011, 9:36:13 UTC |
Received | 15 Oct 2011, 0:52:06 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1164148 |
Run time | 12 days 12 hours 25 min 39 sec |
CPU time | 11 days 19 hours 4 min 9 sec |
Validate state | Invalid |
Credit | 4,354.56 |
Device peak FLOPS | 2.45 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> Enheden genkender ikke kommandoen. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:46:21 (3512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:23:22 (5932): No heartbeat from core client for 30 sec - exiting 23:23:23 (5932): No heartbeat from core client for 30 sec - exiting 23:23:24 (5932): No heartbeat from core client for 30 sec - exiting 23:23:25 (5932): No heartbeat from core client for 30 sec - exiting 23:23:26 (5932): No heartbeat from core client for 30 sec - exiting 23:23:27 (5932): No heartbeat from core client for 30 sec - exiting 23:23:28 (5932): No heartbeat from core client for 30 sec - exiting 23:23:29 (5932): No heartbeat from core client for 30 sec - exiting 23:23:30 (5932): No heartbeat from core client for 30 sec - exiting 23:23:31 (5932): No heartbeat from core client for 30 sec - exiting 23:23:32 (5932): No heartbeat from core client for 30 sec - exiting 23:23:33 (5932): No heartbeat from core client for 30 sec - exiting 23:23:34 (5932): No heartbeat from core client for 30 sec - exiting 23:23:35 (5932): No heartbeat from core client for 30 sec - exiting 23:23:36 (5932): No heartbeat from core client for 30 sec - exiting 23:23:37 (5932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:52:25 (4780): No heartbeat from core client for 30 sec - exiting 18:52:26 (4780): No heartbeat from core client for 30 sec - exiting 18:52:27 (4780): No heartbeat from core client for 30 sec - exiting 18:52:28 (4780): No heartbeat from core client for 30 sec - exiting 18:52:29 (4780): No heartbeat from core client for 30 sec - exiting 18:52:30 (4780): No heartbeat from core client for 30 sec - exiting 18:52:32 (4780): No heartbeat from core client for 30 sec - exiting 18:52:33 (4780): No heartbeat from core client for 30 sec - exiting 18:52:34 (4780): No heartbeat from core client for 30 sec - exiting 18:52:35 (4780): No heartbeat from core client for 30 sec - exiting 18:52:36 (4780): No heartbeat from core client for 30 sec - exiting 18:52:37 (4780): No heartbeat from core client for 30 sec - exiting 18:52:38 (4780): No heartbeat from core client for 30 sec - exiting 18:52:39 (4780): No heartbeat from core client for 30 sec - exiting 18:52:40 (4780): No heartbeat from core client for 30 sec - exiting 18:52:41 (4780): No heartbeat from core client for 30 sec - exiting 18:52:42 (4780): No heartbeat from core client for 30 sec - exiting 18:52:43 (4780): No heartbeat from core client for 30 sec - exiting 18:52:44 (4780): No heartbeat from core client for 30 sec - exiting 18:52:45 (4780): No heartbeat from core client for 30 sec - exiting 18:52:46 (4780): No heartbeat from core client for 30 sec - exiting 18:52:47 (4780): No heartbeat from core client for 30 sec - exiting 18:52:49 (4780): No heartbeat from core client for 30 sec - exiting 18:52:50 (4780): No heartbeat from core client for 30 sec - exiting 18:52:51 (4780): No heartbeat from core client for 30 sec - exiting 18:52:52 (4780): No heartbeat from core client for 30 sec - exiting 18:52:53 (4780): No heartbeat from core client for 30 sec - exiting 18:52:54 (4780): No heartbeat from core client for 30 sec - exiting 18:52:55 (4780): No heartbeat from core client for 30 sec - exiting 18:52:56 (4780): No heartbeat from core client for 30 sec - exiting 18:52:57 (4780): No heartbeat from core client for 30 sec - exiting 18:52:58 (4780): No heartbeat from core client for 30 sec - exiting 18:52:59 (4780): No heartbeat from core client for 30 sec - exiting 18:53:01 (4780): No heartbeat from core client for 30 sec - exiting 18:53:02 (4780): No heartbeat from core client for 30 sec - exiting 18:53:03 (4780): No heartbeat from core client for 30 sec - exiting 18:53:04 (4780): No heartbeat from core client for 30 sec - exiting 18:53:05 (4780): No heartbeat from core client for 30 sec - exiting 18:53:06 (4780): No heartbeat from core client for 30 sec - exiting 18:53:07 (4780): No heartbeat from core client for 30 sec - exiting 18:53:08 (4780): No heartbeat from core client for 30 sec - exiting 18:53:09 (4780): No heartbeat from core client for 30 sec - exiting 18:53:10 (4780): No heartbeat from core client for 30 sec - exiting 18:53:11 (4780): No heartbeat from core client for 30 sec - exiting 18:53:12 (4780): No heartbeat from core client for 30 sec - exiting 18:53:13 (4780): No heartbeat from core client for 30 sec - exiting 18:53:14 (4780): No heartbeat from core client for 30 sec - exiting 18:53:15 (4780): No heartbeat from core client for 30 sec - exiting 18:53:16 (4780): No heartbeat from core client for 30 sec - exiting 18:53:17 (4780): No heartbeat from core client for 30 sec - exiting 18:53:18 (4780): No heartbeat from core client for 30 sec - exiting 18:53:19 (4780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:45:09 (6072): No heartbeat from core client for 30 sec - exiting 11:45:11 (6072): No heartbeat from core client for 30 sec - exiting 11:45:12 (6072): No heartbeat from core client for 30 sec - exiting 11:45:13 (6072): No heartbeat from core client for 30 sec - exiting 11:45:14 (6072): No heartbeat from core client for 30 sec - exiting 11:45:15 (6072): No heartbeat from core client for 30 sec - exiting 11:45:16 (6072): No heartbeat from core client for 30 sec - exiting 11:45:17 (6072): No heartbeat from core client for 30 sec - exiting 11:45:18 (6072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Called boinc_finish CPDN Monitor - Quit request from BOINC... 04:30:16 (8664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:22:54 (5804): No heartbeat from core client for 30 sec - exiting 18:22:55 (5804): No heartbeat from core client for 30 sec - exiting 18:22:56 (5804): No heartbeat from core client for 30 sec - exiting 18:22:57 (5804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:38:00 (13128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 00:24:53 (6072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:24:54 (6072): No heartbeat from core client for 30 sec - exiting 00:24:55 (6072): No heartbeat from core client for 30 sec - exiting 00:24:56 (6072): No heartbeat from core client for 30 sec - exiting 00:24:57 (6072): No heartbeat from core client for 30 sec - exiting 00:24:58 (6072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 17:40:42 (8844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:41:30 (6876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:48:28 (5736): No heartbeat from core client for 30 sec - exiting 17:48:29 (5736): No heartbeat from core client for 30 sec - exiting 17:48:30 (5736): No heartbeat from core client for 30 sec - exiting 17:48:31 (5736): No heartbeat from core client for 30 sec - exiting 17:48:32 (5736): No heartbeat from core client for 30 sec - exiting 17:48:33 (5736): No heartbeat from core client for 30 sec - exiting 17:48:34 (5736): No heartbeat from core client for 30 sec - exiting 17:48:35 (5736): No heartbeat from core client for 30 sec - exiting 17:49:09 (5736): No heartbeat from core client for 30 sec - exiting 17:49:10 (5736): No heartbeat from core client for 30 sec - exiting 17:49:11 (5736): No heartbeat from core client for 30 sec - exiting 17:49:12 (5736): No heartbeat from core client for 30 sec - exiting 17:49:13 (5736): No heartbeat from core client for 30 sec - exiting 17:49:14 (5736): No heartbeat from core client for 30 sec - exiting 17:49:15 (5736): No heartbeat from core client for 30 sec - exiting 17:49:16 (5736): No heartbeat from core client for 30 sec - exiting 17:49:17 (5736): No heartbeat from core client for 30 sec - exiting 17:49:19 (5736): No heartbeat from core client for 30 sec - exiting 17:49:20 (5736): No heartbeat from core client for 30 sec - exiting 17:49:21 (5736): No heartbeat from core client for 30 sec - exiting 17:49:22 (5736): No heartbeat from core client for 30 sec - exiting 17:49:23 (5736): No heartbeat from core client for 30 sec - exiting 17:49:24 (5736): No heartbeat from core client for 30 sec - exiting 17:49:25 (5736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 07:33:06 (8916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:34:02 (4492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 23:23:46 (5064): No heartbeat from core client for 30 sec - exiting 23:23:47 (5064): No heartbeat from core client for 30 sec - exiting 23:23:48 (5064): No heartbeat from core client for 30 sec - exiting 23:23:49 (5064): No heartbeat from core client for 30 sec - exiting 23:23:50 (5064): No heartbeat from core client for 30 sec - exiting 23:24:24 (5064): No heartbeat from core client for 30 sec - exiting 23:24:25 (5064): No heartbeat from core client for 30 sec - exiting 23:24:26 (5064): No heartbeat from core client for 30 sec - exiting 23:24:27 (5064): No heartbeat from core client for 30 sec - exiting 23:24:28 (5064): No heartbeat from core client for 30 sec - exiting 23:24:29 (5064): No heartbeat from core client for 30 sec - exiting 23:24:30 (5064): No heartbeat from core client for 30 sec - exiting 23:24:31 (5064): No heartbeat from core client for 30 sec - exiting 23:24:32 (5064): No heartbeat from core client for 30 sec - exiting 23:24:33 (5064): No heartbeat from core client for 30 sec - exiting 23:24:34 (5064): No heartbeat from core client for 30 sec - exiting 23:24:35 (5064): No heartbeat from core client for 30 sec - exiting 23:24:36 (5064): No heartbeat from core client for 30 sec - exiting 23:24:37 (5064): No heartbeat from core client for 30 sec - exiting 23:24:38 (5064): No heartbeat from core client for 30 sec - exiting 23:24:39 (5064): No heartbeat from core client for 30 sec - exiting 23:24:40 (5064): No heartbeat from core client for 30 sec - exiting 23:24:41 (5064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:35:47 (6044): No heartbeat from core client for 30 sec - exiting 03:35:48 (6044): No heartbeat from core client for 30 sec - exiting 03:35:49 (6044): No heartbeat from core client for 30 sec - exiting 03:35:50 (6044): No heartbeat from core client for 30 sec - exiting 03:35:52 (6044): No heartbeat from core client for 30 sec - exiting 03:35:53 (6044): No heartbeat from core client for 30 sec - exiting 03:35:54 (6044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:03:19 (5812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8100, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8100, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Oct 2011 00:06:30 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 362,880 | 1,011,590 | 2.7877 |
13 Oct 2011 11:07:40 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 336,960 | 965,903 | 2.8665 |
12 Oct 2011 22:44:15 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 311,040 | 920,601 | 2.9598 |
12 Oct 2011 05:06:56 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 285,120 | 874,381 | 3.0667 |
11 Oct 2011 09:38:11 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 259,200 | 827,504 | 3.1925 |
10 Oct 2011 13:40:25 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 233,280 | 780,282 | 3.3448 |
10 Oct 2011 00:11:00 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 207,360 | 732,688 | 3.5334 |
09 Oct 2011 01:40:16 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 181,440 | 684,841 | 3.7745 |
30 Sep 2011 15:42:20 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 155,520 | 299,917 | 1.9285 |
30 Sep 2011 01:19:12 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 129,600 | 249,393 | 1.9243 |
29 Sep 2011 10:37:07 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 103,680 | 198,353 | 1.9131 |
28 Sep 2011 17:59:22 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 77,760 | 146,488 | 1.8838 |
26 Sep 2011 00:54:25 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 51,840 | 94,472 | 1.8224 |
09 Sep 2011 11:37:50 | 1164148 | 13318475 | hadcm3n_o6iz_1940_40_007432999_0 | 25,920 | 47,850 | 1.8461 |
©2024 cpdn.org