Name | hadcm3n_ye5l_1900_40_007351203_1 |
Workunit | 7548633 |
Created | 6 Jul 2011, 14:11:57 UTC |
Sent | 16 Jul 2011, 11:38:06 UTC |
Report deadline | 15 Oct 2011, 19:05:17 UTC |
Received | 31 Oct 2011, 16:52:56 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 978576 |
Run time | 23 days 0 hours 33 min 9 sec |
CPU time | 19 days 17 hours 47 min 47 sec |
Validate state | Invalid |
Credit | 4,665.60 |
Device peak FLOPS | 1.80 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.6.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:45:14 (520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:49:11 (6080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:29:32 (6048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:18:39 (5512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:13:21 (5408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:14:23 (5200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:07:22 (3660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:07:23 (3660): No heartbeat from core client for 30 sec - exiting 14:17:07 (5312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:17:48 (4996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1264, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=1 Model crash detected, will try to restart... 18:06:52 (1756): No heartbeat from core client for 30 sec - exiting 18:06:53 (1756): No heartbeat from core client for 30 sec - exiting 18:06:54 (1756): No heartbeat from core client for 30 sec - exiting 18:06:55 (1756): No heartbeat from core client for 30 sec - exiting 18:06:56 (1756): No heartbeat from core client for 30 sec - exiting 18:06:57 (1756): No heartbeat from core client for 30 sec - exiting 18:06:58 (1756): No heartbeat from core client for 30 sec - exiting 18:06:59 (1756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 06:45:16 (1756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:37:37 (4804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:59:25 (3248): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:50:44 (5928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:22:48 (5544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:54:42 (5116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:54:43 (5116): No heartbeat from core client for 30 sec - exiting 22:54:44 (5116): No heartbeat from core client for 30 sec - exiting 14:03:58 (5528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:03:59 (5528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 15:09:20 (1276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:29:37 (5412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:37:35 (1280): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:39:35 (5616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:39:36 (5616): No heartbeat from core client for 30 sec - exiting 19:48:10 (4600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:35:54 (4676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:38:27 (3944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:38:28 (3944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 20:43:48 (4684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 05:01:35 (5360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 15:34:21 (4492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:12:35 (5624): No heartbeat from core client for 30 sec - exiting 20:12:36 (5624): No heartbeat from core client for 30 sec - exiting 20:12:43 (5624): No heartbeat from core client for 30 sec - exiting 20:12:44 (5624): No heartbeat from core client for 30 sec - exiting 20:12:45 (5624): No heartbeat from core client for 30 sec - exiting 20:12:46 (5624): No heartbeat from core client for 30 sec - exiting 20:12:48 (5624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:12:49 (5624): No heartbeat from core client for 30 sec - exiting 20:12:50 (5624): No heartbeat from core client for 30 sec - exiting 20:12:51 (5624): No heartbeat from core client for 30 sec - exiting 20:12:52 (5624): No heartbeat from core client for 30 sec - exiting 20:12:54 (5624): No heartbeat from core client for 30 sec - exiting 20:12:55 (5624): No heartbeat from core client for 30 sec - exiting 20:12:56 (5624): No heartbeat from core client for 30 sec - exiting 20:12:58 (5624): No heartbeat from core client for 30 sec - exiting 20:12:59 (5624): No heartbeat from core client for 30 sec - exiting 20:13:00 (5624): No heartbeat from core client for 30 sec - exiting 20:13:01 (5624): No heartbeat from core client for 30 sec - exiting 20:13:02 (5624): No heartbeat from core client for 30 sec - exiting 20:13:03 (5624): No heartbeat from core client for 30 sec - exiting 20:13:04 (5624): No heartbeat from core client for 30 sec - exiting 20:13:05 (5624): No heartbeat from core client for 30 sec - exiting 20:13:06 (5624): No heartbeat from core client for 30 sec - exiting 20:13:07 (5624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 15:16:03 (5168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:16:04 (5168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 21:30:09 (5768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:40:41 (4608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:42:05 (2316): No heartbeat from core client for 30 sec - exiting 18:42:06 (2316): No heartbeat from core client for 30 sec - exiting 18:42:07 (2316): No heartbeat from core client for 30 sec - exiting 18:42:08 (2316): No heartbeat from core client for 30 sec - exiting 18:42:09 (2316): No heartbeat from core client for 30 sec - exiting 18:42:10 (2316): No heartbeat from core client for 30 sec - exiting 18:42:11 (2316): No heartbeat from core client for 30 sec - exiting 18:42:13 (2316): No heartbeat from core client for 30 sec - exiting 18:42:14 (2316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:50:38 (1080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:33:39 (2436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:12:25 (5068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:17:30 (5368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:59:41 (4124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:59:42 (4124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 16:50:26 (6112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:19:43 (5644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 00:21:16 (5376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:23:47 (5544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:01:59 (6000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 09:22:55 (3012): No heartbeat from core client for 30 sec - exiting 09:22:56 (3012): No heartbeat from core client for 30 sec - exiting 09:22:57 (3012): No heartbeat from core client for 30 sec - exiting 09:22:58 (3012): No heartbeat from core client for 30 sec - exiting 09:22:59 (3012): No heartbeat from core client for 30 sec - exiting 09:23:00 (3012): No heartbeat from core client for 30 sec - exiting 09:23:01 (3012): No heartbeat from core client for 30 sec - exiting 09:23:02 (3012): No heartbeat from core client for 30 sec - exiting 09:23:03 (3012): No heartbeat from core client for 30 sec - exiting 09:23:04 (3012): No heartbeat from core client for 30 sec - exiting 09:23:05 (3012): No heartbeat from core client for 30 sec - exiting 09:23:07 (3012): No heartbeat from core client for 30 sec - exiting 09:23:08 (3012): No heartbeat from core client for 30 sec - exiting 09:23:09 (3012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:23:10 (3012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 21:16:59 (3200): No heartbeat from core client for 30 sec - exiting 21:17:00 (3200): No heartbeat from core client for 30 sec - exiting 21:17:01 (3200): No heartbeat from core client for 30 sec - exiting 21:17:02 (3200): No heartbeat from core client for 30 sec - exiting 21:17:03 (3200): No heartbeat from core client for 30 sec - exiting 21:17:05 (3200): No heartbeat from core client for 30 sec - exiting 21:17:06 (3200): No heartbeat from core client for 30 sec - exiting 21:17:07 (3200): No heartbeat from core client for 30 sec - exiting 21:17:08 (3200): No heartbeat from core client for 30 sec - exiting 21:17:09 (3200): No heartbeat from core client for 30 sec - exiting 21:17:10 (3200): No heartbeat from core client for 30 sec - exiting 21:17:11 (3200): No heartbeat from core client for 30 sec - exiting 21:17:13 (3200): No heartbeat from core client for 30 sec - exiting 21:17:14 (3200): No heartbeat from core client for 30 sec - exiting 21:17:15 (3200): No heartbeat from core client for 30 sec - exiting 21:17:16 (3200): No heartbeat from core client for 30 sec - exiting 21:17:17 (3200): No heartbeat from core client for 30 sec - exiting 21:17:18 (3200): No heartbeat from core client for 30 sec - exiting 21:17:19 (3200): No heartbeat from core client for 30 sec - exiting 21:17:20 (3200): No heartbeat from core client for 30 sec - exiting 21:17:21 (3200): No heartbeat from core client for 30 sec - exiting 21:17:22 (3200): No heartbeat from core client for 30 sec - exiting 21:17:24 (3200): No heartbeat from core client for 30 sec - exiting 21:17:25 (3200): No heartbeat from core client for 30 sec - exiting 21:17:26 (3200): No heartbeat from core client for 30 sec - exiting 21:17:27 (3200): No heartbeat from core client for 30 sec - exiting 21:17:28 (3200): No heartbeat from core client for 30 sec - exiting 21:17:29 (3200): No heartbeat from core client for 30 sec - exiting 21:17:30 (3200): No heartbeat from core client for 30 sec - exiting 21:17:31 (3200): No heartbeat from core client for 30 sec - exiting 21:17:32 (3200): No heartbeat from core client for 30 sec - exiting 21:17:33 (3200): No heartbeat from core client for 30 sec - exiting 21:17:34 (3200): No heartbeat from core client for 30 sec - exiting 21:17:36 (3200): No heartbeat from core client for 30 sec - exiting 21:17:37 (3200): No heartbeat from core client for 30 sec - exiting 21:17:38 (3200): No heartbeat from core client for 30 sec - exiting 21:17:39 (3200): No heartbeat from core client for 30 sec - exiting 21:17:40 (3200): No heartbeat from core client for 30 sec - exiting 21:17:41 (3200): No heartbeat from core client for 30 sec - exiting 21:17:42 (3200): No heartbeat from core client for 30 sec - exiting 21:17:43 (3200): No heartbeat from core client for 30 sec - exiting 21:17:44 (3200): No heartbeat from core client for 30 sec - exiting 21:17:45 (3200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1760, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1760, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1760, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1760, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1760, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1760, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
31 Oct 2011 15:32:29 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 388,800 | 1,676,149 | 4.3111 |
17 Oct 2011 21:40:47 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 362,880 | 1,566,609 | 4.3172 |
13 Oct 2011 10:21:37 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 336,960 | 1,457,579 | 4.3257 |
08 Oct 2011 01:45:42 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 311,040 | 1,337,572 | 4.3003 |
30 Sep 2011 07:14:36 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 285,120 | 1,233,244 | 4.3254 |
21 Sep 2011 21:30:28 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 259,200 | 1,123,364 | 4.3340 |
15 Sep 2011 23:36:04 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 233,280 | 1,007,385 | 4.3184 |
06 Sep 2011 12:15:38 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 207,360 | 897,452 | 4.3280 |
30 Aug 2011 06:38:27 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 181,440 | 787,451 | 4.3400 |
26 Aug 2011 02:39:41 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 155,520 | 676,207 | 4.3480 |
22 Aug 2011 21:39:24 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 129,600 | 558,520 | 4.3096 |
12 Aug 2011 16:19:36 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 103,680 | 447,909 | 4.3201 |
07 Aug 2011 02:22:20 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 77,760 | 334,700 | 4.3043 |
30 Jul 2011 02:03:19 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 51,840 | 220,429 | 4.2521 |
25 Jul 2011 21:07:53 | 978576 | 13106276 | hadcm3n_ye5l_1900_40_007351203_1 | 25,920 | 110,544 | 4.2648 |
©2024 cpdn.org