Name | hadcm3n_zh9n_1880_40_008254755_2 |
Workunit | 8409879 |
Created | 1 Mar 2013, 21:32:37 UTC |
Sent | 1 Mar 2013, 21:33:53 UTC |
Report deadline | 1 Jun 2013, 5:01:04 UTC |
Received | 6 Mar 2013, 7:10:03 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1270025 |
Run time | 2 days 20 hours 16 min 46 sec |
CPU time | 2 days 12 hours 50 min 22 sec |
Validate state | Invalid |
Credit | 1,555.20 |
Device peak FLOPS | 3.26 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:41:15 (16932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
20:42:03 (368): No heartbeat from core client for 30 sec - exiting 20:42:04 (368): No heartbeat from core client for 30 sec - exiting 20:42:05 (368): No heartbeat from core client for 30 sec - exiting 20:42:06 (368): No heartbeat from core client for 30 sec - exiting 20:42:07 (368): No heartbeat from core client for 30 sec - exiting 20:42:08 (368): No heartbeat from core client for 30 sec - exiting 20:42:09 (368): No heartbeat from core client for 30 sec - exiting 20:42:10 (368): No heartbeat from core client for 30 sec - exiting 20:42:11 (368): No heartbeat from core client for 30 sec - exiting 20:42:12 (368): No heartbeat from core client for 30 sec - exiting 20:42:13 (368): No heartbeat from core client for 30 sec - exiting 20:42:14 (368): No heartbeat from core client for 30 sec - exiting 20:42:15 (368): No heartbeat from core client for 30 sec - exiting 20:42:16 (368): No heartbeat from core client for 30 sec - exiting 20:42:17 (368): No heartbeat from core client for 30 sec - exiting 20:42:18 (368): No heartbeat from core client for 30 sec - exiting 20:42:19 (368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:42:20 (368): No heartbeat from core client for 30 sec - exiting 20:42:21 (368): No heartbeat from core client for 30 sec - exiting 20:42:22 (368): No heartbeat from core client for 30 sec - exiting 20:42:23 (368): No heartbeat from core client for 30 sec - exiting 20:42:24 (368): No heartbeat from core client for 30 sec - exiting 20:42:25 (368): No heartbeat from core client for 30 sec - exiting 20:42:26 (368): No heartbeat from core client for 30 sec - exiting 20:42:27 (368): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 20:43:25 (9264): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
23:16:12 (9032): No heartbeat from core client for 30 sec - exiting 23:16:13 (9032): No heartbeat from core client for 30 sec - exiting 23:16:14 (9032): No heartbeat from core client for 30 sec - exiting 23:16:15 (9032): No heartbeat from core client for 30 sec - exiting 23:16:16 (9032): No heartbeat from core client for 30 sec - exiting 23:16:17 (9032): No heartbeat from core client for 30 sec - exiting 23:16:18 (9032): No heartbeat from core client for 30 sec - exiting 23:16:19 (9032): No heartbeat from core client for 30 sec - exiting 23:16:20 (9032): No heartbeat from core client for 30 sec - exiting 23:16:21 (9032): No heartbeat from core client for 30 sec - exiting 23:16:22 (9032): No heartbeat from core client for 30 sec - exiting 23:16:23 (9032): No heartbeat from core client for 30 sec - exiting 23:16:24 (9032): No heartbeat from core client for 30 sec - exiting 23:16:25 (9032): No heartbeat from core client for 30 sec - exiting 23:16:26 (9032): No heartbeat from core client for 30 sec - exiting 23:16:27 (9032): No heartbeat from core client for 30 sec - exiting 23:16:28 (9032): No heartbeat from core client for 30 sec - exiting 23:16:29 (9032): No heartbeat from core client for 30 sec - exiting 23:16:30 (9032): No heartbeat from core client for 30 sec - exiting 23:16:31 (9032): No heartbeat from core client for 30 sec - exiting 23:16:32 (9032): No heartbeat from core client for 30 sec - exiting 23:16:33 (9032): No heartbeat from core client for 30 sec - exiting 23:16:34 (9032): No heartbeat from core client for 30 sec - exiting 23:16:35 (9032): No heartbeat from core client for 30 sec - exiting 23:16:36 (9032): No heartbeat from core client for 30 sec - exiting 23:16:37 (9032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:16:38 (9032): No heartbeat from core client for 30 sec - exiting 23:24:07 (9068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Signal 22 received, exiting... Called boinc_finish 23:31:18 (9140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8456, iMonCtr=1 Model crash detected, will try to restart... 23:32:00 (8456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1808, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1808, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1808, iMonCtr=1 Model crash detected, will try to restart... 23:32:50 (1808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8756, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish 23:33:35 (8756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5504, iMonCtr=1 Model crash detected, will try to restart... 
23:34:24 (5504): No heartbeat from core client for 30 sec - exiting 23:34:25 (5504): No heartbeat from core client for 30 sec - exiting 23:34:26 (5504): No heartbeat from core client for 30 sec - exiting 23:34:27 (5504): No heartbeat from core client for 30 sec - exiting 23:34:28 (5504): No heartbeat from core client for 30 sec - exiting 23:34:29 (5504): No heartbeat from core client for 30 sec - exiting 23:34:30 (5504): No heartbeat from core client for 30 sec - exiting 23:34:31 (5504): No heartbeat from core client for 30 sec - exiting 23:34:32 (5504): No heartbeat from core client for 30 sec - exiting 23:34:33 (5504): No heartbeat from core client for 30 sec - exiting 23:34:34 (5504): No heartbeat from core client for 30 sec - exiting 23:34:35 (5504): No heartbeat from core client for 30 sec - exiting 23:34:36 (5504): No heartbeat from core client for 30 sec - exiting 23:34:37 (5504): No heartbeat from core client for 30 sec - exiting 23:34:38 (5504): No heartbeat from core client for 30 sec - exiting 23:34:39 (5504): No heartbeat from core client for 30 sec - exiting 23:34:40 (5504): No heartbeat from core client for 30 sec - exiting 23:34:41 (5504): No heartbeat from core client for 30 sec - exiting 23:34:42 (5504): No heartbeat from core client for 30 sec - exiting 23:34:43 (5504): No heartbeat from core client for 30 sec - exiting 23:34:44 (5504): No heartbeat from core client for 30 sec - exiting 23:34:45 (5504): No heartbeat from core client for 30 sec - exiting 23:34:46 (5504): No heartbeat from core client for 30 sec - exiting 23:34:47 (5504): No heartbeat from core client for 30 sec - exiting 23:34:48 (5504): No heartbeat from core client for 30 sec - exiting 23:34:49 (5504): No heartbeat from core client for 30 sec - exiting 23:34:50 (5504): No heartbeat from core client for 30 sec - exiting 23:34:51 (5504): No heartbeat from core client for 30 sec - exiting 23:34:52 (5504): No heartbeat from core client for 30 sec - exiting 23:34:53 (5504): No 
heartbeat from core client for 30 sec - exiting 23:34:54 (5504): No heartbeat from core client for 30 sec - exiting 23:34:55 (5504): No heartbeat from core client for 30 sec - exiting 23:34:56 (5504): No heartbeat from core client for 30 sec - exiting 23:34:57 (5504): No heartbeat from core client for 30 sec - exiting 23:34:58 (5504): No heartbeat from core client for 30 sec - exiting 23:34:59 (5504): No heartbeat from core client for 30 sec - exiting 23:35:00 (5504): No heartbeat from core client for 30 sec - exiting 23:35:01 (5504): No heartbeat from core client for 30 sec - exiting 23:35:02 (5504): No heartbeat from core client for 30 sec - exiting 23:35:03 (5504): No heartbeat from core client for 30 sec - exiting 23:35:04 (5504): No heartbeat from core client for 30 sec - exiting 23:35:05 (5504): No heartbeat from core client for 30 sec - exiting 23:35:06 (5504): No heartbeat from core client for 30 sec - exiting 23:35:07 (5504): No heartbeat from core client for 30 sec - exiting 23:35:08 (5504): No heartbeat from core client for 30 sec - exiting 23:35:09 (5504): No heartbeat from core client for 30 sec - exiting 23:35:10 (5504): No heartbeat from core client for 30 sec - exiting 23:35:11 (5504): No heartbeat from core client for 30 sec - exiting 23:35:12 (5504): No heartbeat from core client for 30 sec - exiting 23:35:13 (5504): No heartbeat from core client for 30 sec - exiting 23:35:14 (5504): No heartbeat from core client for 30 sec - exiting 23:35:15 (5504): No heartbeat from core client for 30 sec - exiting 23:35:16 (5504): No heartbeat from core client for 30 sec - exiting 23:35:17 (5504): No heartbeat from core client for 30 sec - exiting 23:35:18 (5504): No heartbeat from core client for 30 sec - exiting 23:35:19 (5504): No heartbeat from core client for 30 sec - exiting 23:35:20 (5504): No heartbeat from core client for 30 sec - exiting 23:35:21 (5504): No heartbeat from core client for 30 sec - exiting 23:35:22 (5504): No heartbeat from core client 
for 30 sec - exiting 23:35:23 (5504): No heartbeat from core client for 30 sec - exiting 23:35:24 (5504): No heartbeat from core client for 30 sec - exiting 23:35:25 (5504): No heartbeat from core client for 30 sec - exiting 23:35:26 (5504): No heartbeat from core client for 30 sec - exiting 23:35:27 (5504): No heartbeat from core client for 30 sec - exiting 23:35:28 (5504): No heartbeat from core client for 30 sec - exiting 23:35:29 (5504): No heartbeat from core client for 30 sec - exiting 23:35:30 (5504): No heartbeat from core client for 30 sec - exiting 23:35:31 (5504): No heartbeat from core client for 30 sec - exiting 23:35:32 (5504): No heartbeat from core client for 30 sec - exiting 23:35:33 (5504): No heartbeat from core client for 30 sec - exiting 23:35:34 (5504): No heartbeat from core client for 30 sec - exiting 23:35:35 (5504): No heartbeat from core client for 30 sec - exiting 23:35:36 (5504): No heartbeat from core client for 30 sec - exiting 23:35:37 (5504): No heartbeat from core client for 30 sec - exiting 23:35:38 (5504): No heartbeat from core client for 30 sec - exiting 23:35:39 (5504): No heartbeat from core client for 30 sec - exiting 23:35:40 (5504): No heartbeat from core client for 30 sec - exiting 23:35:41 (5504): No heartbeat from core client for 30 sec - exiting 23:35:42 (5504): No heartbeat from core client for 30 sec - exiting 23:35:43 (5504): No heartbeat from core client for 30 sec - exiting 23:35:44 (5504): No heartbeat from core client for 30 sec - exiting 23:35:45 (5504): No heartbeat from core client for 30 sec - exiting 23:35:46 (5504): No heartbeat from core client for 30 sec - exiting 23:35:47 (5504): No heartbeat from core client for 30 sec - exiting 23:35:48 (5504): No heartbeat from core client for 30 sec - exiting 23:35:49 (5504): No heartbeat from core client for 30 sec - exiting 23:35:50 (5504): No heartbeat from core client for 30 sec - exiting 23:35:51 (5504): No heartbeat from core client for 30 sec - exiting 
23:35:52 (5504): No heartbeat from core client for 30 sec - exiting 23:35:53 (5504): No heartbeat from core client for 30 sec - exiting 23:35:54 (5504): No heartbeat from core client for 30 sec - exiting 23:35:55 (5504): No heartbeat from core client for 30 sec - exiting 23:35:56 (5504): No heartbeat from core client for 30 sec - exiting 23:35:57 (5504): No heartbeat from core client for 30 sec - exiting 23:35:58 (5504): No heartbeat from core client for 30 sec - exiting Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
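The stderr cell above is long and repetitive, so a tally of each message type is easier to read than the raw text. The following is a minimal sketch, not part of BOINC or CPDN tooling: the function names and the `PATTERNS` labels are hypothetical, and the substrings are copied from the log above. The `sample` string is a shortened excerpt assembled from messages in that log, not the full output.

```python
import re
from collections import Counter

# Substrings copied from the stderr text above; the labels are illustrative.
PATTERNS = {
    "suspend": "Suspend request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "model_crash": "Model crash detected",
    "too_many_crashes": "too many model crashes",
}


def summarize_stderr(text):
    """Count how often each known message appears in the stderr text."""
    counts = Counter()
    for label, needle in PATTERNS.items():
        counts[label] = text.count(needle)
    return counts


def heartbeat_pids(text):
    """Return the sorted, distinct process IDs that logged a missing heartbeat."""
    found = re.findall(r"\d{2}:\d{2}:\d{2} \((\d+)\): No heartbeat", text)
    return sorted({int(pid) for pid in found})


# Shortened excerpt built from messages that appear in the log above.
sample = (
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "20:41:15 (16932): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "20:42:03 (368): No heartbeat from core client for 30 sec - exiting "
    "Model crash detected, will try to restart... "
    "Sorry, too many model crashes! :-( Called boinc_finish"
)

summary = summarize_stderr(sample)
pids = heartbeat_pids(sample)
print(dict(summary))
print(pids)
```

Run against the full stderr text, the same functions would show how many distinct processes lost the heartbeat before the task gave up with "too many model crashes".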
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
04 Mar 2013 14:35:21 | 1270025 | 15644483 | hadcm3n_zh9n_1880_40_008254755_2 | 129,600 | 203,524 | 1.5704 |
04 Mar 2013 00:20:53 | 1270025 | 15644483 | hadcm3n_zh9n_1880_40_008254755_2 | 103,680 | 163,260 | 1.5747 |
03 Mar 2013 12:50:02 | 1270025 | 15644483 | hadcm3n_zh9n_1880_40_008254755_2 | 77,760 | 122,736 | 1.5784 |
02 Mar 2013 22:43:18 | 1270025 | 15644483 | hadcm3n_zh9n_1880_40_008254755_2 | 51,840 | 81,657 | 1.5752 |
02 Mar 2013 10:13:53 | 1270025 | 15644483 | hadcm3n_zh9n_1880_40_008254755_2 | 25,920 | 40,566 | 1.5650 |
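The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the five rows above. The variable and function names are illustrative, and the division formula is an inference from the numbers, not something the page states.

```python
# Rows copied from the trickle table, oldest first:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
trickles = [
    (25_920, 40_566, 1.5650),
    (51_840, 81_657, 1.5752),
    (77_760, 122_736, 1.5784),
    (103_680, 163_260, 1.5747),
    (129_600, 203_524, 1.5704),
]


def sec_per_timestep(timestep, cpu_seconds):
    """Assumed formula: cumulative CPU seconds divided by timesteps completed."""
    return cpu_seconds / timestep


# Compare the computed value with the reported one, to four decimal places.
results = []
for timestep, cpu_seconds, reported in trickles:
    computed = round(sec_per_timestep(timestep, cpu_seconds), 4)
    results.append((timestep, computed, reported))
    print(f"{timestep:>7} timesteps: computed {computed:.4f}, reported {reported:.4f}")
```

For these five rows the computed values agree with the reported averages to four decimal places, which supports the assumed formula without confirming how the server actually derives the column.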