Name | hadcm3n_o9au_1900_40_008467177_1 |
Workunit | 8618016 |
Created | 7 Oct 2013, 3:55:11 UTC |
Sent | 7 Oct 2013, 4:06:22 UTC |
Report deadline | 6 Jan 2014, 11:33:33 UTC |
Received | 23 Oct 2013, 8:10:35 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1275636 |
Run time | 22 hours 20 min 1 sec |
CPU time | 22 hours 13 min |
Validate state | Invalid |
Credit | 933.12 |
Device peak FLOPS | 3.06 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 06:27:54 (5748): No heartbeat from core client for 30 sec - exiting 06:27:55 (5748): No heartbeat from core client for 30 sec - exiting 06:27:56 (5748): No heartbeat from core client for 30 sec - exiting 06:27:57 (5748): No heartbeat from core client for 30 sec - exiting 06:27:58 (5748): No heartbeat from core client for 30 sec - exiting 06:27:59 (5748): No heartbeat from core client for 30 sec - exiting 06:28:00 (5748): No heartbeat from core client for 30 sec - exiting 06:28:01 (5748): No heartbeat from core client for 30 sec - exiting 06:28:02 (5748): No heartbeat from core client for 30 sec - exiting 06:28:03 (5748): No heartbeat from core client for 30 sec - exiting 06:28:04 (5748): No heartbeat from core client for 30 sec - exiting 06:28:05 (5748): No heartbeat from core client for 30 sec - exiting 06:28:06 (5748): No heartbeat from core client for 30 sec - exiting 06:28:07 (5748): No heartbeat from core client for 30 sec - exiting 06:28:08 (5748): No heartbeat from core client for 30 sec - exiting 06:28:09 (5748): No heartbeat from core client for 30 sec - exiting 06:28:10 (5748): No heartbeat from core client for 30 sec - exiting 06:28:11 (5748): No heartbeat from core client for 30 sec - exiting 06:28:12 (5748): No heartbeat from core client for 30 sec - exiting 06:28:13 (5748): No heartbeat from core client for 30 sec - exiting 06:28:14 (5748): No heartbeat from core client for 30 sec - exiting 06:28:15 (5748): No heartbeat from core client for 30 sec - exiting 06:28:16 (5748): No heartbeat from core client for 30 sec - exiting 06:28:17 (5748): No heartbeat from core client for 30 sec - exiting 06:28:18 (5748): No heartbeat from core client for 30 sec - exiting 06:28:19 (5748): No heartbeat from core client for 30 sec - exiting 06:28:20 (5748): No heartbeat from core client for 30 sec - 
exiting 06:28:21 (5748): No heartbeat from core client for 30 sec - exiting 06:28:22 (5748): No heartbeat from core client for 30 sec - exiting 06:28:23 (5748): No heartbeat from core client for 30 sec - exiting 06:28:24 (5748): No heartbeat from core client for 30 sec - exiting 06:28:25 (5748): No heartbeat from core client for 30 sec - exiting 06:28:26 (5748): No heartbeat from core client for 30 sec - exiting 06:28:27 (5748): No heartbeat from core client for 30 sec - exiting 06:28:28 (5748): No heartbeat from core client for 30 sec - exiting 06:28:29 (5748): No heartbeat from core client for 30 sec - exiting 06:28:30 (5748): No heartbeat from core client for 30 sec - exiting 06:28:31 (5748): No heartbeat from core client for 30 sec - exiting 06:28:32 (5748): No heartbeat from core client for 30 sec - exiting 06:28:33 (5748): No heartbeat from core client for 30 sec - exiting 06:28:34 (5748): No heartbeat from core client for 30 sec - exiting 06:28:35 (5748): No heartbeat from core client for 30 sec - exiting 06:28:36 (5748): No heartbeat from core client for 30 sec - exiting 06:28:37 (5748): No heartbeat from core client for 30 sec - exiting 06:28:38 (5748): No heartbeat from core client for 30 sec - exiting 06:28:39 (5748): No heartbeat from core client for 30 sec - exiting 06:28:40 (5748): No heartbeat from core client for 30 sec - exiting 06:28:41 (5748): No heartbeat from core client for 30 sec - exiting 06:28:42 (5748): No heartbeat from core client for 30 sec - exiting 06:28:43 (5748): No heartbeat from core client for 30 sec - exiting 06:28:44 (5748): No heartbeat from core client for 30 sec - exiting 06:28:45 (5748): No heartbeat from core client for 30 sec - exiting 06:28:46 (5748): No heartbeat from core client for 30 sec - exiting 06:28:47 (5748): No heartbeat from core client for 30 sec - exiting 06:28:48 (5748): No heartbeat from core client for 30 sec - exiting 06:28:49 (5748): No heartbeat from core client for 30 sec - exiting 06:28:50 (5748): No 
heartbeat from core client for 30 sec - exiting 06:28:51 (5748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:30:22 (7420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:31:04 (6784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:20:36 (4548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:51:32 (9624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
23:32:28 (1688): No heartbeat from core client for 30 sec - exiting 23:32:29 (1688): No heartbeat from core client for 30 sec - exiting 23:32:30 (1688): No heartbeat from core client for 30 sec - exiting 23:32:31 (1688): No heartbeat from core client for 30 sec - exiting 23:32:32 (1688): No heartbeat from core client for 30 sec - exiting 23:32:33 (1688): No heartbeat from core client for 30 sec - exiting 23:32:34 (1688): No heartbeat from core client for 30 sec - exiting 23:32:35 (1688): No heartbeat from core client for 30 sec - exiting 23:32:36 (1688): No heartbeat from core client for 30 sec - exiting 23:32:37 (1688): No heartbeat from core client for 30 sec - exiting 23:32:38 (1688): No heartbeat from core client for 30 sec - exiting 23:32:39 (1688): No heartbeat from core client for 30 sec - exiting 23:32:40 (1688): No heartbeat from core client for 30 sec - exiting 23:32:41 (1688): No heartbeat from core client for 30 sec - exiting 23:32:42 (1688): No heartbeat from core client for 30 sec - exiting 23:32:43 (1688): No heartbeat from core client for 30 sec - exiting 23:32:44 (1688): No heartbeat from core client for 30 sec - exiting 23:32:45 (1688): No heartbeat from core client for 30 sec - exiting 23:32:46 (1688): No heartbeat from core client for 30 sec - exiting 23:32:47 (1688): No heartbeat from core client for 30 sec - exiting 23:32:48 (1688): No heartbeat from core client for 30 sec - exiting 23:32:49 (1688): No heartbeat from core client for 30 sec - exiting 23:32:50 (1688): No heartbeat from core client for 30 sec - exiting 23:32:51 (1688): No heartbeat from core client for 30 sec - exiting 23:32:52 (1688): No heartbeat from core client for 30 sec - exiting 23:32:53 (1688): No heartbeat from core client for 30 sec - exiting 23:32:54 (1688): No heartbeat from core client for 30 sec - exiting 23:32:55 (1688): No heartbeat from core client for 30 sec - exiting 23:32:56 (1688): No heartbeat from core client for 30 sec - exiting 23:32:57 (1688): No 
heartbeat from core client for 30 sec - exiting 23:32:58 (1688): No heartbeat from core client for 30 sec - exiting 23:32:59 (1688): No heartbeat from core client for 30 sec - exiting 23:33:00 (1688): No heartbeat from core client for 30 sec - exiting 23:33:01 (1688): No heartbeat from core client for 30 sec - exiting 23:33:02 (1688): No heartbeat from core client for 30 sec - exiting 23:33:03 (1688): No heartbeat from core client for 30 sec - exiting 23:33:04 (1688): No heartbeat from core client for 30 sec - exiting 23:33:05 (1688): No heartbeat from core client for 30 sec - exiting 23:33:06 (1688): No heartbeat from core client for 30 sec - exiting 23:33:07 (1688): No heartbeat from core client for 30 sec - exiting 23:33:08 (1688): No heartbeat from core client for 30 sec - exiting 23:33:09 (1688): No heartbeat from core client for 30 sec - exiting 23:33:10 (1688): No heartbeat from core client for 30 sec - exiting 23:33:11 (1688): No heartbeat from core client for 30 sec - exiting 23:33:12 (1688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:26:24 (11792): No heartbeat from core client for 30 sec - exiting 15:26:25 (11792): No heartbeat from core client for 30 sec - exiting 15:26:26 (11792): No heartbeat from core client for 30 sec - exiting 15:26:27 (11792): No heartbeat from core client for 30 sec - exiting 15:26:28 (11792): No heartbeat from core client for 30 sec - exiting 15:26:29 (11792): No heartbeat from core client for 30 sec - exiting 15:26:30 (11792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5652, iMonCtr=1 Model crash detected, will try to restart... 
Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5652, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 03:02:02 (5652): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6360, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
13 Oct 2013 23:52:34 | 1275636 | 16059666 | hadcm3n_o9au_1900_40_008467177_1 | 77,760 | 79,777 | 1.0259 |
12 Oct 2013 21:39:31 | 1275636 | 16059666 | hadcm3n_o9au_1900_40_008467177_1 | 51,840 | 49,440 | 0.9537 |
11 Oct 2013 02:11:46 | 1275636 | 16059666 | hadcm3n_o9au_1900_40_008467177_1 | 25,920 | 21,106 | 0.8143 |
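The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count. The page does not state this; it is an assumption inferred from the three rows above. The following minimal sketch recomputes the column under that assumption. The function name `seconds_per_timestep` and the `trickles` list are hypothetical names introduced for illustration, and the values are copied from the table.

```python
# Trickle rows copied from the table above:
# (timestep, cumulative CPU time in seconds, average reported on the page)
trickles = [
    (77_760, 79_777, 1.0259),
    (51_840, 49_440, 0.9537),
    (25_920, 21_106, 0.8143),
]


def seconds_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds / cumulative timesteps,
    rounded to four decimal places as displayed on the page."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in trickles:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the reported one for comparison.
    print(f"timestep={timestep:>6}  computed={computed:.4f}  reported={reported:.4f}")
```

Under this assumption the recomputed values agree with the reported column for all three rows, which suggests the per-timestep cost rose from about 0.81 s to about 1.03 s over the course of the run.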
©2024 cpdn.org