Name | hadcm3n_8akf_1980_40_008723114_4 |
Workunit | 8869092 |
Created | 15 Jul 2014, 17:47:00 UTC |
Sent | 15 Jul 2014, 17:49:16 UTC |
Report deadline | 15 Oct 2014, 1:16:27 UTC |
Received | 14 Aug 2014, 14:53:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1089735 |
Run time | 3 days 4 hours 47 min 33 sec |
CPU time | 2 days 17 hours 16 min 4 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 1.05 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 18:42:02 (3376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:42:03 (3376): No heartbeat from core client for 30 sec - exiting 18:42:04 (3376): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 07:13:56 (3128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:13:58 (3128): No heartbeat from core client for 30 sec - exiting 07:13:59 (3128): No heartbeat from core client for 30 sec - exiting 07:24:12 (2556): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:24:14 (2556): No heartbeat from core client for 30 sec - exiting 07:24:16 (2556): No heartbeat from core client for 30 sec - exiting 07:24:17 (2556): No heartbeat from core client for 30 sec - exiting 07:47:43 (3780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
07:47:45 (3780): No heartbeat from core client for 30 sec - exiting 07:47:46 (3780): No heartbeat from core client for 30 sec - exiting 07:48:30 (1568): No heartbeat from core client for 30 sec - exiting 07:48:31 (1568): No heartbeat from core client for 30 sec - exiting 07:48:32 (1568): No heartbeat from core client for 30 sec - exiting 07:48:33 (1568): No heartbeat from core client for 30 sec - exiting 07:48:34 (1568): No heartbeat from core client for 30 sec - exiting 07:48:35 (1568): No heartbeat from core client for 30 sec - exiting 07:48:37 (1568): No heartbeat from core client for 30 sec - exiting 07:48:38 (1568): No heartbeat from core client for 30 sec - exiting 07:48:39 (1568): No heartbeat from core client for 30 sec - exiting 07:48:40 (1568): No heartbeat from core client for 30 sec - exiting 07:48:41 (1568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:19:32 (2332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:19:33 (2332): No heartbeat from core client for 30 sec - exiting 08:19:34 (2332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 13:50:31 (3028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
13:50:33 (3028): No heartbeat from core client for 30 sec - exiting 13:50:35 (3028): No heartbeat from core client for 30 sec - exiting 13:51:52 (840): No heartbeat from core client for 30 sec - exiting 13:51:53 (840): No heartbeat from core client for 30 sec - exiting 13:51:55 (840): No heartbeat from core client for 30 sec - exiting 13:51:56 (840): No heartbeat from core client for 30 sec - exiting 13:51:57 (840): No heartbeat from core client for 30 sec - exiting 13:51:58 (840): No heartbeat from core client for 30 sec - exiting 13:51:59 (840): No heartbeat from core client for 30 sec - exiting 13:52:00 (840): No heartbeat from core client for 30 sec - exiting 13:52:01 (840): No heartbeat from core client for 30 sec - exiting 13:52:02 (840): No heartbeat from core client for 30 sec - exiting 13:52:03 (840): No heartbeat from core client for 30 sec - exiting 13:52:04 (840): No heartbeat from core client for 30 sec - exiting 13:52:06 (840): No heartbeat from core client for 30 sec - exiting 13:52:07 (840): No heartbeat from core client for 30 sec - exiting 13:52:08 (840): No heartbeat from core client for 30 sec - exiting 13:52:09 (840): No heartbeat from core client for 30 sec - exiting 13:52:10 (840): No heartbeat from core client for 30 sec - exiting 13:52:11 (840): No heartbeat from core client for 30 sec - exiting 13:52:12 (840): No heartbeat from core client for 30 sec - exiting 13:52:13 (840): No heartbeat from core client for 30 sec - exiting 13:52:14 (840): No heartbeat from core client for 30 sec - exiting 13:52:15 (840): No heartbeat from core client for 30 sec - exiting 13:52:16 (840): No heartbeat from core client for 30 sec - exiting 13:52:18 (840): No heartbeat from core client for 30 sec - exiting 13:52:19 (840): No heartbeat from core client for 30 sec - exiting 13:52:21 (840): No heartbeat from core client for 30 sec - exiting 13:52:22 (840): No heartbeat from core client for 30 sec - exiting 13:52:23 (840): No heartbeat from core client for 30 
sec - exiting 13:52:24 (840): No heartbeat from core client for 30 sec - exiting 13:52:25 (840): No heartbeat from core client for 30 sec - exiting 13:52:26 (840): No heartbeat from core client for 30 sec - exiting 13:52:27 (840): No heartbeat from core client for 30 sec - exiting 13:52:28 (840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:52:29 (840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:49:24 (3128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:49:26 (3128): No heartbeat from core client for 30 sec - exiting 10:49:27 (3128): No heartbeat from core client for 30 sec - exiting 10:50:21 (2148): No heartbeat from core client for 30 sec - exiting 10:50:23 (2148): No heartbeat from core client for 30 sec - exiting 10:50:24 (2148): No heartbeat from core client for 30 sec - exiting 10:50:25 (2148): No heartbeat from core client for 30 sec - exiting 10:50:26 (2148): No heartbeat from core client for 30 sec - exiting 10:50:27 (2148): No heartbeat from core client for 30 sec - exiting 10:50:28 (2148): No heartbeat from core client for 30 sec - exiting 10:50:29 (2148): No heartbeat from core client for 30 sec - exiting 10:50:30 (2148): No heartbeat from core client for 30 sec - exiting 10:50:31 (2148): No heartbeat from core client for 30 sec - exiting 10:50:32 (2148): No heartbeat from core client for 30 sec - exiting 10:50:33 (2148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
08:02:16 (644): No heartbeat from core client for 30 sec - exiting 08:02:17 (644): No heartbeat from core client for 30 sec - exiting 08:02:18 (644): No heartbeat from core client for 30 sec - exiting 08:02:19 (644): No heartbeat from core client for 30 sec - exiting 08:02:20 (644): No heartbeat from core client for 30 sec - exiting 08:02:21 (644): No heartbeat from core client for 30 sec - exiting 08:02:22 (644): No heartbeat from core client for 30 sec - exiting 08:02:23 (644): No heartbeat from core client for 30 sec - exiting 08:02:24 (644): No heartbeat from core client for 30 sec - exiting 08:02:26 (644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:02:27 (644): No heartbeat from core client for 30 sec - exiting 08:03:10 (2632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:12:59 (2700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3076, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3076, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3076, iMonCtr=1 Model crash detected, will try to restart... 
CPDN Monitor - Quit request from BOINC... 13:22:27 (4088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:22:29 (4088): No heartbeat from core client for 30 sec - exiting 13:23:32 (2716): No heartbeat from core client for 30 sec - exiting 13:23:34 (2716): No heartbeat from core client for 30 sec - exiting 13:23:35 (2716): No heartbeat from core client for 30 sec - exiting 13:23:36 (2716): No heartbeat from core client for 30 sec - exiting 13:23:37 (2716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2916, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2916, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2916, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
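The stderr text above repeats a small number of message types many times, so a tally is easier to read than the raw log. The following is a minimal sketch, not part of CPDN or BOINC: the function name `tally_stderr`, the pattern keys, and the `sample` string are hypothetical. Only the message phrases themselves are copied from the log.

```python
import re
from collections import Counter

# Hypothetical pattern table; the phrases are copied from the stderr text,
# the key names are made up for this sketch.
PATTERNS = {
    "no_heartbeat": re.compile(
        r"\(\d+\): No heartbeat from core client for 30 sec - exiting"
    ),
    "quit_request": re.compile(r"CPDN Monitor - Quit request from BOINC"),
    "model_crash": re.compile(r"Model crash detected, will try to restart"),
}


def tally_stderr(text: str) -> Counter:
    """Count how often each known message type occurs in a stderr dump."""
    # Collapse all whitespace so a message wrapped across two lines
    # (as happens once in the log above) still matches.
    flat = " ".join(text.split())
    return Counter({name: len(p.findall(flat)) for name, p in PATTERNS.items()})


# Short illustrative excerpt in the style of the log, including one wrapped message.
sample = (
    "13:22:27 (4088): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "13:22:29 (4088): No heartbeat from core client for 30 \n"
    "sec - exiting "
    "CPDN Monitor - Quit request from BOINC... "
    "Model crash detected, will try to restart... "
)

counts = tally_stderr(sample)
print(counts["no_heartbeat"], counts["quit_request"], counts["model_crash"])
```

Running the same function over the full stderr cell would give per-type totals for this task; the counts printed here describe only the short sample.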
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
18 Jul 2014 22:32:29 | 1089735 | 16809702 | hadcm3n_8akf_1980_40_008723114_4 | 25,920 | 156,533 | 6.0391 |
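The Average (sec/TS) column appears to be CPU time divided by the timestep count. The short sketch below checks that assumption against the single trickle row above; the helper `parse_int` is hypothetical and only strips the thousands separators used in the table.

```python
def parse_int(field: str) -> int:
    """Convert a table field such as '25,920' to an integer."""
    return int(field.replace(",", ""))


timestep = parse_int("25,920")      # Timestep column
cpu_time_sec = parse_int("156,533")  # CPU Time (sec) column

# Assumed definition: average seconds of CPU time per model timestep.
average_sec_per_ts = cpu_time_sec / timestep
print(f"{average_sec_per_ts:.4f}")  # 6.0391, matching the table
```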