Name | hadcm3n_ykel_1940_40_007542971_1 |
Workunit | 7740203 |
Created | 9 Nov 2011, 23:44:19 UTC |
Sent | 16 Nov 2011, 15:27:16 UTC |
Report deadline | 15 Feb 2012, 22:54:27 UTC |
Received | 19 Nov 2011, 22:44:53 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1141348 |
Run time | 2 days 22 hours 53 min 29 sec |
CPU time | 2 days 19 hours 45 min 17 sec |
Validate state | Invalid |
Credit | 2,177.28 |
Device peak FLOPS | 2.98 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:57:10 (4644): No heartbeat from core client for 30 sec - exiting
14:57:11 (4644): No heartbeat from core client for 30 sec - exiting
14:57:12 (4644): No heartbeat from core client for 30 sec - exiting
14:57:13 (4644): No heartbeat from core client for 30 sec - exiting
14:57:14 (4644): No heartbeat from core client for 30 sec - exiting
14:57:15 (4644): No heartbeat from core client for 30 sec - exiting
14:57:16 (4644): No heartbeat from core client for 30 sec - exiting
14:57:17 (4644): No heartbeat from core client for 30 sec - exiting
14:57:18 (4644): No heartbeat from core client for 30 sec - exiting
14:57:19 (4644): No heartbeat from core client for 30 sec - exiting
14:57:20 (4644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:08:00 (3460): No heartbeat from core client for 30 sec - exiting
17:08:01 (3460): No heartbeat from core client for 30 sec - exiting
17:08:02 (3460): No heartbeat from core client for 30 sec - exiting
17:08:03 (3460): No heartbeat from core client for 30 sec - exiting
17:08:04 (3460): No heartbeat from core client for 30 sec - exiting
17:08:05 (3460): No heartbeat from core client for 30 sec - exiting
17:08:06 (3460): No heartbeat from core client for 30 sec - exiting
17:08:07 (3460): No heartbeat from core client for 30 sec - exiting
17:08:08 (3460): No heartbeat from core client for 30 sec - exiting
17:08:09 (3460): No heartbeat from core client for 30 sec - exiting
17:08:10 (3460): No heartbeat from core client for 30 sec - exiting
17:08:11 (3460): No heartbeat from core client for 30 sec - exiting
17:08:12 (3460): No heartbeat from core client for 30 sec - exiting
17:08:13 (3460): No heartbeat from core client for 30 sec - exiting
17:08:14 (3460): No heartbeat from core client for 30 sec - exiting
17:08:15 (3460): No heartbeat from core client for 30 sec - exiting
17:08:16 (3460): No heartbeat from core client for 30 sec - exiting
17:08:17 (3460): No heartbeat from core client for 30 sec - exiting
17:08:18 (3460): No heartbeat from core client for 30 sec - exiting
17:08:19 (3460): No heartbeat from core client for 30 sec - exiting
17:08:20 (3460): No heartbeat from core client for 30 sec - exiting
17:08:21 (3460): No heartbeat from core client for 30 sec - exiting
17:08:22 (3460): No heartbeat from core client for 30 sec - exiting
17:08:23 (3460): No heartbeat from core client for 30 sec - exiting
17:08:24 (3460): No heartbeat from core client for 30 sec - exiting
17:08:25 (3460): No heartbeat from core client for 30 sec - exiting
17:08:26 (3460): No heartbeat from core client for 30 sec - exiting
17:08:27 (3460): No heartbeat from core client for 30 sec - exiting
17:08:28 (3460): No heartbeat from core client for 30 sec - exiting
17:08:29 (3460): No heartbeat from core client for 30 sec - exiting
17:08:30 (3460): No heartbeat from core client for 30 sec - exiting
17:08:31 (3460): No heartbeat from core client for 30 sec - exiting
17:08:32 (3460): No heartbeat from core client for 30 sec - exiting
17:08:33 (3460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3668, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3668, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
19 Nov 2011 12:19:37 | 1141348 | 13625488 | hadcm3n_ykel_1940_40_007542971_1 | 181,440 | 221,711 | 1.2220 |
19 Nov 2011 03:57:26 | 1141348 | 13625488 | hadcm3n_ykel_1940_40_007542971_1 | 155,520 | 189,873 | 1.2209 |
18 Nov 2011 17:51:41 | 1141348 | 13625488 | hadcm3n_ykel_1940_40_007542971_1 | 129,600 | 157,977 | 1.2190 |
18 Nov 2011 08:39:09 | 1141348 | 13625488 | hadcm3n_ykel_1940_40_007542971_1 | 103,680 | 126,183 | 1.2170 |
17 Nov 2011 23:16:30 | 1141348 | 13625488 | hadcm3n_ykel_1940_40_007542971_1 | 77,760 | 94,273 | 1.2124 |
17 Nov 2011 09:48:39 | 1141348 | 13625488 | hadcm3n_ykel_1940_40_007542971_1 | 51,840 | 62,880 | 1.2130 |
17 Nov 2011 01:46:35 | 1141348 | 13625488 | hadcm3n_ykel_1940_40_007542971_1 | 25,920 | 31,547 | 1.2171 |
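
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count. The page does not state this, so treat it as an assumption. The minimal Python sketch below recomputes the value for each row copied from the table and compares it with the reported figure. The names `TRICKLES` and `seconds_per_timestep` are illustrative only and are not part of BOINC or CPDN.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (181_440, 221_711, 1.2220),
    (155_520, 189_873, 1.2209),
    (129_600, 157_977, 1.2190),
    (103_680, 126_183, 1.2170),
    (77_760, 94_273, 1.2124),
    (51_840, 62_880, 1.2130),
    (25_920, 31_547, 1.2171),
]


def seconds_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Assumed definition of the Average column: CPU seconds per model timestep."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu_sec, reported in TRICKLES:
        computed = seconds_per_timestep(cpu_sec, timestep)
        # Print the recomputed value next to the figure shown on the page.
        print(f"{timestep:>7,}  computed {computed:.4f}  reported {reported:.4f}")
```

Under this assumption every recomputed value agrees with the reported one to within rounding at four decimal places. Consecutive trickles are also spaced exactly 25,920 timesteps apart.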
©2024 cpdn.org