Name | hadcm3n_n4a6_1920_40_008321394_2 |
Workunit | 8472529 |
Created | 7 May 2013, 1:20:44 UTC |
Sent | 7 May 2013, 1:20:47 UTC |
Report deadline | 6 Aug 2013, 8:47:58 UTC |
Received | 28 May 2013, 13:03:27 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1265248 |
Run time | 6 days 13 hours 55 min 32 sec |
CPU time | 6 days 2 hours 11 min 20 sec |
Validate state | Invalid |
Credit | 1,866.24 |
Device peak FLOPS | 0.84 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2700, iMonCtr=1 Model crash detected, will try to restart... 18:01:39 (5576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:15:05 (5604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:56:28 (892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
12:19:02 (6104): No heartbeat from core client for 30 sec - exiting 12:19:03 (6104): No heartbeat from core client for 30 sec - exiting 12:19:04 (6104): No heartbeat from core client for 30 sec - exiting 12:19:05 (6104): No heartbeat from core client for 30 sec - exiting 12:19:06 (6104): No heartbeat from core client for 30 sec - exiting 12:19:07 (6104): No heartbeat from core client for 30 sec - exiting 12:19:08 (6104): No heartbeat from core client for 30 sec - exiting 12:19:09 (6104): No heartbeat from core client for 30 sec - exiting 12:19:10 (6104): No heartbeat from core client for 30 sec - exiting 12:19:11 (6104): No heartbeat from core client for 30 sec - exiting 12:19:12 (6104): No heartbeat from core client for 30 sec - exiting 12:19:13 (6104): No heartbeat from core client for 30 sec - exiting 12:19:14 (6104): No heartbeat from core client for 30 sec - exiting 12:19:15 (6104): No heartbeat from core client for 30 sec - exiting 12:19:16 (6104): No heartbeat from core client for 30 sec - exiting 12:19:17 (6104): No heartbeat from core client for 30 sec - exiting 12:19:18 (6104): No heartbeat from core client for 30 sec - exiting 12:19:19 (6104): No heartbeat from core client for 30 sec - exiting 12:19:20 (6104): No heartbeat from core client for 30 sec - exiting 12:19:21 (6104): No heartbeat from core client for 30 sec - exiting 12:19:22 (6104): No heartbeat from core client for 30 sec - exiting 12:19:23 (6104): No heartbeat from core client for 30 sec - exiting 12:19:24 (6104): No heartbeat from core client for 30 sec - exiting 12:19:25 (6104): No heartbeat from core client for 30 sec - exiting 12:19:26 (6104): No heartbeat from core client for 30 sec - exiting 12:19:28 (6104): No heartbeat from core client for 30 sec - exiting 12:19:29 (6104): No heartbeat from core client for 30 sec - exiting 12:19:30 (6104): No heartbeat from core client for 30 sec - exiting 12:19:31 (6104): No heartbeat from core client for 30 sec - exiting 12:19:32 (6104): No 
heartbeat from core client for 30 sec - exiting 12:19:33 (6104): No heartbeat from core client for 30 sec - exiting 12:19:34 (6104): No heartbeat from core client for 30 sec - exiting 12:19:35 (6104): No heartbeat from core client for 30 sec - exiting 12:19:36 (6104): No heartbeat from core client for 30 sec - exiting 12:19:37 (6104): No heartbeat from core client for 30 sec - exiting 12:19:38 (6104): No heartbeat from core client for 30 sec - exiting 12:19:39 (6104): No heartbeat from core client for 30 sec - exiting 12:19:40 (6104): No heartbeat from core client for 30 sec - exiting 12:19:41 (6104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:53:07 (5664): No heartbeat from core client for 30 sec - exiting 12:53:08 (5664): No heartbeat from core client for 30 sec - exiting 12:53:09 (5664): No heartbeat from core client for 30 sec - exiting 12:53:10 (5664): No heartbeat from core client for 30 sec - exiting 12:53:11 (5664): No heartbeat from core client for 30 sec - exiting 12:53:12 (5664): No heartbeat from core client for 30 sec - exiting 12:53:13 (5664): No heartbeat from core client for 30 sec - exiting 12:53:14 (5664): No heartbeat from core client for 30 sec - exiting 12:53:15 (5664): No heartbeat from core client for 30 sec - exiting 12:53:16 (5664): No heartbeat from core client for 30 sec - exiting 12:53:18 (5664): No heartbeat from core client for 30 sec - exiting 12:53:19 (5664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:53:20 (5664): No heartbeat from core client for 30 sec - exiting 12:53:21 (5664): No heartbeat from core client for 30 sec - exiting 02:01:14 (4872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
20:00:00 (5520): No heartbeat from core client for 30 sec - exiting 20:00:02 (5520): No heartbeat from core client for 30 sec - exiting 20:00:03 (5520): No heartbeat from core client for 30 sec - exiting 20:00:04 (5520): No heartbeat from core client for 30 sec - exiting 20:00:05 (5520): No heartbeat from core client for 30 sec - exiting 20:00:06 (5520): No heartbeat from core client for 30 sec - exiting 20:00:07 (5520): No heartbeat from core client for 30 sec - exiting 20:00:08 (5520): No heartbeat from core client for 30 sec - exiting 20:00:09 (5520): No heartbeat from core client for 30 sec - exiting 20:00:10 (5520): No heartbeat from core client for 30 sec - exiting 20:00:11 (5520): No heartbeat from core client for 30 sec - exiting 20:00:12 (5520): No heartbeat from core client for 30 sec - exiting 20:00:13 (5520): No heartbeat from core client for 30 sec - exiting 20:00:14 (5520): No heartbeat from core client for 30 sec - exiting 20:00:15 (5520): No heartbeat from core client for 30 sec - exiting 20:00:16 (5520): No heartbeat from core client for 30 sec - exiting 20:00:17 (5520): No heartbeat from core client for 30 sec - exiting 20:00:18 (5520): No heartbeat from core client for 30 sec - exiting 20:00:19 (5520): No heartbeat from core client for 30 sec - exiting 20:00:20 (5520): No heartbeat from core client for 30 sec - exiting 20:00:21 (5520): No heartbeat from core client for 30 sec - exiting 20:00:22 (5520): No heartbeat from core client for 30 sec - exiting 20:00:23 (5520): No heartbeat from core client for 30 sec - exiting 20:00:24 (5520): No heartbeat from core client for 30 sec - exiting 20:00:25 (5520): No heartbeat from core client for 30 sec - exiting 20:00:26 (5520): No heartbeat from core client for 30 sec - exiting 20:00:27 (5520): No heartbeat from core client for 30 sec - exiting 20:00:28 (5520): No heartbeat from core client for 30 sec - exiting 20:00:29 (5520): No heartbeat from core client for 30 sec - exiting 20:00:30 (5520): No 
heartbeat from core client for 30 sec - exiting 20:00:31 (5520): No heartbeat from core client for 30 sec - exiting 20:00:32 (5520): No heartbeat from core client for 30 sec - exiting 20:00:33 (5520): No heartbeat from core client for 30 sec - exiting 20:00:34 (5520): No heartbeat from core client for 30 sec - exiting 20:00:35 (5520): No heartbeat from core client for 30 sec - exiting 20:00:36 (5520): No heartbeat from core client for 30 sec - exiting 20:00:37 (5520): No heartbeat from core client for 30 sec - exiting 20:00:38 (5520): No heartbeat from core client for 30 sec - exiting 20:00:39 (5520): No heartbeat from core client for 30 sec - exiting 20:00:40 (5520): No heartbeat from core client for 30 sec - exiting 20:00:41 (5520): No heartbeat from core client for 30 sec - exiting 20:00:42 (5520): No heartbeat from core client for 30 sec - exiting 20:00:43 (5520): No heartbeat from core client for 30 sec - exiting 20:00:44 (5520): No heartbeat from core client for 30 sec - exiting 20:00:45 (5520): No heartbeat from core client for 30 sec - exiting 20:00:46 (5520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:53:23 (6256): No heartbeat from core client for 30 sec - exiting 21:53:24 (6256): No heartbeat from core client for 30 sec - exiting 21:53:25 (6256): No heartbeat from core client for 30 sec - exiting 21:53:26 (6256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5852, iMonCtr=1 Model crash detected, will try to restart... 14:37:51 (5396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
20:58:29 (6140): No heartbeat from core client for 30 sec - exiting 20:58:30 (6140): No heartbeat from core client for 30 sec - exiting 20:58:31 (6140): No heartbeat from core client for 30 sec - exiting 20:58:32 (6140): No heartbeat from core client for 30 sec - exiting 20:58:33 (6140): No heartbeat from core client for 30 sec - exiting 20:58:34 (6140): No heartbeat from core client for 30 sec - exiting 20:58:35 (6140): No heartbeat from core client for 30 sec - exiting 20:58:36 (6140): No heartbeat from core client for 30 sec - exiting 20:58:37 (6140): No heartbeat from core client for 30 sec - exiting 20:58:38 (6140): No heartbeat from core client for 30 sec - exiting 20:58:39 (6140): No heartbeat from core client for 30 sec - exiting 20:58:40 (6140): No heartbeat from core client for 30 sec - exiting 20:58:41 (6140): No heartbeat from core client for 30 sec - exiting 20:58:42 (6140): No heartbeat from core client for 30 sec - exiting 20:58:43 (6140): No heartbeat from core client for 30 sec - exiting 20:58:44 (6140): No heartbeat from core client for 30 sec - exiting 20:58:45 (6140): No heartbeat from core client for 30 sec - exiting 20:58:46 (6140): No heartbeat from core client for 30 sec - exiting 20:58:47 (6140): No heartbeat from core client for 30 sec - exiting 20:58:48 (6140): No heartbeat from core client for 30 sec - exiting 20:58:49 (6140): No heartbeat from core client for 30 sec - exiting 20:58:50 (6140): No heartbeat from core client for 30 sec - exiting 20:58:51 (6140): No heartbeat from core client for 30 sec - exiting 20:58:52 (6140): No heartbeat from core client for 30 sec - exiting 20:58:53 (6140): No heartbeat from core client for 30 sec - exiting 20:58:54 (6140): No heartbeat from core client for 30 sec - exiting 20:58:55 (6140): No heartbeat from core client for 30 sec - exiting 20:58:56 (6140): No heartbeat from core client for 30 sec - exiting 20:58:57 (6140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 
'heartbeat' from BOINC... 20:58:58 (6140): No heartbeat from core client for 30 sec - exiting 20:58:59 (6140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:07:44 (4992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5944, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 18:25:14 (1380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3944, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1720, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1720, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1720, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1720, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1720, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1720, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
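The stderr output above consists almost entirely of a handful of repeating messages, so a per-message tally is easier to read than the raw text. The following is a minimal sketch, not part of BOINC or the CPDN application: the names `PATTERNS` and `tally_events` are hypothetical, and the sample string is a short excerpt copied from the log, with one line break added in the middle of a message to mimic the wrapping seen above.

```python
import re
from collections import Counter

# Hypothetical labels for the message types that appear in the stderr above.
PATTERNS = {
    "quit_request": re.compile(r"CPDN Monitor - Quit request from BOINC"),
    "suspend_request": re.compile(r"CPDN Monitor - Suspend request from BOINC"),
    "no_heartbeat_app": re.compile(
        r"\d{2}:\d{2}:\d{2} \(\d+\): No heartbeat from core client"
    ),
    "no_heartbeat_monitor": re.compile(r"CPDN Monitor - No 'heartbeat' from BOINC"),
    "model_crash": re.compile(r"Model crash detected, will try to restart"),
}


def tally_events(stderr_text):
    """Count occurrences of each known message type in a stderr dump."""
    # Collapse all whitespace so messages wrapped across lines still match.
    text = " ".join(stderr_text.split())
    return Counter({name: len(p.findall(text)) for name, p in PATTERNS.items()})


# Short excerpt from the log above; the newline after "No" is deliberate.
sample = (
    "CPDN Monitor - Quit request from BOINC... "
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=2700, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "18:01:39 (5576): No\nheartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "CPDN Monitor - Quit request from BOINC..."
)

counts = tally_events(sample)
for name, n in counts.items():
    print(f"{name}: {n}")
```

Running `tally_events` on the full contents of the `<stderr_txt>` element would give the same breakdown for the whole task.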
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
27 May 2013 05:37:51 | 1265248 | 15764584 | hadcm3n_n4a6_1920_40_008321394_2 | 155,520 | 521,604 | 3.3539 |
25 May 2013 16:04:42 | 1265248 | 15764584 | hadcm3n_n4a6_1920_40_008321394_2 | 129,600 | 440,504 | 3.3990 |
22 May 2013 21:23:12 | 1265248 | 15764584 | hadcm3n_n4a6_1920_40_008321394_2 | 103,680 | 365,424 | 3.5245 |
17 May 2013 22:54:41 | 1265248 | 15764584 | hadcm3n_n4a6_1920_40_008321394_2 | 77,760 | 289,057 | 3.7173 |
15 May 2013 00:37:41 | 1265248 | 15764584 | hadcm3n_n4a6_1920_40_008321394_2 | 51,840 | 197,420 | 3.8083 |
11 May 2013 23:52:45 | 1265248 | 15764584 | hadcm3n_n4a6_1920_40_008321394_2 | 25,920 | 96,871 | 3.7373 |
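
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count reported in the same trickle. A short sketch that recomputes it from the rows above is shown here; the variable and function names are illustrative only, and the formula is inferred from the table values rather than taken from CPDN documentation.

```python
# (timestep, cumulative CPU seconds) pairs copied from the trickle table,
# newest first, in the same order as the rows above.
trickles = [
    (155_520, 521_604),
    (129_600, 440_504),
    (103_680, 365_424),
    (77_760, 289_057),
    (51_840, 197_420),
    (25_920, 96_871),
]


def seconds_per_timestep(timestep, cpu_seconds):
    """Average CPU seconds spent per model timestep so far."""
    return cpu_seconds / timestep


# Round to four decimals to match the precision shown in the table.
averages = [round(seconds_per_timestep(ts, cpu), 4) for ts, cpu in trickles]

for (ts, cpu), avg in zip(trickles, averages):
    print(f"{ts:>7,} timesteps  {cpu:>7,} s  {avg:.4f} s/TS")
```

The recomputed values agree with the table to four decimal places, and each trickle is 25,920 timesteps after the previous one.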
©2024 cpdn.org