Name | hadcm3n_zlcm_1880_40_008251145_1 |
Workunit | 8406269 |
Created | 22 Nov 2012, 10:22:34 UTC |
Sent | 22 Nov 2012, 10:23:05 UTC |
Report deadline | 21 Feb 2013, 17:50:16 UTC |
Received | 12 Jan 2013, 13:13:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1202786 |
Run time | 17 days 7 hours 50 min 42 sec |
CPU time | 10 days 16 hours 14 min 37 sec |
Validate state | Invalid |
Credit | 3,732.48 |
Device peak FLOPS | 2.42 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:16:27 (2308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:16:29 (2308): No heartbeat from core client for 30 sec - exiting 00:16:30 (2308): No heartbeat from core client for 30 sec - exiting 00:16:31 (2308): No heartbeat from core client for 30 sec - exiting 00:17:26 (2928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:02:58 (3812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:03:01 (3812): No heartbeat from core client for 30 sec - exiting 02:03:02 (3812): No heartbeat from core client for 30 sec - exiting 02:04:05 (5888): No heartbeat from core client for 30 sec - exiting 02:04:07 (5888): No heartbeat from core client for 30 sec - exiting 02:04:08 (5888): No heartbeat from core client for 30 sec - exiting 02:04:09 (5888): No heartbeat from core client for 30 sec - exiting 02:04:18 (5888): No heartbeat from core client for 30 sec - exiting 02:04:20 (5888): No heartbeat from core client for 30 sec - exiting 02:04:22 (5888): No heartbeat from core client for 30 sec - exiting 02:04:23 (5888): No heartbeat from core client for 30 sec - exiting 02:04:24 (5888): No heartbeat from core client for 30 sec - exiting 02:04:25 (5888): No heartbeat from core client for 30 sec - exiting 02:04:27 (5888): No heartbeat from core client for 30 sec - exiting 02:04:29 (5888): No heartbeat from core client for 30 sec - exiting 02:04:31 (5888): No heartbeat from core client for 30 sec - exiting 02:04:32 (5888): No heartbeat from core client for 30 sec - exiting 02:04:33 (5888): No heartbeat from core client for 30 sec - exiting 02:04:34 (5888): No heartbeat from core client for 30 sec - exiting 02:04:35 (5888): No heartbeat from core client for 30 sec - exiting 02:04:36 (5888): No heartbeat from core client for 30 sec - exiting 02:04:37 (5888): No heartbeat from core client for 30 sec - exiting 02:04:38 (5888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:05:14 (3244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:25:33 (2668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:25:36 (2668): No heartbeat from core client for 30 sec - exiting 00:25:37 (2668): No heartbeat from core client for 30 sec - exiting 00:25:38 (2668): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 06:14:13 (4908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:14:15 (4908): No heartbeat from core client for 30 sec - exiting 06:15:07 (4684): No heartbeat from core client for 30 sec - exiting 06:15:08 (4684): No heartbeat from core client for 30 sec - exiting 06:15:10 (4684): No heartbeat from core client for 30 sec - exiting 06:15:16 (4684): No heartbeat from core client for 30 sec - exiting 06:15:17 (4684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:15:19 (4684): No heartbeat from core client for 30 sec - exiting 18:11:30 (2660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:11:33 (2660): No heartbeat from core client for 30 sec - exiting 18:11:34 (2660): No heartbeat from core client for 30 sec - exiting 18:12:26 (4904): No heartbeat from core client for 30 sec - exiting 18:12:27 (4904): No heartbeat from core client for 30 sec - exiting 18:12:28 (4904): No heartbeat from core client for 30 sec - exiting 18:12:29 (4904): No heartbeat from core client for 30 sec - exiting 18:12:30 (4904): No heartbeat from core client for 30 sec - exiting 18:12:31 (4904): No heartbeat from core client for 30 sec - exiting 18:12:32 (4904): No heartbeat from core client for 30 sec - exiting 18:12:33 (4904): No heartbeat from core client for 30 sec - exiting 18:12:34 (4904): No heartbeat from core client for 30 sec - exiting 18:12:37 (4904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:12:38 (4904): No heartbeat from core client for 30 sec - exiting 18:13:24 (1960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:22:26 (4900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:10:39 (4200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2436, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2436, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2436, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2436, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2436, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2436, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 Dec 2012 22:47:10 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 311,040 | 917,215 | 2.9489 |
26 Dec 2012 03:57:54 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 285,120 | 851,723 | 2.9872 |
25 Dec 2012 08:57:30 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 259,200 | 785,283 | 3.0296 |
24 Dec 2012 13:55:41 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 233,280 | 718,420 | 3.0796 |
23 Dec 2012 14:54:28 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 207,360 | 643,262 | 3.1022 |
22 Dec 2012 16:40:58 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 181,440 | 564,534 | 3.1114 |
21 Dec 2012 16:32:27 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 155,520 | 481,230 | 3.0943 |
20 Dec 2012 08:01:37 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 129,600 | 373,087 | 2.8788 |
17 Dec 2012 14:05:27 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 103,680 | 274,675 | 2.6493 |
15 Dec 2012 17:26:19 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 77,760 | 200,085 | 2.5731 |
24 Nov 2012 17:00:17 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 51,840 | 124,027 | 2.3925 |
23 Nov 2012 13:33:37 | 1202786 | 15453407 | hadcm3n_zlcm_1880_40_008251145_1 | 25,920 | 59,304 | 2.2880 |
©2024 cpdn.org