Name | hadcm3n_zi7r_1880_40_008200287_4 |
Workunit | 8355411 |
Created | 22 Sep 2012, 1:43:08 UTC |
Sent | 22 Sep 2012, 1:43:22 UTC |
Report deadline | 22 Dec 2012, 9:10:33 UTC |
Received | 31 Oct 2012, 17:49:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1181321 |
Run time | 22 days 4 hours 45 min 1 sec |
CPU time | 21 days 17 hours 55 min 52 sec |
Validate state | Invalid |
Credit | 12,130.56 |
Device peak FLOPS | 2.27 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 11:20:36 (5024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:20:38 (5024): No heartbeat from core client for 30 sec - exiting 11:20:39 (5024): No heartbeat from core client for 30 sec - exiting 11:20:40 (5024): No heartbeat from core client for 30 sec - exiting 11:20:41 (5024): No heartbeat from core client for 30 sec - exiting 11:20:42 (5024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:37:53 (3924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:03:00 (3944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:03:03 (3944): No heartbeat from core client for 30 sec - exiting 15:03:04 (3944): No heartbeat from core client for 30 sec - exiting 15:03:05 (3944): No heartbeat from core client for 30 sec - exiting 15:03:06 (3944): No heartbeat from core client for 30 sec - exiting 15:03:07 (3944): No heartbeat from core client for 30 sec - exiting 15:03:08 (3944): No heartbeat from core client for 30 sec - exiting 15:03:09 (3944): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
13:58:22 (4884): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:58:24 (4884): No heartbeat from core client for 30 sec - exiting 11:45:48 (5840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:45:50 (5840): No heartbeat from core client for 30 sec - exiting 11:45:51 (5840): No heartbeat from core client for 30 sec - exiting 11:45:52 (5840): No heartbeat from core client for 30 sec - exiting 11:45:53 (5840): No heartbeat from core client for 30 sec - exiting 11:45:55 (5840): No heartbeat from core client for 30 sec - exiting 11:45:56 (5840): No heartbeat from core client for 30 sec - exiting 11:45:57 (5840): No heartbeat from core client for 30 sec - exiting 11:45:58 (5840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:33:52 (8796): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 21:33:53 (8796): No heartbeat from core client for 30 sec - exiting 21:33:54 (8796): No heartbeat from core client for 30 sec - exiting 21:33:55 (8796): No heartbeat from core client for 30 sec - exiting 21:33:56 (8796): No heartbeat from core client for 30 sec - exiting 21:33:57 (8796): No heartbeat from core client for 30 sec - exiting 21:33:58 (8796): No heartbeat from core client for 30 sec - exiting 21:33:59 (8796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 
10:51:33 (3944): No heartbeat from core client for 30 sec - exiting 10:51:34 (3944): No heartbeat from core client for 30 sec - exiting 10:51:35 (3944): No heartbeat from core client for 30 sec - exiting 10:51:36 (3944): No heartbeat from core client for 30 sec - exiting 10:51:37 (3944): No heartbeat from core client for 30 sec - exiting 10:51:38 (3944): No heartbeat from core client for 30 sec - exiting 10:51:39 (3944): No heartbeat from core client for 30 sec - exiting 10:51:40 (3944): No heartbeat from core client for 30 sec - exiting 10:51:43 (3944): No heartbeat from core client for 30 sec - exiting 10:51:44 (3944): No heartbeat from core client for 30 sec - exiting 10:51:45 (3944): No heartbeat from core client for 30 sec - exiting 10:51:46 (3944): No heartbeat from core client for 30 sec - exiting 10:51:47 (3944): No heartbeat from core client for 30 sec - exiting 10:51:48 (3944): No heartbeat from core client for 30 sec - exiting 10:51:49 (3944): No heartbeat from core client for 30 sec - exiting 10:51:50 (3944): No heartbeat from core client for 30 sec - exiting 10:51:51 (3944): No heartbeat from core client for 30 sec - exiting 10:51:52 (3944): No heartbeat from core client for 30 sec - exiting 10:51:53 (3944): No heartbeat from core client for 30 sec - exiting 10:51:54 (3944): No heartbeat from core client for 30 sec - exiting 10:51:55 (3944): No heartbeat from core client for 30 sec - exiting 10:51:56 (3944): No heartbeat from core client for 30 sec - exiting 10:51:57 (3944): No heartbeat from core client for 30 sec - exiting 10:51:58 (3944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
10:51:59 (3944): No heartbeat from core client for 30 sec - exiting 10:52:00 (3944): No heartbeat from core client for 30 sec - exiting 10:52:01 (3944): No heartbeat from core client for 30 sec - exiting 10:52:02 (3944): No heartbeat from core client for 30 sec - exiting 10:52:03 (3944): No heartbeat from core client for 30 sec - exiting 10:52:04 (3944): No heartbeat from core client for 30 sec - exiting 10:54:56 (4136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
31 Oct 2012 17:50:35 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 1,010,880 | 1,872,965 | 1.8528 |
28 Oct 2012 19:45:10 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 984,960 | 1,824,100 | 1.8520 |
14 Oct 2012 09:41:28 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 959,040 | 1,775,544 | 1.8514 |
13 Oct 2012 20:03:32 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 933,120 | 1,726,697 | 1.8505 |
13 Oct 2012 05:51:59 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 907,200 | 1,678,371 | 1.8501 |
12 Oct 2012 16:31:55 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 881,280 | 1,630,956 | 1.8507 |
12 Oct 2012 02:29:11 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 855,360 | 1,583,882 | 1.8517 |
11 Oct 2012 12:06:21 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 829,440 | 1,535,580 | 1.8513 |
10 Oct 2012 14:52:19 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 803,520 | 1,488,153 | 1.8520 |
10 Oct 2012 01:49:51 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 777,600 | 1,440,793 | 1.8529 |
09 Oct 2012 11:31:05 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 751,680 | 1,392,321 | 1.8523 |
08 Oct 2012 21:57:55 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 725,760 | 1,344,606 | 1.8527 |
08 Oct 2012 08:46:03 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 699,840 | 1,297,845 | 1.8545 |
07 Oct 2012 18:45:28 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 673,920 | 1,249,842 | 1.8546 |
07 Oct 2012 05:03:54 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 648,000 | 1,201,779 | 1.8546 |
06 Oct 2012 14:38:53 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 622,080 | 1,152,908 | 1.8533 |
06 Oct 2012 01:55:50 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 596,160 | 1,104,652 | 1.8529 |
05 Oct 2012 11:48:07 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 570,240 | 1,057,523 | 1.8545 |
04 Oct 2012 23:15:20 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 544,320 | 1,010,238 | 1.8560 |
04 Oct 2012 09:13:10 | 1181321 | 15296115 | hadcm3n_zi7r_1880_40_008200287_4 | 518,400 | 963,282 | 1.8582 |
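
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count reached at that trickle. The following is a minimal Python sketch that checks this reading against three rows copied from the table. The function name `average_sec_per_timestep` is illustrative only and is not part of BOINC or the CPDN software.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: the table's Average (sec/TS) column is CPU Time / Timestep.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, average_reported) taken from the trickle table.
rows = [
    (1_010_880, 1_872_965, 1.8528),  # 31 Oct 2012 17:50:35
    (984_960, 1_824_100, 1.8520),    # 28 Oct 2012 19:45:10
    (518_400, 963_282, 1.8582),      # 04 Oct 2012 09:13:10
]

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Compare with a small tolerance to avoid float representation issues.
    matches = abs(computed - reported) < 5e-5
    print(f"{timestep:>9,} TS  computed={computed:.4f}  reported={reported:.4f}  match={matches}")
```

For the most recent trickle, 1,872,965 / 1,010,880 ≈ 1.8528, which agrees with the reported value; the other two rows agree as well under the same assumption.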
©2024 cpdn.org