Name | hadcm3n_7y33_1980_40_008455458_4 |
Workunit | 8606314 |
Created | 1 Apr 2014, 18:01:22 UTC |
Sent | 1 Apr 2014, 18:01:55 UTC |
Report deadline | 2 Jul 2014, 1:29:06 UTC |
Received | 12 Jun 2014, 1:31:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1265124 |
Run time | 33 days 9 hours 49 min 43 sec |
CPU time | 24 days 15 hours 15 min 54 sec |
Validate state | Invalid |
Credit | 9,020.16 |
Device peak FLOPS | 2.62 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:33:05 (2812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:33:06 (2812): No heartbeat from core client for 30 sec - exiting 11:33:07 (2812): No heartbeat from core client for 30 sec - exiting 11:33:09 (2812): No heartbeat from core client for 30 sec - exiting 11:33:10 (2812): No heartbeat from core client for 30 sec - exiting 11:33:11 (2812): No heartbeat from core client for 30 sec - exiting 11:33:12 (2812): No heartbeat from core client for 30 sec - exiting 11:33:13 (2812): No heartbeat from core client for 30 sec - exiting 11:33:14 (2812): No heartbeat from core client for 30 sec - exiting 11:33:15 (2812): No heartbeat from core client for 30 sec - exiting 11:33:16 (2812): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:50:03 (9232): No heartbeat from core client for 30 sec - exiting 02:50:04 (9232): No heartbeat from core client for 30 sec - exiting 02:50:05 (9232): No heartbeat from core client for 30 sec - exiting 02:50:06 (9232): No heartbeat from core client for 30 sec - exiting 02:50:07 (9232): No heartbeat from core client for 30 sec - exiting 02:50:08 (9232): No heartbeat from core client for 30 sec - exiting 02:50:09 (9232): No heartbeat from core client for 30 sec - exiting 02:50:10 (9232): No heartbeat from core client for 30 sec - exiting 02:50:11 (9232): No heartbeat from core client for 30 sec - exiting 02:50:12 (9232): No heartbeat from core client for 30 sec - exiting 02:50:13 (9232): No heartbeat from core client for 30 sec - exiting 02:50:14 (9232): No heartbeat from core client for 30 sec - exiting 02:50:15 (9232): No heartbeat from core client for 30 sec - exiting 02:50:16 (9232): No heartbeat from core client for 30 sec - exiting 02:50:17 (9232): No heartbeat from core client for 30 sec - exiting 02:50:18 (9232): No heartbeat from core client for 30 sec - exiting 02:50:20 (9232): No heartbeat from core client for 30 sec - exiting 02:50:21 (9232): No heartbeat from core client for 30 sec - exiting 02:50:22 (9232): No heartbeat from core client for 30 sec - exiting 02:50:23 (9232): No heartbeat from core client for 30 sec - exiting 02:50:24 (9232): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:31:36 (10652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:05:25 (24180): No heartbeat from core client for 30 sec - exiting 06:05:26 (24180): No heartbeat from core client for 30 sec - exiting 06:05:27 (24180): No heartbeat from core client for 30 sec - exiting 06:05:28 (24180): No heartbeat from core client for 30 sec - exiting 06:05:29 (24180): No heartbeat from core client for 30 sec - exiting 06:05:30 (24180): No heartbeat from core client for 30 sec - exiting 06:05:31 (24180): No heartbeat from core client for 30 sec - exiting 06:05:32 (24180): No heartbeat from core client for 30 sec - exiting 06:05:33 (24180): No heartbeat from core client for 30 sec - exiting 06:05:34 (24180): No heartbeat from core client for 30 sec - exiting 06:05:36 (24180): No heartbeat from core client for 30 sec - exiting 06:05:37 (24180): No heartbeat from core client for 30 sec - exiting 06:05:38 (24180): No heartbeat from core client for 30 sec - exiting 06:05:39 (24180): No heartbeat from core client for 30 sec - exiting 06:05:40 (24180): No heartbeat from core client for 30 sec - exiting 06:05:41 (24180): No heartbeat from core client for 30 sec - exiting 06:05:42 (24180): No heartbeat from core client for 30 sec - exiting 06:05:43 (24180): No heartbeat from core client for 30 sec - exiting 06:05:44 (24180): No heartbeat from core client for 30 sec - exiting 06:05:45 (24180): No heartbeat from core client for 30 sec - exiting 06:05:46 (24180): No heartbeat from core client for 30 sec - exiting 06:05:47 (24180): No heartbeat from core client for 30 sec - exiting 06:05:48 (24180): No heartbeat from core client for 30 sec - exiting 06:05:49 (24180): No heartbeat from core client for 30 sec - exiting 06:05:50 (24180): No heartbeat from core client for 30 sec - exiting 06:05:51 (24180): No heartbeat from core client for 30 sec - exiting 06:05:52 (24180): No heartbeat from core client for 30 sec - exiting 06:05:53 (24180): No heartbeat from core client for 30 sec - exiting 06:05:54 (24180): No heartbeat from core client for 30 sec - exiting 06:05:55 (24180): No heartbeat from core client for 30 sec - exiting 06:05:56 (24180): No heartbeat from core client for 30 sec - exiting 06:05:57 (24180): No heartbeat from core client for 30 sec - exiting 06:05:58 (24180): No heartbeat from core client for 30 sec - exiting 06:05:59 (24180): No heartbeat from core client for 30 sec - exiting 06:06:00 (24180): No heartbeat from core client for 30 sec - exiting 06:06:01 (24180): No heartbeat from core client for 30 sec - exiting 06:06:02 (24180): No heartbeat from core client for 30 sec - exiting 06:06:03 (24180): No heartbeat from core client for 30 sec - exiting 06:06:04 (24180): No heartbeat from core client for 30 sec - exiting 06:06:05 (24180): No heartbeat from core client for 30 sec - exiting 06:06:06 (24180): No heartbeat from core client for 30 sec - exiting 06:06:07 (24180): No heartbeat from core client for 30 sec - exiting 06:06:08 (24180): No heartbeat from core client for 30 sec - exiting 06:06:09 (24180): No heartbeat from core client for 30 sec - exiting 06:06:10 (24180): No heartbeat from core client for 30 sec - exiting 06:06:11 (24180): No heartbeat from core client for 30 sec - exiting 06:06:12 (24180): No heartbeat from core client for 30 sec - exiting 06:06:13 (24180): No heartbeat from core client for 30 sec - exiting 06:06:14 (24180): No heartbeat from core client for 30 sec - exiting 06:06:15 (24180): No heartbeat from core client for 30 sec - exiting 06:06:16 (24180): No heartbeat from core client for 30 sec - exiting 06:06:17 (24180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/7y33ko.pjj7c10 Error converting file to netcdf: dataout/7y33ko.pij7c10 Error converting file to netcdf: dataout/7y33ko.pfj7c10 Error converting file to netcdf: dataout/7y33ka.phj7c10 Error converting file to netcdf: dataout/7y33ka.pgj7c10 Error converting file to netcdf: dataout/7y33ka.pej7c10 Error converting file to netcdf: dataout/7y33ka.pdj7c10 14:01:34 (24876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:01:35 (24876): No heartbeat from core client for 30 sec - exiting 20:28:01 (25540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:28:02 (25540): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:15:05 (3828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:15:07 (3828): No heartbeat from core client for 30 sec - exiting 17:15:08 (3828): No heartbeat from core client for 30 sec - exiting 17:15:09 (3828): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3164, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3164, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3164, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3164, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3164, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3164, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Jun 2014 01:16:15 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 751,680 | 2,131,002 | 2.8350 |
10 Jun 2014 09:03:57 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 725,760 | 2,057,593 | 2.8351 |
10 Jun 2014 09:02:18 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 699,840 | 1,984,192 | 2.8352 |
10 Jun 2014 04:34:59 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 673,920 | 1,910,268 | 2.8346 |
06 Jun 2014 11:33:11 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 648,000 | 1,836,612 | 2.8343 |
05 Jun 2014 07:59:40 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 622,080 | 1,762,726 | 2.8336 |
04 Jun 2014 04:35:11 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 596,160 | 1,689,065 | 2.8332 |
03 Jun 2014 01:00:02 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 570,240 | 1,615,318 | 2.8327 |
01 May 2014 08:17:24 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 544,320 | 1,541,833 | 2.8326 |
30 Apr 2014 05:46:41 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 518,400 | 1,468,318 | 2.8324 |
23 Apr 2014 16:53:34 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 492,480 | 1,395,199 | 2.8330 |
22 Apr 2014 13:52:46 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 466,560 | 1,321,129 | 2.8316 |
21 Apr 2014 13:47:25 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 440,640 | 1,249,544 | 2.8357 |
21 Apr 2014 12:46:15 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 414,720 | 1,176,116 | 2.8359 |
19 Apr 2014 07:59:42 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 388,800 | 1,101,786 | 2.8338 |
18 Apr 2014 03:56:12 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 362,880 | 1,027,598 | 2.8318 |
16 Apr 2014 23:57:42 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 336,960 | 953,911 | 2.8309 |
15 Apr 2014 19:48:05 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 311,040 | 880,078 | 2.8295 |
14 Apr 2014 16:43:02 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 285,120 | 806,183 | 2.8275 |
13 Apr 2014 13:38:00 | 1265124 | 16439409 | hadcm3n_7y33_1980_40_008455458_4 | 259,200 | 732,214 | 2.8249 |
©2024 cpdn.org