Task 12366134

Name	famous_v3wf_1199_200_006690821_6
Workunit	6894074
Created	6 Dec 2010, 22:12:06 UTC
Sent	6 Dec 2010, 22:15:19 UTC
Report deadline	8 Mar 2011, 5:42:30 UTC
Received	14 Feb 2011, 12:44:14 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1117493
Run time	14 days 0 hours 6 min 53 sec
CPU time	12 days 19 hours 1 min 5 sec
Validate state	Invalid
Credit	5,991.12
Device peak FLOPS	2.32 GFLOPS
Application version	UK Met Office FAMOUS v6.11 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1244, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( 21:45:33 (3316): called boinc_finish 17:14:34 (4968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:12:58 (2544): Can't acquire lockfile (32) - waiting 35s 10:13:00 (4384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:37:59 (6848): Can't acquire lockfile (32) - waiting 35s 09:38:07 (1696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4684, iMonCtr=1 Model crash detected, will try to restart... Controller:Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 09:35:37 (4984): Can't acquire lockfile (32) - waiting 35s 09:35:54 (5772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:19:57 (5436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7584, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:05:48 (7828): Can't acquire lockfile (32) - waiting 35s 10:06:07 (4372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:46:04 (3580): Can't acquire lockfile (32) - waiting 35s 12:46:31 (1116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=556, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Sorry, too many model crashes! :-( 12:43:12 (4112): called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
14 Feb 2011 11:50:06	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,815,866	1,102,094	0.6069
14 Feb 2011 10:40:14	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,806,506	1,097,932	0.6078
14 Feb 2011 09:24:47	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,797,146	1,093,785	0.6086
13 Feb 2011 22:19:41	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,787,786	1,088,789	0.6090
13 Feb 2011 21:02:42	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,778,426	1,084,619	0.6099
13 Feb 2011 19:35:39	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,769,066	1,080,299	0.6107
13 Feb 2011 18:31:24	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,759,706	1,076,196	0.6116
13 Feb 2011 00:15:21	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,750,346	1,071,826	0.6124
12 Feb 2011 22:32:55	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,740,986	1,066,781	0.6127
12 Feb 2011 21:09:10	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,731,626	1,062,602	0.6136
12 Feb 2011 12:23:39	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,722,266	1,057,514	0.6140
11 Feb 2011 15:19:41	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,712,906	1,053,398	0.6150
11 Feb 2011 14:01:53	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,703,546	1,049,367	0.6160
11 Feb 2011 13:19:31	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,694,186	1,045,338	0.6170
11 Feb 2011 13:19:31	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,684,826	1,041,277	0.6180
11 Feb 2011 13:19:31	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,675,466	1,037,257	0.6191
11 Feb 2011 13:19:31	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,666,106	1,033,272	0.6202
10 Feb 2011 16:16:35	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,656,746	1,029,050	0.6211
10 Feb 2011 14:57:06	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,647,386	1,025,067	0.6222
10 Feb 2011 13:35:12	1117493	12366134	famous_v3wf_1199_200_006690821_6	1,638,026	1,021,086	0.6234