Task 10963257

Name	hadsm3dhet2_jkyf_006590153_0
Workunit	6793526
Created	15 Mar 2010, 11:53:11 UTC
Sent	21 Oct 2010, 22:01:07 UTC
Report deadline	4 Oct 2011, 3:21:07 UTC
Received	7 Dec 2010, 14:10:47 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1099533
Run time	7 days 23 hours 22 min 24 sec
CPU time	5 days 13 hours 9 min 41 sec
Validate state	Invalid
Credit	2,183.35
Device peak FLOPS	2.33 GFLOPS
Application version	UK Met Office HadSM3 Slab Model v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5524, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3904, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4936, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4116, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1580, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=944, iMonCtr=1 Model crash detected, will try to restart... CCPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6864, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4720, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5592, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
07 Dec 2010 14:11:57	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	237,644	477,786	2.0105
05 Dec 2010 11:41:45	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	226,842	456,213	2.0111
03 Dec 2010 19:37:44	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	216,040	434,311	2.0103
03 Dec 2010 00:58:51	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	205,238	412,065	2.0077
01 Dec 2010 23:18:10	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	194,436	390,072	2.0062
30 Nov 2010 18:52:42	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	183,634	367,983	2.0039
26 Nov 2010 06:10:30	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	172,832	346,634	2.0056
24 Nov 2010 09:15:34	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	162,030	325,958	2.0117
23 Nov 2010 09:46:15	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	151,228	303,582	2.0074
23 Nov 2010 00:58:32	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	140,426	281,183	2.0024
22 Nov 2010 16:14:27	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	129,624	259,568	2.0025
17 Nov 2010 15:34:30	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	118,822	238,712	2.0090
15 Nov 2010 19:17:08	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	108,020	218,775	2.0253
15 Nov 2010 09:56:08	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	97,218	196,509	2.0213
14 Nov 2010 21:17:42	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	86,416	172,781	1.9994
14 Nov 2010 10:44:12	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	75,614	150,048	1.9844
14 Nov 2010 07:23:34	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	64,812	127,478	1.9669
13 Nov 2010 17:27:18	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	54,010	105,533	1.9540
03 Nov 2010 05:30:36	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	43,208	83,871	1.9411
30 Oct 2010 18:35:11	1099533	10963257	hadsm3dhet2_jkyf_006590153_0	32,406	62,680	1.9342