Task 13954492

Name	hadcm3n_yhhp_1980_40_007693211_1
Workunit	7848319
Created	23 Jan 2012, 13:28:49 UTC
Sent	23 Jan 2012, 13:28:55 UTC
Report deadline	23 Apr 2012, 20:56:06 UTC
Received	19 Feb 2012, 0:24:38 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	964561
Run time	25 days 12 hours 36 min 18 sec
CPU time	20 days 21 hours 14 min 45 sec
Validate state	Invalid
Credit	11,197.44
Device peak FLOPS	2.37 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.60</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:14:05 (3472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4592, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4592, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4592, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
18 Feb 2012 05:42:35	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	933,120	1,772,825	1.8999
17 Feb 2012 13:56:25	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	907,200	1,722,696	1.8989
16 Feb 2012 22:53:29	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	881,280	1,671,359	1.8965
16 Feb 2012 07:06:32	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	855,360	1,620,106	1.8941
15 Feb 2012 16:02:05	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	829,440	1,568,478	1.8910
15 Feb 2012 00:30:18	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	803,520	1,520,298	1.8920
14 Feb 2012 09:15:56	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	777,600	1,472,264	1.8933
13 Feb 2012 17:28:42	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	751,680	1,424,050	1.8945
13 Feb 2012 02:07:22	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	725,760	1,375,899	1.8958
12 Feb 2012 10:25:27	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	699,840	1,326,944	1.8961
11 Feb 2012 19:12:49	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	673,920	1,278,633	1.8973
11 Feb 2012 00:40:16	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	648,000	1,229,570	1.8975
10 Feb 2012 07:57:17	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	622,080	1,180,065	1.8970
09 Feb 2012 13:25:49	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	596,160	1,130,720	1.8967
08 Feb 2012 18:42:04	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	570,240	1,080,487	1.8948
07 Feb 2012 18:07:18	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	544,320	1,035,120	1.9017
06 Feb 2012 19:31:33	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	518,400	988,353	1.9065
06 Feb 2012 02:48:10	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	492,480	938,825	1.9063
05 Feb 2012 08:44:32	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	466,560	889,342	1.9062
04 Feb 2012 15:58:28	964561	13954492	hadcm3n_yhhp_1980_40_007693211_1	440,640	838,451	1.9028