Task 12744948

Name	hadcm3n_o51q_1900_40_007201873_0
Workunit	7400153
Created	28 Mar 2011, 14:12:20 UTC
Sent	30 Mar 2011, 1:24:25 UTC
Report deadline	29 Jun 2011, 8:51:36 UTC
Received	20 Apr 2011, 16:30:11 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1072839
Run time	15 days 19 hours 36 min 17 sec
CPU time	14 days 18 hours 41 min 26 sec
Validate state	Invalid
Credit	9,331.20
Device peak FLOPS	2.27 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:55:17 (15040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:00:05 (19484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
20 Apr 2011 18:17:45	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	777,600	1,274,840	1.6395
20 Apr 2011 18:17:45	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	751,680	1,234,603	1.6425
20 Apr 2011 18:17:45	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	725,760	1,195,390	1.6471
20 Apr 2011 18:17:45	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	699,840	1,156,943	1.6532
20 Apr 2011 18:17:45	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	673,920	1,118,135	1.6592
13 Apr 2011 03:03:37	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	648,000	1,079,995	1.6667
12 Apr 2011 13:49:37	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	622,080	1,039,676	1.6713
12 Apr 2011 09:07:47	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	596,160	999,468	1.6765
12 Apr 2011 09:06:51	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	570,240	961,637	1.6864
12 Apr 2011 09:06:14	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	544,320	919,319	1.6889
12 Apr 2011 09:05:40	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	518,400	875,841	1.6895
12 Apr 2011 09:05:11	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	492,480	833,171	1.6918
12 Apr 2011 09:04:43	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	466,560	790,444	1.6942
12 Apr 2011 09:04:20	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	440,640	747,499	1.6964
12 Apr 2011 09:03:57	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	414,720	704,708	1.6992
12 Apr 2011 09:03:24	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	388,800	661,150	1.7005
12 Apr 2011 09:02:24	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	362,880	618,019	1.7031
12 Apr 2011 09:02:04	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	336,960	573,935	1.7033
12 Apr 2011 09:01:42	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	311,040	530,331	1.7050
12 Apr 2011 08:58:48	1072839	12744948	hadcm3n_o51q_1900_40_007201873_0	285,120	487,333	1.7092
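
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is inferred from the figures. A minimal sketch that checks the assumption against three rows copied from the table above (the function name is hypothetical, chosen for illustration):

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places like the table.

    Assumption: Average (sec/TS) = CPU Time (sec) / Timestep.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, average reported on the page)
rows = [
    (777_600, 1_274_840, 1.6395),
    (751_680, 1_234_603, 1.6425),
    (285_120, 487_333, 1.7092),
]

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Compare with a small tolerance rather than exact float equality.
    matches = abs(computed - reported) < 1e-4
    print(f"TS={timestep:>7}  computed={computed:.4f}  reported={reported:.4f}  match={matches}")
```

If the assumption holds, every row should report a match; the slowly falling average would then mean the later timesteps ran slightly faster per step than the earlier ones.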