Task 15830702

Name	hadcm3n_u30l_2020_40_008337957_2
Workunit	8488818
Created	5 Jun 2013, 8:17:19 UTC
Sent	5 Jun 2013, 8:37:52 UTC
Report deadline	4 Sep 2013, 16:05:03 UTC
Received	3 Jul 2013, 11:13:06 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1376550
Run time	27 days 17 hours 18 min 14 sec
CPU time	27 days 1 hours 44 min 23 sec
Validate state	Invalid
Credit	9,331.20
Device peak FLOPS	1.76 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5972, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4380, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 01:18:35 (5436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
02 Jul 2013 18:59:14	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	777,600	2,297,878	2.9551
02 Jul 2013 11:55:35	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	751,680	2,221,188	2.9550
02 Jul 2013 11:04:11	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	725,760	2,144,878	2.9554
02 Jul 2013 10:26:07	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	699,840	2,067,948	2.9549
02 Jul 2013 09:58:19	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	673,920	1,991,511	2.9551
28 Jun 2013 08:08:09	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	648,000	1,915,457	2.9560
27 Jun 2013 10:44:53	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	622,080	1,839,583	2.9571
26 Jun 2013 13:11:38	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	596,160	1,763,757	2.9585
25 Jun 2013 16:26:05	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	570,240	1,687,812	2.9598
24 Jun 2013 18:40:04	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	544,320	1,611,676	2.9609
23 Jun 2013 21:02:51	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	518,400	1,535,850	2.9627
22 Jun 2013 22:32:14	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	492,480	1,458,782	2.9621
22 Jun 2013 01:21:08	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	466,560	1,381,022	2.9600
21 Jun 2013 02:50:44	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	440,640	1,303,650	2.9585
20 Jun 2013 05:31:34	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	414,720	1,227,431	2.9597
19 Jun 2013 08:06:03	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	388,800	1,151,059	2.9605
18 Jun 2013 10:19:52	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	362,880	1,073,817	2.9592
17 Jun 2013 11:47:10	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	336,960	994,895	2.9526
16 Jun 2013 07:20:08	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	311,040	917,491	2.9498
15 Jun 2013 09:34:59	1274848	15830702	hadcm3n_u30l_2020_40_008337957_2	285,120	841,082	2.9499