Task 16588253

Name	hadcm3n_8at1_1980_40_008723424_0
Workunit	8869402
Created	23 Apr 2014, 13:00:16 UTC
Sent	3 May 2014, 20:12:00 UTC
Report deadline	3 Aug 2014, 3:39:11 UTC
Received	10 Jun 2014, 23:25:44 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1045292
Run time	22 days 0 hours 7 min 55 sec
CPU time	21 days 12 hours 27 min 42 sec
Validate state	Invalid
Credit	9,953.28
Device peak FLOPS	2.49 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4572, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4284, iMonCtr=1
Model crash detected, will try to restart...
20:14:44 (4928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4996, iMonCtr=1
Model crash detected, will try to restart...
09:59:27 (4892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1460, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1460, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1460, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1460, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1460, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1460, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
10 Jun 2014 09:04:23	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	829,440	1,813,422	2.1863
10 Jun 2014 09:03:33	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	803,520	1,761,215	2.1919
10 Jun 2014 09:02:45	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	777,600	1,714,784	2.2052
10 Jun 2014 09:01:40	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	751,680	1,655,849	2.2029
10 Jun 2014 07:09:24	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	725,760	1,596,880	2.2003
08 Jun 2014 06:36:18	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	699,840	1,538,054	2.1977
05 Jun 2014 03:49:03	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	673,920	1,485,595	2.2044
04 Jun 2014 09:41:13	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	648,000	1,423,798	2.1972
03 Jun 2014 16:18:02	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	622,080	1,361,919	2.1893
03 Jun 2014 00:19:54	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	596,160	1,305,699	2.1902
02 Jun 2014 11:42:48	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	570,240	1,260,492	2.2105
01 Jun 2014 21:04:33	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	544,320	1,208,071	2.2194
01 Jun 2014 03:41:52	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	518,400	1,147,693	2.2139
31 May 2014 10:36:50	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	492,480	1,086,887	2.2070
30 May 2014 17:37:26	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	466,560	1,025,785	2.1986
30 May 2014 00:20:14	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	440,640	964,711	2.1893
29 May 2014 06:15:50	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	414,720	904,050	2.1799
28 May 2014 12:35:01	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	388,800	843,035	2.1683
27 May 2014 19:28:46	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	362,880	788,218	2.1721
27 May 2014 03:30:05	1045292	16588253	hadcm3n_8at1_1980_40_008723424_0	336,960	739,470	2.1945