Task 16028268

Name	hadcm3n_o0bn_1980_40_008389014_4
Workunit	8539873
Created	20 Sep 2013, 11:52:32 UTC
Sent	20 Sep 2013, 12:01:19 UTC
Report deadline	20 Dec 2013, 19:28:30 UTC
Received	24 Oct 2013, 22:01:37 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1393537
Run time	30 days 6 hours 40 min 28 sec
CPU time	30 days 1 hours 11 min 2 sec
Validate state	Invalid
Credit	10,886.40
Device peak FLOPS	1.84 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4420, iMonCtr=1 Model craSuspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2132, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5992, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5824, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3880, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=476, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3388, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
24 Oct 2013 14:10:44	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	907,200	2,567,463	2.8301
23 Oct 2013 11:37:53	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	881,280	2,497,929	2.8344
22 Oct 2013 03:51:19	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	855,360	2,428,312	2.8389
21 Oct 2013 06:17:51	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	829,440	2,358,929	2.8440
20 Oct 2013 10:05:01	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	803,520	2,286,028	2.8450
18 Oct 2013 17:14:05	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	777,600	2,215,200	2.8488
17 Oct 2013 21:15:24	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	751,680	2,143,828	2.8520
16 Oct 2013 19:16:46	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	725,760	2,072,503	2.8556
15 Oct 2013 23:14:44	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	699,840	2,001,341	2.8597
15 Oct 2013 03:02:49	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	673,920	1,930,108	2.8640
14 Oct 2013 07:16:17	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	648,000	1,858,756	2.8685
13 Oct 2013 10:19:47	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	622,080	1,787,479	2.8734
12 Oct 2013 14:08:32	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	596,160	1,715,747	2.8780
11 Oct 2013 17:05:57	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	570,240	1,644,089	2.8832
10 Oct 2013 21:04:09	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	544,320	1,572,448	2.8888
09 Oct 2013 23:46:07	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	518,400	1,499,722	2.8930
08 Oct 2013 21:07:07	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	492,480	1,428,140	2.8999
08 Oct 2013 00:49:05	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	466,560	1,355,853	2.9061
07 Oct 2013 04:26:56	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	440,640	1,283,391	2.9126
06 Oct 2013 08:09:34	1111761	16028268	hadcm3n_o0bn_1980_40_008389014_4	414,720	1,210,769	2.9195