Task 16001524

Name	hadcm3n_z9h2_1960_40_008389461_2
Workunit	8540320
Created	3 Sep 2013, 8:50:51 UTC
Sent	3 Sep 2013, 9:02:43 UTC
Report deadline	3 Dec 2013, 16:29:54 UTC
Received	11 Sep 2013, 11:37:20 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1193059
Run time	6 days 15 hours 0 min 17 sec
CPU time	6 days 14 hours 50 min 18 sec
Validate state	Invalid
Credit	4,665.60
Device peak FLOPS	3.07 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:42:26 (3084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
18:04:24 (2256): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
11 Sep 2013 04:19:59	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	388,800	558,234	1.4358
10 Sep 2013 18:01:57	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	362,880	521,304	1.4366
10 Sep 2013 07:54:06	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	336,960	484,903	1.4391
09 Sep 2013 20:59:24	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	311,040	445,532	1.4324
09 Sep 2013 09:33:26	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	285,120	408,575	1.4330
08 Sep 2013 19:49:59	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	259,200	372,963	1.4389
08 Sep 2013 08:06:54	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	233,280	338,145	1.4495
07 Sep 2013 21:33:00	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	207,360	300,973	1.4515
07 Sep 2013 10:14:24	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	181,440	263,908	1.4545
06 Sep 2013 23:36:49	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	155,520	226,013	1.4533
06 Sep 2013 12:58:26	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	129,600	187,627	1.4477
06 Sep 2013 02:19:15	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	103,680	149,285	1.4399
05 Sep 2013 15:56:00	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	77,760	112,126	1.4419
05 Sep 2013 05:52:23	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	51,840	75,930	1.4647
04 Sep 2013 19:24:55	1193059	16001524	hadcm3n_z9h2_1960_40_008389461_2	25,920	38,285	1.4770
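
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The short Python sketch below checks that reading against a few rows copied from the table. The function name `seconds_per_timestep` and the `rows` list are illustrative only and not part of BOINC or the CPDN software; the rounding rule is an assumption inferred from the displayed values.

```python
def seconds_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this mirrors how the 'Average (sec/TS)' column is derived.
    """
    return round(cpu_seconds / timesteps, 4)


# (timestep, CPU time in seconds, average shown in the table)
rows = [
    (388_800, 558_234, 1.4358),  # 11 Sep 2013 04:19:59
    (362_880, 521_304, 1.4366),  # 10 Sep 2013 18:01:57
    (25_920, 38_285, 1.4770),    # 04 Sep 2013 19:24:55
]

for timesteps, cpu_seconds, shown in rows:
    computed = seconds_per_timestep(cpu_seconds, timesteps)
    # Compare with a small tolerance rather than exact float equality.
    matches = abs(computed - shown) < 1e-9
    print(f"{timesteps:>7} TS: computed {computed:.4f}, table {shown:.4f}, match={matches}")
```

Each trickle in the table is 25,920 timesteps after the previous one, so the same function can be applied to the differences between consecutive rows to estimate the per-interval rate instead of the cumulative one.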