Task 15134881

Name	hadcm3n_o274_2060_40_008154506_0
Workunit	8309630
Created	17 Aug 2012, 13:27:13 UTC
Sent	17 Aug 2012, 13:34:28 UTC
Report deadline	16 Nov 2012, 21:01:39 UTC
Received	29 Sep 2012, 15:17:16 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1089304
Run time	7 days 0 hours 30 min 21 sec
CPU time	7 days 0 hours 12 min 53 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	3.40 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.25</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 23:13:22 (4980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:12:22 (3972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=228, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=228, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=228, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3160, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3160, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3160, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
28 Sep 2012 19:10:43	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	518,400	601,191	1.1597
28 Sep 2012 10:46:45	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	492,480	571,097	1.1596
28 Sep 2012 01:54:43	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	466,560	539,170	1.1556
27 Sep 2012 17:03:40	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	440,640	507,295	1.1513
27 Sep 2012 08:44:07	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	414,720	477,406	1.1512
27 Sep 2012 00:27:03	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	388,800	447,766	1.1517
26 Sep 2012 16:11:05	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	362,880	418,173	1.1524
26 Sep 2012 08:02:28	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	336,960	388,725	1.1536
25 Sep 2012 23:45:35	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	311,040	359,067	1.1544
25 Sep 2012 15:29:24	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	285,120	329,444	1.1555
25 Sep 2012 07:07:36	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	259,200	299,212	1.1544
24 Sep 2012 22:15:27	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	233,280	267,541	1.1469
24 Sep 2012 13:16:00	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	207,360	235,395	1.1352
24 Sep 2012 04:23:55	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	181,440	203,389	1.1210
23 Sep 2012 19:29:49	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	155,520	171,571	1.1032
23 Sep 2012 10:56:42	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	129,600	140,744	1.0860
23 Sep 2012 01:49:37	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	103,680	111,860	1.0789
22 Sep 2012 18:13:41	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	77,760	84,474	1.0863
22 Sep 2012 10:33:17	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	51,840	56,851	1.0967
22 Sep 2012 02:51:41	1089304	15134881	hadcm3n_o274_2060_40_008154506_0	25,920	29,167	1.1253