Task 16050184

Name	hadcm3n_odc1_1900_40_008472404_1
Workunit	8623243
Created	30 Sep 2013, 2:22:03 UTC
Sent	30 Sep 2013, 2:41:18 UTC
Report deadline	30 Dec 2013, 10:08:29 UTC
Received	4 Oct 2013, 16:37:38 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1295375
Run time	3 days 18 hours 19 min 46 sec
CPU time	3 days 9 hours 43 min 19 sec
Validate state	Invalid
Credit	5,287.68
Device peak FLOPS	3.70 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 21:58:41 (4596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:05:56 (4592): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4000, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4000, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4000, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4000, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4000, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4000, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
04 Oct 2013 14:06:32	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	440,640	290,883	0.6601
04 Oct 2013 09:16:46	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	414,720	273,944	0.6606
04 Oct 2013 03:03:17	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	388,800	256,237	0.6590
03 Oct 2013 17:32:29	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	362,880	238,883	0.6583
03 Oct 2013 12:43:14	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	336,960	221,613	0.6577
03 Oct 2013 05:24:45	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	311,040	204,230	0.6566
03 Oct 2013 00:42:28	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	285,120	187,426	0.6574
02 Oct 2013 15:50:48	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	259,200	170,490	0.6578
02 Oct 2013 08:24:14	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	233,280	153,611	0.6585
02 Oct 2013 02:16:04	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	207,360	136,741	0.6594
01 Oct 2013 21:08:31	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	181,440	119,959	0.6611
01 Oct 2013 12:27:36	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	155,520	102,503	0.6591
01 Oct 2013 07:40:13	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	129,600	85,406	0.6590
30 Sep 2013 23:16:26	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	103,680	68,490	0.6606
30 Sep 2013 18:08:22	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	77,760	51,341	0.6602
30 Sep 2013 13:02:55	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	51,840	34,049	0.6568
30 Sep 2013 08:25:09	1295375	16050184	hadcm3n_odc1_1900_40_008472404_1	25,920	17,112	0.6602