Task 12858852

Name	hadcm3n_p6mj_1900_40_007226171_2
Workunit	7424411
Created	4 May 2011, 10:38:03 UTC
Sent	4 May 2011, 10:55:18 UTC
Report deadline	3 Aug 2011, 18:22:29 UTC
Received	18 Sep 2011, 21:54:10 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1089738
Run time	13 days 16 hours 27 min 41 sec
CPU time	13 days 11 hours 49 min 45 sec
Validate state	Invalid
Credit	7,153.92
Device peak FLOPS	2.69 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3856, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4088, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3912, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3888, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3628, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3872, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3908, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3872, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3872, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
07 Sep 2011 18:09:50	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	596,160	1,164,855	1.9539
07 Sep 2011 03:29:45	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	570,240	1,112,835	1.9515
30 Aug 2011 21:50:22	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	544,320	1,061,082	1.9494
23 Aug 2011 16:49:04	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	518,400	1,010,439	1.9491
16 Aug 2011 00:18:47	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	492,480	963,148	1.9557
08 Aug 2011 15:48:27	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	466,560	911,122	1.9529
03 Aug 2011 09:29:16	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	440,640	859,083	1.9496
29 Jul 2011 12:10:43	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	414,720	807,006	1.9459
26 Jul 2011 21:34:28	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	388,800	754,641	1.9409
25 Jul 2011 22:47:15	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	362,880	702,729	1.9365
25 Jul 2011 18:51:53	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	336,960	650,414	1.9302
25 Jul 2011 17:15:12	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	311,040	601,014	1.9323
08 Jul 2011 04:37:11	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	285,120	551,311	1.9336
04 Jul 2011 09:03:32	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	259,200	513,093	1.9795
10 Jun 2011 04:50:24	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	233,280	462,762	1.9837
08 Jun 2011 03:10:46	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	207,360	418,736	2.0194
03 Jun 2011 14:41:10	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	181,440	366,316	2.0189
01 Jun 2011 11:53:58	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	155,520	313,995	2.0190
27 May 2011 07:14:31	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	129,600	261,581	2.0184
23 May 2011 08:37:10	1089738	12858852	hadcm3n_p6mj_1900_40_007226171_2	103,680	209,145	2.0172
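
The Average (sec/TS) column in the table above appears to be the cumulative CPU time divided by the cumulative timestep count, and consecutive trickles are 25,920 timesteps apart. The following is a minimal sketch that checks this reading against three rows copied from the table. The names TRICKLES and sec_per_timestep are made up for illustration and are not part of BOINC or CPDN.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
# Newest, second newest, and oldest row shown on the page.
TRICKLES = [
    (596_160, 1_164_855, 1.9539),
    (570_240, 1_112_835, 1.9515),
    (103_680, 209_145, 2.0172),
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep (assumed meaning of the column)."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in TRICKLES:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # The page shows four decimal places, so allow half a unit in the last place.
    matches = abs(computed - reported) < 5e-5
    print(f"{timestep:>9,} TS  computed={computed:.4f}  reported={reported:.4f}  match={matches}")

# Spacing between the two newest trickles, in timesteps.
trickle_interval = TRICKLES[0][0] - TRICKLES[1][0]
print("timesteps between trickles:", trickle_interval)
```

If the assumption holds, every row prints match=True, which is the case for the three rows sampled here.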