Task 13640856

Name	hadcm3n_ykuz_1900_40_007524426_2
Workunit	7721901
Created	17 Nov 2011, 3:49:34 UTC
Sent	18 Nov 2011, 9:31:40 UTC
Report deadline	17 Feb 2012, 16:58:51 UTC
Received	25 Dec 2011, 9:04:28 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1065107
Run time	9 days 7 hours 6 min 21 sec
CPU time	9 days 7 hours 6 min 21 sec
Validate state	Invalid
Credit	4,665.60
Device peak FLOPS	2.34 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2884, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3632, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2864, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3504, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3600, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3884, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Dec 2011 02:12:16	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	388,800	789,749	2.0312
24 Dec 2011 11:45:32	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	362,880	739,900	2.0390
23 Dec 2011 19:56:28	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	336,960	689,915	2.0475
23 Dec 2011 04:45:14	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	311,040	639,355	2.0555
22 Dec 2011 03:41:52	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	285,120	585,902	2.0549
20 Dec 2011 23:30:29	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	259,200	531,127	2.0491
18 Dec 2011 08:47:31	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	233,280	480,017	2.0577
17 Dec 2011 07:28:37	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	207,360	429,070	2.0692
16 Dec 2011 01:34:42	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	181,440	372,824	2.0548
12 Dec 2011 03:11:56	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	155,520	320,811	2.0628
11 Dec 2011 11:47:30	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	129,600	267,415	2.0634
10 Dec 2011 08:36:01	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	103,680	214,531	2.0692
08 Dec 2011 08:13:03	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	77,760	163,297	2.1000
03 Dec 2011 06:25:42	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	51,840	106,204	2.0487
27 Nov 2011 01:18:28	1065107	13640856	hadcm3n_ykuz_1900_40_007524426_2	25,920	52,163	2.0125