Task 15683034

Name	hadcm3n_o1od_2140_40_008269012_1
Workunit	8424136
Created	25 Mar 2013, 18:51:35 UTC
Sent	25 Mar 2013, 18:51:40 UTC
Report deadline	25 Jun 2013, 2:18:51 UTC
Received	13 Apr 2013, 13:41:33 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	25 (0x00000019) Unknown error code
Computer ID	1229213
Run time	3 days 20 hours 35 min
CPU time	3 days 10 hours 35 min 36 sec
Validate state	Invalid
Credit	4,976.64
Device peak FLOPS	4.43 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3276, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4952, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4952, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4852, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4844, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4532, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4532, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2068, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5088, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1 Model crash detected, will try to restart... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
08 Apr 2013 20:03:44	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	414,720	242,513	0.5848
07 Apr 2013 12:44:11	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	388,800	227,243	0.5845
06 Apr 2013 21:38:58	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	362,880	212,123	0.5846
06 Apr 2013 16:00:57	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	336,960	196,734	0.5838
05 Apr 2013 23:28:53	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	311,040	181,655	0.5840
05 Apr 2013 19:08:57	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	285,120	166,385	0.5836
04 Apr 2013 19:22:37	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	259,200	151,367	0.5840
03 Apr 2013 20:29:34	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	233,280	136,295	0.5843
02 Apr 2013 22:41:26	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	207,360	121,121	0.5841
02 Apr 2013 18:41:04	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	181,440	106,065	0.5846
01 Apr 2013 19:06:06	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	155,520	90,706	0.5832
29 Mar 2013 00:16:20	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	129,600	75,859	0.5853
28 Mar 2013 19:12:09	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	103,680	60,464	0.5832
27 Mar 2013 21:49:02	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	77,760	45,331	0.5830
27 Mar 2013 00:28:43	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	51,840	30,258	0.5837
25 Mar 2013 23:57:39	1229213	15683034	hadcm3n_o1od_2140_40_008269012_1	25,920	15,201	0.5865