Task 13140164

Name	hadcm3n_ylgs_1900_40_007360678_2
Workunit	7558108
Created	15 Jul 2011, 18:31:57 UTC
Sent	15 Jul 2011, 18:34:04 UTC
Report deadline	15 Oct 2011, 2:01:15 UTC
Received	7 Oct 2011, 5:57:51 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	950229
Run time	81 days 6 hours 23 min 27 sec
CPU time	71 days 13 hours 25 min 5 sec
Validate state	Invalid
Credit	9,331.20
Device peak FLOPS	1.14 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2760, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Ocean Restart file copy failed on ylgsko.dab4c30
Ocean Restart file copy failed on ylgsko.dab58o0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3916, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4136, iMonCtr=1
Model crash detected, will try to restart...
21:15:57 (3700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4716, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4168, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2892, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3012, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4532, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4532, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
19:01:00 (4264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
07 Oct 2011 04:56:14	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	777,600	6,182,677	7.9510
04 Oct 2011 07:40:32	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	751,680	5,958,271	7.9266
29 Sep 2011 10:00:07	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	725,760	5,735,702	7.9030
26 Sep 2011 14:38:40	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	699,840	5,511,565	7.8755
23 Sep 2011 19:26:00	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	673,920	5,287,387	7.8457
20 Sep 2011 19:11:16	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	648,000	5,068,037	7.8210
18 Sep 2011 07:50:52	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	622,080	4,859,988	7.8125
15 Sep 2011 16:18:44	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	596,160	4,642,744	7.7877
11 Sep 2011 07:14:20	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	570,240	4,441,101	7.7881
08 Sep 2011 13:37:55	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	544,320	4,233,975	7.7785
06 Sep 2011 03:11:07	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	518,400	4,027,495	7.7691
03 Sep 2011 07:32:38	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	492,480	3,807,369	7.7310
31 Aug 2011 14:53:41	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	466,560	3,590,505	7.6957
27 Aug 2011 05:44:04	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	440,640	3,388,166	7.6892
24 Aug 2011 09:33:25	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	414,720	3,169,801	7.6432
21 Aug 2011 17:28:12	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	388,800	2,949,985	7.5874
19 Aug 2011 03:25:51	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	362,880	2,735,067	7.5371
16 Aug 2011 09:29:56	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	336,960	2,523,311	7.4885
13 Aug 2011 20:18:04	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	311,040	2,311,288	7.4308
11 Aug 2011 05:30:03	950229	13140164	hadcm3n_ylgs_1900_40_007360678_2	285,120	2,101,016	7.3689
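
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep reached. That relationship is inferred from the figures above, not taken from project documentation. A minimal Python sketch that checks it against three of the rows, and also derives the rate over a single interval between consecutive trickles; the function names are hypothetical and used only for illustration:

```python
def parse_count(field: str) -> int:
    """Convert a comma-grouped table field such as '6,182,677' to an int."""
    return int(field.replace(",", ""))


def cumulative_sec_per_ts(cpu_seconds: int, timestep: int) -> float:
    """Cumulative average: total CPU seconds divided by timesteps completed.

    Assumption: this is how the 'Average (sec/TS)' column is computed.
    """
    return cpu_seconds / timestep


def interval_sec_per_ts(newer: tuple, older: tuple) -> float:
    """CPU seconds per timestep between two trickles given as (timestep, cpu_seconds)."""
    return (newer[1] - older[1]) / (newer[0] - older[0])


# (Timestep, CPU Time) pairs copied from the table above.
latest = (parse_count("777,600"), parse_count("6,182,677"))
previous = (parse_count("751,680"), parse_count("5,958,271"))
earliest = (parse_count("285,120"), parse_count("2,101,016"))

for timestep, cpu_seconds in (latest, previous, earliest):
    print(timestep, round(cumulative_sec_per_ts(cpu_seconds, timestep), 4))

# Rate over the final interval only, which is not a column in the table.
print(round(interval_sec_per_ts(latest, previous), 4))
```

The interval rate is the more sensitive of the two: the cumulative average rises only slowly from row to row, while the per-interval figure shows directly how much CPU time the most recent block of timesteps cost.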