Task 13774908

Name	hadcm3n_yfop_1940_40_007614923_3
Workunit	7793053
Created	13 Dec 2011, 20:20:21 UTC
Sent	13 Dec 2011, 21:07:16 UTC
Report deadline	14 Mar 2012, 4:34:27 UTC
Received	24 Jan 2012, 19:04:56 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1135257
Run time	16 days 18 hours 26 min 46 sec
CPU time	13 days 16 hours 48 min 29 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	2.99 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4140, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:23:34 (7192): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3552, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2616, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6792, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7112, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1972, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7744, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7632, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6100, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4820, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4820, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4820, iMonCtr=1
Model crash detected, will try to restart...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77885EAB read attempt to address 0x4066FCFB
Engaging BOINC Windows Runtime Debugger...
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
23 Jan 2012 21:05:52	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	518,400	1,175,268	2.2671
22 Jan 2012 00:39:19	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	492,480	1,115,380	2.2648
20 Jan 2012 14:06:05	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	466,560	1,057,590	2.2668
18 Jan 2012 16:46:09	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	440,640	997,376	2.2635
16 Jan 2012 16:46:58	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	414,720	936,778	2.2588
14 Jan 2012 14:48:26	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	388,800	880,289	2.2641
12 Jan 2012 16:51:25	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	362,880	821,808	2.2647
10 Jan 2012 16:15:13	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	336,960	762,705	2.2635
08 Jan 2012 20:44:46	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	311,040	703,412	2.2615
06 Jan 2012 16:49:57	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	285,120	643,528	2.2570
03 Jan 2012 22:28:19	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	259,200	583,502	2.2512
02 Jan 2012 00:32:53	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	233,280	522,900	2.2415
30 Dec 2011 23:08:43	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	207,360	462,895	2.2323
27 Dec 2011 23:47:37	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	181,440	401,905	2.2151
25 Dec 2011 18:05:12	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	155,520	342,038	2.1993
22 Dec 2011 23:13:22	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	129,600	285,394	2.2021
21 Dec 2011 15:14:36	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	103,680	227,957	2.1987
19 Dec 2011 15:44:20	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	77,760	171,709	2.2082
17 Dec 2011 14:50:15	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	51,840	113,305	2.1857
15 Dec 2011 18:25:23	1135257	13774908	hadcm3n_yfop_1940_40_007614923_3	25,920	57,538	2.2198