Task 12829165

Name	hadcm3n_p5ff_1900_40_007224619_0
Workunit	7422859
Created	26 Apr 2011, 15:32:39 UTC
Sent	28 Apr 2011, 6:15:20 UTC
Report deadline	28 Jul 2011, 13:42:31 UTC
Received	17 May 2011, 5:41:08 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	-1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
Computer ID	827009
Run time	12 days 9 hours 21 min 26 sec
CPU time	10 days 13 hours 38 min 26 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	2.24 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=644, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4680, iMonCtr=1 Model crash detected, will try to restart... 08:44:54 (4304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4628, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 07:19:30 (2220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1 Model crash detected, will try to restart... 04:10:35 (5776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1664, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5456, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5908, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4416, iMonCtr=1 Model crash detected, will try to restart... 08:18:02 (5016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6108, iMonCtr=1 Model crash detected, will try to restart... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77A778F5 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77A96E0F read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
16 May 2011 20:13:19	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	518,400	910,457	1.7563
15 May 2011 18:43:46	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	492,480	864,220	1.7548
14 May 2011 19:53:54	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	466,560	818,682	1.7547
14 May 2011 06:42:54	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	440,640	773,977	1.7565
13 May 2011 05:41:04	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	414,720	727,445	1.7541
12 May 2011 13:06:00	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	388,800	680,930	1.7514
11 May 2011 13:03:19	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	362,880	634,458	1.7484
10 May 2011 05:43:57	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	336,960	588,841	1.7475
09 May 2011 15:58:21	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	311,040	542,835	1.7452
06 May 2011 11:32:44	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	285,120	497,016	1.7432
05 May 2011 21:12:30	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	259,200	451,364	1.7414
05 May 2011 06:04:36	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	233,280	405,513	1.7383
04 May 2011 06:31:42	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	207,360	360,100	1.7366
03 May 2011 07:07:48	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	181,440	314,311	1.7323
02 May 2011 17:58:18	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	155,520	269,337	1.7318
01 May 2011 18:25:05	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	129,600	223,704	1.7261
01 May 2011 05:45:53	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	103,680	179,553	1.7318
30 Apr 2011 17:22:13	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	77,760	135,662	1.7446
29 Apr 2011 11:34:53	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	51,840	90,841	1.7523
28 Apr 2011 21:54:17	827009	12829165	hadcm3n_p5ff_1900_40_007224619_0	25,920	45,671	1.7620