Task 14729478

Name	hadcm3n_o7n8_2060_40_007998308_0
Workunit	8153422
Created	21 May 2012, 20:55:27 UTC
Sent	21 May 2012, 23:03:32 UTC
Report deadline	21 Aug 2012, 6:30:43 UTC
Received	31 May 2012, 4:12:16 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1218954
Run time	5 days 14 hours 51 min 2 sec
CPU time	5 days 8 hours 33 min 6 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	3.25 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
- exit code 193 (0xc1)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
03:11:14 (1500): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5396, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3060, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1668, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4060, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3492, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3988, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3412, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x778E5EAB read attempt to address 0x00000000
Engaging BOINC Windows Runtime Debugger...
Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o7n8_2060_40_007998308/dataout/shmem_restart.day
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
31 May 2012 03:12:12	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	518,400	462,781	0.8927
30 May 2012 16:21:22	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	492,480	439,687	0.8928
30 May 2012 00:37:24	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	466,560	416,953	0.8937
29 May 2012 16:41:31	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	440,640	393,315	0.8926
29 May 2012 04:08:52	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	414,720	369,907	0.8919
28 May 2012 22:21:32	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	388,800	346,798	0.8920
28 May 2012 15:12:27	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	362,880	323,905	0.8926
28 May 2012 02:16:28	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	336,960	300,679	0.8923
27 May 2012 20:12:22	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	311,040	279,118	0.8974
27 May 2012 12:59:52	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	285,120	256,647	0.9001
27 May 2012 04:03:36	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	259,200	233,116	0.8994
26 May 2012 21:22:53	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	233,280	210,729	0.9033
26 May 2012 14:02:25	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	207,360	186,897	0.9013
26 May 2012 07:18:30	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	181,440	163,495	0.9011
25 May 2012 00:57:12	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	155,520	140,235	0.9017
24 May 2012 18:09:47	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	129,600	117,578	0.9072
24 May 2012 03:58:33	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	103,680	94,245	0.9090
23 May 2012 18:33:44	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	77,760	70,859	0.9113
23 May 2012 03:15:21	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	51,840	47,293	0.9123
22 May 2012 06:00:16	1218954	14729478	hadcm3n_o7n8_2060_40_007998308_0	25,920	23,761	0.9167
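
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against two rows copied from the table. The function name `seconds_per_timestep` is hypothetical, and the formula is inferred from the figures rather than taken from any project documentation.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Average CPU seconds per model timestep, rounded to 4 places.

    Assumption: this mirrors how the Average (sec/TS) column is derived;
    the formula is inferred from the table values, not documented.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec) pairs copied from the first and last trickle rows
rows = [(518_400, 462_781), (25_920, 23_761)]

for timestep, cpu_time_sec in rows:
    print(timestep, seconds_per_timestep(cpu_time_sec, timestep))
```

Under this reading, the average falls from roughly 0.917 to 0.893 sec/TS over the run, so the host got slightly faster per timestep as the task progressed.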