Task 15442610

Name	hadcm3n_zin9_1880_40_008246536_1
Workunit	8401660
Created	21 Nov 2012, 1:57:36 UTC
Sent	21 Nov 2012, 1:57:39 UTC
Report deadline	20 Feb 2013, 9:24:50 UTC
Received	9 Jun 2013, 1:44:16 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1194450
Run time	25 days 14 hours 43 min 10 sec
CPU time	17 days 4 hours 37 min 10 sec
Validate state	Invalid
Credit	12,441.60
Device peak FLOPS	2.52 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3484, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 11:29:34 (3976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:29:35 (3976): No heartbeat from core client for 30 sec - exiting 11:29:36 (3976): No heartbeat from core client for 30 sec - exiting 11:29:37 (3976): No heartbeat from core client for 30 sec - exiting 11:29:38 (3976): No heartbeat from core client for 30 sec - exiting 11:29:39 (3976): No heartbeat from core client for 30 sec - exiting 11:29:40 (3976): No heartbeat from core client for 30 sec - exiting 11:29:41 (3976): No heartbeat from core client for 30 sec - exiting 11:29:42 (3976): No heartbeat from core client for 30 sec - exiting 11:29:43 (3976): No heartbeat from core client for 30 sec - exiting 11:29:44 (3976): No heartbeat from core client for 30 sec - exiting 11:30:40 (5064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3712, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3712, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3988, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1620, iMonCtr=1 Model crash detected, will try to restart... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3688, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3860, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1332, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... no start tag in app init data 17:33:47 (3872): Can't parse init data file - running in standalone mode Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3712, iMonCtr=1 Model crash detected, will try to restart... CCSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4040, iMonCtr=1 Model crash detected, will try to restart... CSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4884, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3936, iMonCtr=1 Model crash detected, will try to restart... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77467373 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zin9_1880_40_008246536/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
09 Jun 2013 00:46:30	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	1,036,800	1,485,425	1.4327
02 Jun 2013 22:45:02	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	1,010,880	1,448,195	1.4326
29 May 2013 00:55:09	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	984,960	1,410,752	1.4323
20 May 2013 23:39:23	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	959,040	1,373,638	1.4323
12 May 2013 21:33:08	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	933,120	1,336,495	1.4323
08 May 2013 00:21:41	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	907,200	1,299,356	1.4323
24 Apr 2013 04:28:43	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	881,280	1,261,489	1.4314
17 Apr 2013 03:50:09	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	855,360	1,223,287	1.4301
13 Apr 2013 23:48:18	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	829,440	1,185,677	1.4295
12 Apr 2013 02:22:04	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	803,520	1,148,410	1.4292
02 Apr 2013 02:31:16	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	777,600	1,110,525	1.4281
26 Mar 2013 04:19:24	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	751,680	1,071,625	1.4256
23 Mar 2013 01:24:59	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	725,760	1,033,794	1.4244
22 Mar 2013 02:47:08	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	699,840	998,054	1.4261
17 Mar 2013 23:31:13	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	673,920	961,416	1.4266
16 Mar 2013 01:45:50	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	648,000	924,954	1.4274
10 Mar 2013 01:09:44	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	622,080	887,298	1.4263
04 Mar 2013 00:00:47	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	596,160	849,190	1.4244
26 Feb 2013 03:33:36	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	570,240	811,498	1.4231
21 Feb 2013 02:13:13	1194450	15442610	hadcm3n_zin9_1880_40_008246536_1	544,320	774,253	1.4224