Task 15493710

Name	hadcm3n_39pe_1940_40_008263811_2
Workunit	8418935
Created	21 Dec 2012, 9:50:05 UTC
Sent	21 Dec 2012, 10:42:01 UTC
Report deadline	22 Mar 2013, 18:09:12 UTC
Received	7 Jan 2013, 22:46:53 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1229189
Run time	11 days 7 hours 2 min 23 sec
CPU time	10 days 5 hours 26 min 51 sec
Validate state	Invalid
Credit	9,331.20
Device peak FLOPS	3.37 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6180, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3712, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2440, iMonCtr=1
Model crash detected, will try to restart...
08:17:27 (2376): No heartbeat from core client for 30 sec - exiting
08:17:28 (2376): No heartbeat from core client for 30 sec - exiting
08:17:30 (2376): No heartbeat from core client for 30 sec - exiting
08:17:31 (2376): No heartbeat from core client for 30 sec - exiting
08:17:32 (2376): No heartbeat from core client for 30 sec - exiting
08:17:33 (2376): No heartbeat from core client for 30 sec - exiting
08:17:34 (2376): No heartbeat from core client for 30 sec - exiting
08:17:35 (2376): No heartbeat from core client for 30 sec - exiting
08:17:36 (2376): No heartbeat from core client for 30 sec - exiting
08:17:37 (2376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3052, iMonCtr=1
Model crash detected, will try to restart...
17:08:21 (3588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2688, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3692, iMonCtr=1
Model crash detected, will try to restart...
Signal 4 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4308, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4308, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x7717FF2B write attempt to address 0xFFFFFFF8
Engaging BOINC Windows Runtime Debugger...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x772A71F3 read attempt to address 0xFFFFFFF8
Engaging BOINC Windows Runtime Debugger...
Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_39pe_1940_40_008263811/dataout/shmem_restart.day
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
07 Jan 2013 11:56:52	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	777,600	881,862	1.1341
06 Jan 2013 20:42:08	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	751,680	852,965	1.1347
06 Jan 2013 12:11:50	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	725,760	824,009	1.1354
05 Jan 2013 19:45:43	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	699,840	795,089	1.1361
05 Jan 2013 06:40:21	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	673,920	782,899	1.1617
04 Jan 2013 13:20:28	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	648,000	753,895	1.1634
03 Jan 2013 14:40:43	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	622,080	724,545	1.1647
02 Jan 2013 17:32:26	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	596,160	696,099	1.1676
02 Jan 2013 09:45:20	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	570,240	667,190	1.1700
01 Jan 2013 18:33:19	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	544,320	637,534	1.1712
01 Jan 2013 09:51:33	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	518,400	608,327	1.1735
31 Dec 2012 18:31:54	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	492,480	579,517	1.1767
31 Dec 2012 09:23:33	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	466,560	550,352	1.1796
30 Dec 2012 18:27:50	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	440,640	521,593	1.1837
30 Dec 2012 08:54:27	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	414,720	492,668	1.1880
29 Dec 2012 17:27:57	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	388,800	463,520	1.1922
29 Dec 2012 00:48:23	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	362,880	432,751	1.1925
28 Dec 2012 15:41:02	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	336,960	402,446	1.1943
28 Dec 2012 06:49:00	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	311,040	372,381	1.1972
27 Dec 2012 17:01:17	1229189	15493710	hadcm3n_39pe_1940_40_008263811_2	285,120	342,276	1.2005
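
The Average (sec/TS) column in the table above appears to be the cumulative CPU Time divided by the Timestep count, rounded to four decimals. That reading is inferred from the figures, not stated on the page. A minimal sketch that checks it against two rows, using a hypothetical helper name `average_sec_per_timestep`:

```python
def average_sec_per_timestep(cpu_time_sec: str, timestep: str) -> float:
    """Return CPU seconds per timestep, rounded to 4 decimals.

    Assumption: Average (sec/TS) = CPU Time (sec) / Timestep.
    Inputs are the table's strings, with thousands separators.
    """
    cpu = int(cpu_time_sec.replace(",", ""))
    ts = int(timestep.replace(",", ""))
    return round(cpu / ts, 4)


# (Timestep, CPU Time) pairs copied from the first and last table rows.
rows = [("777,600", "881,862"), ("285,120", "342,276")]
for ts, cpu in rows:
    print(ts, cpu, average_sec_per_timestep(cpu, ts))
```

Both rows reproduce the listed averages (1.1341 and 1.2005), which is consistent with the column being a running average over the whole task and not a per-trickle rate.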