Task 15943549

Name	hadcm3n_o0f5_1980_40_008398521_2
Workunit	8549377
Created	29 Aug 2013, 15:35:57 UTC
Sent	29 Aug 2013, 15:36:28 UTC
Report deadline	28 Nov 2013, 23:03:39 UTC
Received	24 Sep 2013, 15:40:40 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1309451
Run time	5 days 14 hours 23 min 26 sec
CPU time	4 days 13 hours 16 min 36 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.76 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=172, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4036, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4036, iMonCtr=1
Model crash detected, will try to restart...
09:50:55 (4068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3476, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2408, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3636, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2380, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4268, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2920, iMonCtr=1
Model crash detected, will try to restart...
06:51:40 (1628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:51:42 (1628): No heartbeat from core client for 30 sec - exiting
06:51:43 (1628): No heartbeat from core client for 30 sec - exiting
06:58:14 (836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:58:15 (836): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2200, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3428, iMonCtr=1
Model crash detected, will try to restart...
06:42:02 (3412): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77727985 read attempt to address 0xFFFFFFF8
Engaging BOINC Windows Runtime Debugger...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77731CAF read attempt to address 0xFFFFFFF8
Engaging BOINC Windows Runtime Debugger...
Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o0f5_1980_40_008398521/dataout/shmem_restart.day
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
23 Sep 2013 14:10:26	1176151	15943549	hadcm3n_o0f5_1980_40_008398521_2	259,200	381,438	1.4716
20 Sep 2013 15:17:28	1176151	15943549	hadcm3n_o0f5_1980_40_008398521_2	233,280	344,070	1.4749
18 Sep 2013 10:03:43	1176151	15943549	hadcm3n_o0f5_1980_40_008398521_2	207,360	305,888	1.4752
17 Sep 2013 05:07:00	1176151	15943549	hadcm3n_o0f5_1980_40_008398521_2	181,440	267,328	1.4734
11 Sep 2013 14:32:37	1176151	15943549	hadcm3n_o0f5_1980_40_008398521_2	155,520	229,662	1.4767
08 Sep 2013 15:44:04	1176151	15943549	hadcm3n_o0f5_1980_40_008398521_2	129,600	190,512	1.4700
07 Sep 2013 17:16:26	1176151	15943549	hadcm3n_o0f5_1980_40_008398521_2	103,680	152,412	1.4700
06 Sep 2013 16:29:46	1176151	15943549	hadcm3n_o0f5_1980_40_008398521_2	77,760	112,146	1.4422
04 Sep 2013 10:42:28	1176151	15943549	hadcm3n_o0f5_1980_40_008398521_2	51,840	73,087	1.4099
03 Sep 2013 06:04:16	1176151	15943549	hadcm3n_o0f5_1980_40_008398521_2	25,920	35,114	1.3547
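
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. This is an inference from the figures in the table, not something the page states. A minimal Python sketch that checks this reading against the rows above follows; the names `TRICKLES` and `seconds_per_timestep` are illustrative only and are not part of BOINC or CPDN.

```python
# Rows copied from the trickle table above, newest first:
# (timestep, CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (259_200, 381_438, 1.4716),
    (233_280, 344_070, 1.4749),
    (207_360, 305_888, 1.4752),
    (181_440, 267_328, 1.4734),
    (155_520, 229_662, 1.4767),
    (129_600, 190_512, 1.4700),
    (103_680, 152_412, 1.4700),
    (77_760, 112_146, 1.4422),
    (51_840, 73_087, 1.4099),
    (25_920, 35_114, 1.3547),
]


def seconds_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    # Print the computed value next to the value reported in the table.
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = seconds_per_timestep(timestep, cpu_time_sec)
        print(f"{timestep:>8,}  {cpu_time_sec:>8,}  {computed:.4f}  {reported:.4f}")
```

In the table, successive trickles are 25,920 timesteps apart, so the same data can also be differenced row to row to estimate the CPU cost of each interval on its own.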