Task 13405839

Name	hadcm3n_u0pl_1980_40_007458051_1
Workunit	7655554
Created	22 Sep 2011, 12:54:40 UTC
Sent	26 Sep 2011, 2:26:58 UTC
Report deadline	26 Dec 2011, 9:54:09 UTC
Received	16 Nov 2011, 19:35:28 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1125445
Run time	25 days 1 hours 24 min 33 sec
CPU time	23 days 4 hours 44 min 4 sec
Validate state	Invalid
Credit	12,441.60
Device peak FLOPS	2.65 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4696, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=308, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1520, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4816, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 13:22:56 (4464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4964, iMonCtr=1 Model crash detected, will try to restart... 12:50:18 (4920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4776, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1640, iMonCtr=1 Model crash detected, will try to restart... 12:49:01 (4208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:49:39 (4260): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:49:41 (4260): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4636, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4636, iMonCtr=1 Model crash detected, will try to restart... 16:08:37 (4580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4300, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 03:12:07 (4960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CCPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4236, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x778B3A93 read attempt to address 0x40AA0E36 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77473A93 read attempt to address 0x40AA0E36 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_u0pl_1980_40_007458051/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
15 Nov 2011 22:40:01	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	1,036,800	2,003,521	1.9324
15 Nov 2011 19:00:20	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	1,010,880	1,953,633	1.9326
09 Nov 2011 16:44:02	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	984,960	1,900,792	1.9298
08 Nov 2011 17:17:56	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	959,040	1,850,202	1.9292
07 Nov 2011 05:55:51	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	933,120	1,802,517	1.9317
06 Nov 2011 08:51:13	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	907,200	1,751,887	1.9311
05 Nov 2011 15:34:21	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	881,280	1,701,326	1.9305
04 Nov 2011 06:51:53	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	855,360	1,650,961	1.9301
01 Nov 2011 06:53:40	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	829,440	1,597,715	1.9263
31 Oct 2011 19:37:46	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	803,520	1,544,867	1.9226
31 Oct 2011 19:02:52	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	777,600	1,498,128	1.9266
31 Oct 2011 18:37:17	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	751,680	1,449,614	1.9285
31 Oct 2011 18:18:23	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	725,760	1,401,665	1.9313
31 Oct 2011 17:22:55	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	699,840	1,354,109	1.9349
31 Oct 2011 16:46:22	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	673,920	1,305,071	1.9365
31 Oct 2011 14:24:42	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	648,000	1,255,623	1.9377
31 Oct 2011 13:51:41	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	622,080	1,208,349	1.9424
31 Oct 2011 13:51:41	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	596,160	1,164,615	1.9535
31 Oct 2011 13:51:41	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	570,240	1,117,266	1.9593
31 Oct 2011 13:51:41	1125445	13405839	hadcm3n_u0pl_1980_40_007458051_1	544,320	1,071,177	1.9679