Task 16235279

Name	hadcm3n_7ntq_1980_40_008442161_1
Workunit	8593017
Created	14 Jan 2014, 9:26:44 UTC
Sent	14 Jan 2014, 9:29:06 UTC
Report deadline	15 Apr 2014, 16:56:17 UTC
Received	8 Feb 2014, 14:24:25 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	-1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
Computer ID	1303522
Run time	6 days 8 hours 36 min 23 sec
CPU time	5 days 11 hours 14 min 21 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.99 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4792, iMonCtr=1 Model crash detected, will try to restart... 07:36:30 (3932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:38:42 (6412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7004, iMonCtr=1 Model crash detected, will try to restart... 10:41:49 (3000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:42:47 (1228): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPIDController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3836, iMonCtr=1 Model crash detected, will try to restart... 11:55:17 (1824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1 Model crash detected, will try to restart... 08:46:41 (5832): No heartbeat from core client for 30 sec - exiting 08:46:42 (5832): No heartbeat from core client for 30 sec - exiting 08:46:43 (5832): No heartbeat from core client for 30 sec - exiting 08:46:44 (5832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1 Model crash detected, will try to restart... 12:27:44 (1932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... C19:12:52 (5892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 20:55:15 (5820): No heartbeat from core client for 30 sec - exiting Error converting file to netcdf: dataout/7ntqko.pji7c10 Error converting file to netcdf: dataout/7ntqko.pii7c10 Error converting file to netcdf: dataout/7ntqko.pfi7c10 Error converting file to netcdf: dataout/7ntqka.phi7c10 Error converting file to netcdf: dataout/7ntqka.pgi7c10 Error converting file to netcdf: dataout/7ntqka.pei7c10 Error converting file to netcdf: dataout/7ntqka.pdi7c10 CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/7ntqko.pji7c10 Error converting file to netcdf: dataout/7ntqko.pii7c10 Error converting file to netcdf: dataout/7ntqko.pfi7c10 Error converting file to netcdf: dataout/7ntqka.phi7c10 Error converting file to netcdf: dataout/7ntqka.pgi7c10 Error converting file to netcdf: dataout/7ntqka.pei7c10 Error converting file to netcdf: dataout/7ntqka.pdi7c10 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=364, iMonCtr=1 Model crash detected, will try to restart... 08:36:31 (5072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:35:52 (5916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:55:08 (7636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77903AC3 read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x779943E0 read attempt to address 0x00000004 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
01 Feb 2014 18:25:44	1303522	16235279	hadcm3n_7ntq_1980_40_008442161_1	259,200	459,442	1.7725
28 Jan 2014 12:57:11	1303522	16235279	hadcm3n_7ntq_1980_40_008442161_1	233,280	412,684	1.7691
25 Jan 2014 21:14:45	1303522	16235279	hadcm3n_7ntq_1980_40_008442161_1	207,360	368,199	1.7757
24 Jan 2014 22:40:13	1303522	16235279	hadcm3n_7ntq_1980_40_008442161_1	181,440	322,303	1.7764
21 Jan 2014 20:35:56	1303522	16235279	hadcm3n_7ntq_1980_40_008442161_1	155,520	276,211	1.7760
20 Jan 2014 19:17:00	1303522	16235279	hadcm3n_7ntq_1980_40_008442161_1	129,600	229,757	1.7728
19 Jan 2014 19:32:20	1303522	16235279	hadcm3n_7ntq_1980_40_008442161_1	103,680	183,519	1.7701
18 Jan 2014 16:00:07	1303522	16235279	hadcm3n_7ntq_1980_40_008442161_1	77,760	137,281	1.7654
17 Jan 2014 13:03:04	1303522	16235279	hadcm3n_7ntq_1980_40_008442161_1	51,840	90,453	1.7448
15 Jan 2014 19:25:06	1303522	16235279	hadcm3n_7ntq_1980_40_008442161_1	25,920	44,722	1.7254