Task 13352952

Name	hadcm3n_t67c_1940_40_007444735_1
Workunit	7642238
Created	9 Sep 2011, 11:16:27 UTC
Sent	9 Sep 2011, 11:29:35 UTC
Report deadline	9 Dec 2011, 18:56:46 UTC
Received	19 Dec 2011, 8:24:30 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1014962
Run time	17 days 13 hours 45 min 5 sec
CPU time	14 days 1 hours 33 min 31 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	2.11 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.11.4</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6692, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6692, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:39:18 (8116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=856, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=856, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=856, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/t67cko.pjf4c10 Error converting file to netcdf: dataout/t67cko.pif4c10 Error converting file to netcdf: dataout/t67cko.pff4c10 Error converting file to netcdf: dataout/t67cka.phf4c10 Error converting file to netcdf: dataout/t67cka.pgf4c10 Error converting file to netcdf: dataout/t67cka.pef4c10 Error converting file to netcdf: dataout/t67cka.pdf4c10 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:32:20 (8128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7280, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7240, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77A96E0F read attempt to address 0x40AE128F Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77716E0F read attempt to address 0x40AE128F Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_t67c_1940_40_007444735/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
13 Dec 2011 15:03:09	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	518,400	1,210,305	2.3347
09 Dec 2011 10:18:00	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	492,480	1,154,058	2.3434
06 Dec 2011 11:45:45	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	466,560	1,091,216	2.3389
01 Dec 2011 13:17:34	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	440,640	1,033,260	2.3449
28 Nov 2011 09:15:39	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	414,720	974,110	2.3488
23 Nov 2011 10:17:26	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	388,800	916,003	2.3560
21 Nov 2011 09:08:34	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	362,880	858,460	2.3657
16 Nov 2011 07:59:34	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	336,960	797,549	2.3669
10 Nov 2011 09:23:36	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	311,040	739,560	2.3777
08 Nov 2011 19:15:31	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	285,120	667,750	2.3420
04 Nov 2011 15:58:56	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	259,200	604,943	2.3339
02 Nov 2011 13:30:41	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	233,280	546,051	2.3408
13 Oct 2011 11:38:15	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	207,360	487,547	2.3512
10 Oct 2011 14:05:57	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	181,440	430,318	2.3717
06 Oct 2011 07:42:13	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	155,520	366,118	2.3542
29 Sep 2011 14:10:37	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	129,600	307,473	2.3725
26 Sep 2011 13:32:58	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	103,680	238,647	2.3018
22 Sep 2011 09:58:49	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	77,760	179,695	2.3109
20 Sep 2011 10:39:30	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	51,840	126,894	2.4478
14 Sep 2011 15:33:38	1014962	13352952	hadcm3n_t67c_1940_40_007444735_1	25,920	59,011	2.2767