Task 13103534

Name	hadcm3n_yd3j_1900_40_007349833_1
Workunit	7547263
Created	6 Jul 2011, 14:01:38 UTC
Sent	17 Jul 2011, 9:42:11 UTC
Report deadline	16 Oct 2011, 17:09:22 UTC
Received	9 Sep 2011, 0:34:37 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	255 (0x000000FF) Unknown error code
Computer ID	1006326
Run time	6 days 17 hours 14 min 12 sec
CPU time	6 days 10 hours 28 min 35 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.34 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The extended attributes are inconsistent. (0xff) - exit code 255 (0xff)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
10:23:41 (6744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2480, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2956, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5356, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CCPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5608, iMonCtr=1
Model crash detected, will try to restart...
12:08:25 (1028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4792, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2112, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3160, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3856, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x777EFC56
Engaging BOINC Windows Runtime Debugger...
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
08 Sep 2011 20:50:32	1006326	13103534	hadcm3n_yd3j_1900_40_007349833_1	259,200	545,245	2.1036
07 Sep 2011 14:44:35	1006326	13103534	hadcm3n_yd3j_1900_40_007349833_1	233,280	491,170	2.1055
05 Sep 2011 21:11:16	1006326	13103534	hadcm3n_yd3j_1900_40_007349833_1	207,360	436,997	2.1074
04 Sep 2011 16:49:41	1006326	13103534	hadcm3n_yd3j_1900_40_007349833_1	181,440	382,399	2.1076
25 Aug 2011 18:33:23	1006326	13103534	hadcm3n_yd3j_1900_40_007349833_1	155,520	329,889	2.1212
24 Aug 2011 15:15:36	1006326	13103534	hadcm3n_yd3j_1900_40_007349833_1	129,600	279,367	2.1556
17 Aug 2011 23:43:57	1006326	13103534	hadcm3n_yd3j_1900_40_007349833_1	103,680	227,001	2.1894
10 Aug 2011 23:06:01	1006326	13103534	hadcm3n_yd3j_1900_40_007349833_1	77,760	173,229	2.2277
07 Aug 2011 16:16:59	1006326	13103534	hadcm3n_yd3j_1900_40_007349833_1	51,840	115,653	2.2310
25 Jul 2011 22:03:32	1006326	13103534	hadcm3n_yd3j_1900_40_007349833_1	25,920	59,737	2.3047
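
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count reached at each trickle. That reading is an assumption inferred from the figures in the table, not something the page states. The short Python sketch below rechecks it against three rows copied from the table. The helper name `seconds_per_timestep` is hypothetical and used only for illustration.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
trickles = [
    (259_200, 545_245, 2.1036),
    (233_280, 491_170, 2.1055),
    (25_920, 59_737, 2.3047),
]


def seconds_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this is how the Average (sec/TS) column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in trickles:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the value shown in the table.
    print(f"TS {timestep:>7,}: computed {computed:.4f}, reported {reported:.4f}")
```

Under that assumption, the falling average from the first trickle to the last means the later timesteps used less CPU time each than the earliest ones.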