Task 13111812

Name	hadcm3n_ygah_1900_40_007353971_0
Workunit	7551401
Created	6 Jul 2011, 14:30:40 UTC
Sent	15 Jul 2011, 15:05:26 UTC
Report deadline	14 Oct 2011, 22:32:37 UTC
Received	8 Aug 2011, 4:28:49 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	-226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID	1122800
Run time	8 days 7 hours 34 min 19 sec
CPU time	7 days 23 hours 59 min 3 sec
Validate state	Invalid
Credit	6,842.88
Device peak FLOPS	3.11 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5148, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7988, iMonCtr=1 Model crash detected, will try to restart... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5296, iMonCtr=1 Model crash detected, will try to restart... 08:42:12 (5188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:44:56 (5188): No heartbeat from core client for 30 sec - exiting 08:45:04 (5188): No heartbeat from core client for 30 sec - exiting 08:45:05 (5188): No heartbeat from core client for 30 sec - exiting 08:45:06 (5188): No heartbeat from core client for 30 sec - exiting 08:45:07 (5188): No heartbeat from core client for 30 sec - exiting 08:45:08 (5188): No heartbeat from core client for 30 sec - exiting 08:45:09 (5188): No heartbeat from core client for 30 sec - exiting 08:45:10 (5188): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
04 Aug 2011 10:33:44	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	570,240	661,723	1.1604
03 Aug 2011 15:48:35	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	544,320	629,626	1.1567
03 Aug 2011 07:42:04	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	518,400	599,613	1.1567
02 Aug 2011 13:51:18	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	492,480	567,992	1.1533
01 Aug 2011 17:56:28	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	466,560	536,998	1.1510
01 Aug 2011 07:18:11	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	440,640	506,125	1.1486
28 Jul 2011 18:40:39	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	414,720	474,020	1.1430
28 Jul 2011 05:59:20	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	388,800	442,786	1.1389
27 Jul 2011 05:31:13	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	362,880	411,229	1.1332
25 Jul 2011 23:01:11	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	336,960	379,195	1.1253
25 Jul 2011 22:49:19	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	311,040	345,998	1.1124
25 Jul 2011 21:58:49	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	285,120	315,677	1.1072
25 Jul 2011 20:39:01	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	259,200	284,841	1.0989
25 Jul 2011 19:39:26	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	233,280	253,818	1.0880
25 Jul 2011 19:07:52	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	207,360	223,441	1.0776
25 Jul 2011 19:07:41	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	181,440	192,264	1.0597
25 Jul 2011 18:55:33	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	155,520	161,472	1.0383
25 Jul 2011 18:10:40	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	129,600	131,170	1.0121
25 Jul 2011 17:56:51	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	103,680	122,484	1.1814
25 Jul 2011 17:30:26	1122800	13111812	hadcm3n_ygah_1900_40_007353971_0	77,760	92,370	1.1879