Task 14762336

Name	hadcm3n_100d_1980_40_007999208_2
Workunit	8154322
Created	31 May 2012, 19:27:39 UTC
Sent	31 May 2012, 19:29:31 UTC
Report deadline	31 Aug 2012, 2:56:42 UTC
Received	16 Aug 2012, 17:25:34 UTC
Server state	Over
Outcome	Computation error
Client state	Aborted by user
Exit status	203 (0x000000CB) EXIT_ABORTED_VIA_GUI
Computer ID	1168062
Run time	7 days 0 hours 17 min 13 sec
CPU time	6 days 18 hours 5 min 53 sec
Validate state	Invalid
Credit	4,665.60
Device peak FLOPS	2.65 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> aborted by user </message> <stderr_txt> 12:34:33 (3752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2904, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3224, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2804, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4024, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2996, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5380, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3768, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2860, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2800, iMonCtr=1 Model crash detected, will try to restart... 17:51:18 (2868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2900, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=136, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5080, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2776, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3652, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2392, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2392, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3304, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2824, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2772, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3004, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4060, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3380, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3828, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Abort request from BOINC... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
12 Aug 2012 15:16:54	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	388,800	555,663	1.4292
05 Aug 2012 17:47:34	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	362,880	518,781	1.4296
04 Aug 2012 13:56:11	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	336,960	481,993	1.4304
29 Jul 2012 17:07:07	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	311,040	445,374	1.4319
23 Jul 2012 18:11:21	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	285,120	408,045	1.4311
15 Jul 2012 14:43:23	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	259,200	370,860	1.4308
14 Jul 2012 10:10:16	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	233,280	332,986	1.4274
08 Jul 2012 08:44:56	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	207,360	295,007	1.4227
06 Jul 2012 15:55:29	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	181,440	257,572	1.4196
03 Jul 2012 11:33:32	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	155,520	219,910	1.4140
02 Jul 2012 15:48:41	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	129,600	182,325	1.4068
16 Jun 2012 15:16:36	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	103,680	145,823	1.4065
12 Jun 2012 17:24:53	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	77,760	110,205	1.4172
09 Jun 2012 11:20:03	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	51,840	74,369	1.4346
06 Jun 2012 04:51:00	1168062	14762336	hadcm3n_100d_1980_40_007999208_2	25,920	37,813	1.4588