Task 13389858

Name	hadcm3n_t59t_1940_40_007448189_3
Workunit	7645692
Created	15 Sep 2011, 16:27:44 UTC
Sent	15 Sep 2011, 16:34:26 UTC
Report deadline	16 Dec 2011, 0:01:37 UTC
Received	9 Oct 2011, 13:27:15 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1041747
Run time	6 days 22 hours 50 min 8 sec
CPU time	6 days 0 hours 24 min 46 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.30 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6072, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5792, iMonCtr=1
Model crash detected, will try to restart...
10:51:23 (4740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5368, iMonCtr=1
Model crash detected, will try to restart...
17:39:22 (5324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2312, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4996, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6524, iMonCtr=1
Model crash detected, will try to restart...
15:48:46 (6788): No heartbeat from core client for 30 sec - exiting
15:48:47 (6788): No heartbeat from core client for 30 sec - exiting
15:48:48 (6788): No heartbeat from core client for 30 sec - exiting
15:48:49 (6788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:51:23 (5304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7716, iMonCtr=1
Model crash detected, will try to restart...
06:59:42 (4756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5628, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
11:53:35 (5956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5988, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
15:47:49 (2952): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:49:40 (7512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:33:03 (5452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>
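The stderr text above repeats a small set of message types (model restarts, lost heartbeats, quit requests, and a final signal). A minimal sketch of tallying them follows. The `excerpt` string is a handful of messages copied from the log (not contiguous in the original), and the names `PATTERNS` and `tally_events` are hypothetical helpers introduced only for illustration; they are not part of BOINC or the CPDN application.

```python
import re
from collections import Counter

# A few messages copied from the stderr text above (not contiguous in the log).
excerpt = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=6072, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "10:51:23 (4740): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "CPDN Monitor - Quit request from BOINC... "
    "Signal 11 received, exiting... Called boinc_finish"
)

# Hypothetical event labels mapped to the literal phrases seen in the log.
PATTERNS = {
    "model_restart": r"Model crash detected, will try to restart",
    "heartbeat_lost": r"No heartbeat from core client",
    "quit_request": r"Quit request from BOINC",
    "signal": r"Signal \d+ received",
}

def tally_events(text):
    """Count how many times each known message type occurs in the text."""
    return Counter({name: len(re.findall(pat, text)) for name, pat in PATTERNS.items()})

counts = tally_events(excerpt)
for name, n in counts.items():
    print(name, n)
```

Passing the full stderr text instead of the excerpt would give per-type totals for the whole run; the patterns match on fixed phrases, so they work whether or not the log has line breaks.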

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
08 Oct 2011 18:48:23	1041747	13389858	hadcm3n_t59t_1940_40_007448189_3	259,200	519,867	2.0057
05 Oct 2011 17:47:05	1041747	13389858	hadcm3n_t59t_1940_40_007448189_3	233,280	467,002	2.0019
02 Oct 2011 19:08:56	1041747	13389858	hadcm3n_t59t_1940_40_007448189_3	207,360	415,270	2.0027
01 Oct 2011 14:59:07	1041747	13389858	hadcm3n_t59t_1940_40_007448189_3	181,440	362,946	2.0004
29 Sep 2011 18:06:50	1041747	13389858	hadcm3n_t59t_1940_40_007448189_3	155,520	311,443	2.0026
27 Sep 2011 16:39:38	1041747	13389858	hadcm3n_t59t_1940_40_007448189_3	129,600	258,572	1.9952
25 Sep 2011 14:59:27	1041747	13389858	hadcm3n_t59t_1940_40_007448189_3	103,680	206,638	1.9930
23 Sep 2011 15:30:53	1041747	13389858	hadcm3n_t59t_1940_40_007448189_3	77,760	155,509	1.9999
21 Sep 2011 15:01:34	1041747	13389858	hadcm3n_t59t_1940_40_007448189_3	51,840	104,112	2.0083
18 Sep 2011 14:37:32	1041747	13389858	hadcm3n_t59t_1940_40_007448189_3	25,920	52,741	2.0348
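The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. A minimal sketch that checks this against two rows of the table; the function name `seconds_per_timestep` is a hypothetical helper, and the rounding rule is an assumption inferred from the displayed values.

```python
def seconds_per_timestep(cpu_seconds, timesteps):
    """Average CPU seconds per model timestep, rounded to 4 decimals (assumed)."""
    return round(cpu_seconds / timesteps, 4)

# (timestep, CPU time in seconds) pairs taken from the table above.
trickles = [
    (259200, 519867),  # 08 Oct 2011 trickle, listed average 2.0057
    (25920, 52741),    # 18 Sep 2011 trickle, listed average 2.0348
]

for timesteps, cpu_seconds in trickles:
    print(timesteps, seconds_per_timestep(cpu_seconds, timesteps))
```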