Task 13651912

Name	hadcm3n_yf31_1900_40_007517558_3
Workunit	7715033
Created	21 Nov 2011, 20:30:49 UTC
Sent	21 Nov 2011, 20:46:57 UTC
Report deadline	21 Feb 2012, 4:14:08 UTC
Received	14 Dec 2011, 13:43:37 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	984012
Run time	7 days 20 hours 12 min 26 sec
CPU time	7 days 11 hours 11 min 12 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.05 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.6.28</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4980, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4420, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=440, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2232, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1572, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5232, iMonCtr=1
Model crash detected, will try to restart...
18:45:27 (1536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:45:30 (1536): No heartbeat from core client for 30 sec - exiting
18:45:31 (1536): No heartbeat from core client for 30 sec - exiting
11:07:18 (1048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:07:19 (1048): No heartbeat from core client for 30 sec - exiting
11:07:20 (1048): No heartbeat from core client for 30 sec - exiting
11:07:21 (1048): No heartbeat from core client for 30 sec - exiting
11:07:22 (1048): No heartbeat from core client for 30 sec - exiting
11:07:23 (1048): No heartbeat from core client for 30 sec - exiting
11:07:24 (1048): No heartbeat from core client for 30 sec - exiting
11:07:25 (1048): No heartbeat from core client for 30 sec - exiting
11:07:26 (1048): No heartbeat from core client for 30 sec - exiting
11:07:28 (1048): No heartbeat from core client for 30 sec - exiting
11:07:29 (1048): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3300, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=288, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4068, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1932, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>
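
The stderr text above repeats a small set of messages (model restarts, quit requests, missed heartbeats, and a final signal). As a rough sketch of how such a log could be tallied, the following Python counts occurrences of each message type. The labels in `PATTERNS`, the function name `summarize_stderr`, and the short `sample` string are illustrative assumptions and not part of BOINC or CPDN; the sample is a brief excerpt of the log, not the full text.

```python
from collections import Counter

# Hypothetical labels mapped to substrings that appear verbatim in the stderr text.
PATTERNS = {
    "model_restart": "Model crash detected, will try to restart",
    "quit_request": "CPDN Monitor - Quit request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "signal": "Signal 11 received",
}

def summarize_stderr(text):
    """Count how often each known message type occurs in the stderr text."""
    counts = Counter()
    for label, needle in PATTERNS.items():
        counts[label] = text.count(needle)
    return counts

# Short excerpt of the log above, used only to exercise the function.
sample = (
    "Model crash detected, will try to restart... "
    "CPDN Monitor - Quit request from BOINC... "
    "18:45:27 (1536): No heartbeat from core client for 30 sec - exiting "
    "Model crash detected, will try to restart... "
    "Signal 11 received, exiting... Called boinc_finish"
)

summary = summarize_stderr(sample)
for label, n in summary.items():
    print(f"{label}: {n}")
```

Plain substring counting is used here because the messages are fixed strings; it works whether or not the log has kept its original line breaks.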

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
13 Dec 2011 23:10:09	984012	13651912	hadcm3n_yf31_1900_40_007517558_3	259,200	645,049	2.4886
10 Dec 2011 20:18:09	984012	13651912	hadcm3n_yf31_1900_40_007517558_3	233,280	581,167	2.4913
08 Dec 2011 21:59:52	984012	13651912	hadcm3n_yf31_1900_40_007517558_3	207,360	516,145	2.4891
06 Dec 2011 23:12:26	984012	13651912	hadcm3n_yf31_1900_40_007517558_3	181,440	450,856	2.4849
03 Dec 2011 23:12:24	984012	13651912	hadcm3n_yf31_1900_40_007517558_3	155,520	386,137	2.4829
02 Dec 2011 16:03:53	984012	13651912	hadcm3n_yf31_1900_40_007517558_3	129,600	322,975	2.4921
30 Nov 2011 15:24:24	984012	13651912	hadcm3n_yf31_1900_40_007517558_3	103,680	258,760	2.4958
28 Nov 2011 17:32:57	984012	13651912	hadcm3n_yf31_1900_40_007517558_3	77,760	194,638	2.5031
25 Nov 2011 18:54:21	984012	13651912	hadcm3n_yf31_1900_40_007517558_3	51,840	129,883	2.5055
23 Nov 2011 19:42:17	984012	13651912	hadcm3n_yf31_1900_40_007517558_3	25,920	64,994	2.5075
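
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count. A minimal Python sketch of that arithmetic, using three rows copied from the table above, is shown below; the helper name `seconds_per_timestep` and the four-decimal rounding are assumptions made for illustration.

```python
# (timestep, cpu_time_sec, reported_average) taken from rows of the trickle table.
TRICKLES = [
    (259_200, 645_049, 2.4886),
    (233_280, 581_167, 2.4913),
    (25_920, 64_994, 2.5075),
]

def seconds_per_timestep(cpu_time_sec, timestep):
    """Average CPU seconds per model timestep, rounded to four decimals."""
    return round(cpu_time_sec / timestep, 4)

# Compare the recomputed value with the one reported in the table.
for timestep, cpu_time, reported in TRICKLES:
    computed = seconds_per_timestep(cpu_time, timestep)
    print(f"{timestep:>7} TS: computed {computed:.4f}, reported {reported:.4f}")
```

For these rows the recomputed values agree with the reported column to four decimal places.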