Task 13124384

Name	hadcm3n_yl52_1900_40_007360256_1
Workunit	7557686
Created	6 Jul 2011, 15:11:34 UTC
Sent	7 Jul 2011, 19:25:42 UTC
Report deadline	7 Oct 2011, 2:52:53 UTC
Received	5 Sep 2011, 21:04:03 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1022218
Run time	11 days 17 hours 58 min 4 sec
CPU time	10 days 12 hours 50 min 42 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	2.60 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4856, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4668, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2432, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15740, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2740, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:21:08 (4156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:21:09 (4156): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>
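
The stderr output above is easier to read as a tally of message types. The following is a minimal sketch, assuming the log is available as plain text. The `tally_events` and `crashed_pids` helpers and the label names are hypothetical and are not part of BOINC or the CPDN application; the `sample` string is an abridged excerpt of the log above.

```python
import re
from collections import Counter

# Hypothetical labels mapped to substrings that appear in the stderr text.
PATTERNS = {
    "model_crash": "Model crash detected",
    "suspend_request": "Suspend request from BOINC",
    "quit_request": "Quit request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "signal_11": "Signal 11 received",
}


def tally_events(stderr_text: str) -> Counter:
    """Count how often each known message type occurs in the log text."""
    counts = Counter()
    for label, needle in PATTERNS.items():
        counts[label] = stderr_text.count(needle)
    return counts


def crashed_pids(stderr_text: str) -> list:
    """Return the selfPID values reported by the Controller lines, in order."""
    return [int(pid) for pid in re.findall(r"selfPID=(\d+)", stderr_text)]


# Abridged excerpt from the end of the stderr output above.
sample = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=5072, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "CPDN Monitor - Quit request from BOINC... "
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "13:21:08 (4156): No heartbeat from core client for 30 sec - exiting "
    "Signal 11 received, exiting..."
)

print(tally_events(sample))
print(crashed_pids(sample))
```

Substring counting works whether the log is flattened onto one line or split one message per line, so the same helpers can be run on the full stderr text.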

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
05 Sep 2011 19:23:04	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	518,400	910,239	1.7559
04 Sep 2011 11:01:30	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	492,480	867,297	1.7611
02 Sep 2011 02:45:05	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	466,560	824,955	1.7682
30 Aug 2011 19:31:58	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	440,640	781,376	1.7733
29 Aug 2011 15:33:29	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	414,720	737,805	1.7790
26 Aug 2011 15:34:45	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	388,800	693,014	1.7824
23 Aug 2011 21:30:56	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	362,880	644,182	1.7752
21 Aug 2011 17:01:05	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	336,960	597,405	1.7729
17 Aug 2011 15:25:30	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	311,040	552,722	1.7770
14 Aug 2011 21:37:39	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	285,120	510,326	1.7899
01 Aug 2011 02:36:43	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	259,200	466,217	1.7987
28 Jul 2011 19:28:09	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	233,280	419,636	1.7989
25 Jul 2011 22:10:02	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	207,360	373,718	1.8023
25 Jul 2011 20:52:10	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	181,440	326,654	1.8003
25 Jul 2011 19:41:37	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	155,520	280,571	1.8041
25 Jul 2011 19:14:23	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	129,600	233,529	1.8019
25 Jul 2011 17:26:45	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	103,680	187,043	1.8040
25 Jul 2011 14:46:54	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	77,760	139,331	1.7918
25 Jul 2011 14:14:27	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	51,840	93,319	1.8001
10 Jul 2011 17:54:45	1022218	13124384	hadcm3n_yl52_1900_40_007360256_1	25,920	44,803	1.7285
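
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. This relationship is inferred from the figures in the table and is not stated on the page. A short sketch that checks it against three of the rows above (the function name `avg_sec_per_timestep` is hypothetical):

```python
def avg_sec_per_timestep(cpu_time_sec: int, timesteps: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timesteps, 4)


# (timesteps, CPU time in seconds, reported average) copied from the table.
rows = [
    (518_400, 910_239, 1.7559),
    (492_480, 867_297, 1.7611),
    (25_920, 44_803, 1.7285),
]

for timesteps, cpu_time_sec, reported in rows:
    computed = avg_sec_per_timestep(cpu_time_sec, timesteps)
    # A match within rounding tolerance supports the inferred formula.
    matches = abs(computed - reported) < 1e-9
    print(timesteps, computed, reported, matches)
```

The last trickle's CPU time of 910,239 seconds is also close to the task's reported CPU time of 10 days 12 hours 50 min 42 sec, which is 910,242 seconds.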