Task 13389106

Name	hadcm3n_o6kx_1940_40_007448498_2
Workunit	7646001
Created	15 Sep 2011, 7:47:51 UTC
Sent	15 Sep 2011, 7:56:23 UTC
Report deadline	15 Dec 2011, 15:23:34 UTC
Received	13 Oct 2011, 7:25:07 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	970194
Run time	23 days 21 hours 3 min 3 sec
CPU time	19 days 7 hours 26 min 1 sec
Validate state	Invalid
Credit	7,776.00
Device peak FLOPS	2.14 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:17:39 (1076): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:17:40 (1076): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
09:33:53 (3420): Can't acquire lockfile (32) - waiting 35s
09:34:20 (3824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1888, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1888, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1888, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1888, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1888, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1888, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
13 Oct 2011 07:28:09	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	648,000	1,750,249	2.7010
10 Oct 2011 12:44:28	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	622,080	1,683,060	2.7055
10 Oct 2011 12:44:28	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	596,160	1,609,856	2.7004
10 Oct 2011 12:44:28	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	570,240	1,535,633	2.6930
10 Oct 2011 12:44:28	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	544,320	1,463,397	2.6885
10 Oct 2011 12:44:27	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	518,400	1,393,576	2.6882
03 Oct 2011 07:37:04	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	492,480	1,325,755	2.6920
03 Oct 2011 07:37:04	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	466,560	1,264,232	2.7097
03 Oct 2011 07:37:04	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	440,640	1,200,064	2.7235
03 Oct 2011 07:37:04	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	414,720	1,121,580	2.7044
03 Oct 2011 07:37:04	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	388,800	1,055,539	2.7149
03 Oct 2011 07:37:04	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	362,880	987,861	2.7223
03 Oct 2011 07:37:04	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	336,960	920,301	2.7312
26 Sep 2011 14:33:37	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	311,040	852,147	2.7397
26 Sep 2011 14:33:37	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	285,120	788,568	2.7657
26 Sep 2011 14:33:37	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	259,200	726,424	2.8026
26 Sep 2011 14:33:37	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	233,280	657,768	2.8197
26 Sep 2011 14:33:37	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	207,360	585,709	2.8246
26 Sep 2011 14:33:37	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	181,440	513,604	2.8307
21 Sep 2011 08:36:19	970194	13389106	hadcm3n_o6kx_1940_40_007448498_2	155,520	441,719	2.8403
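
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep reached. The page does not state this; it is an assumption inferred from the figures. A minimal Python sketch that checks it against three rows copied from the table (the names `TRICKLES`, `sec_per_timestep` and `rows_match` are illustrative, not part of BOINC or CPDN):

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (648_000, 1_750_249, 2.7010),
    (622_080, 1_683_060, 2.7055),
    (155_520, 441_719, 2.8403),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


def rows_match(rows, tol=1e-4):
    """Return True if every reported average agrees with the assumed
    formula to within tol (loose enough to absorb 4-decimal rounding)."""
    return all(
        abs(sec_per_timestep(cpu, ts) - reported) < tol
        for ts, cpu, reported in rows
    )


if __name__ == "__main__":
    for ts, cpu, reported in TRICKLES:
        print(ts, round(sec_per_timestep(cpu, ts), 4), reported)
    print("all rows consistent:", rows_match(TRICKLES))
```

Under that assumption, the falling average (from about 2.84 to about 2.70 sec/TS) means later timesteps cost less CPU time each than earlier ones on this host.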