Task 12928340

Name	hadcm3n_o2ft_1940_40_007267387_1
Workunit	7465627
Created	3 Jun 2011, 3:03:06 UTC
Sent	3 Jun 2011, 3:03:20 UTC
Report deadline	2 Sep 2011, 10:30:31 UTC
Received	16 Jun 2011, 0:06:31 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1064106
Run time	3 days 18 hours 6 min 33 sec
CPU time	3 days 13 hours 6 min 6 sec
Validate state	Invalid
Credit	2,177.28
Device peak FLOPS	2.81 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:30:44 (4184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:15:45 (4236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:15:46 (4236): No heartbeat from core client for 30 sec - exiting
09:15:47 (4236): No heartbeat from core client for 30 sec - exiting
09:15:48 (4236): No heartbeat from core client for 30 sec - exiting
09:15:49 (4236): No heartbeat from core client for 30 sec - exiting
09:15:50 (4236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5240, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5240, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5240, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5240, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5240, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5240, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
14 Jun 2011 08:16:20	1064106	12928340	hadcm3n_o2ft_1940_40_007267387_1	181,440	275,198	1.5167
13 Jun 2011 11:19:58	1064106	12928340	hadcm3n_o2ft_1940_40_007267387_1	155,520	235,633	1.5151
12 Jun 2011 23:46:01	1064106	12928340	hadcm3n_o2ft_1940_40_007267387_1	129,600	195,415	1.5078
11 Jun 2011 10:10:59	1064106	12928340	hadcm3n_o2ft_1940_40_007267387_1	103,680	155,592	1.5007
10 Jun 2011 22:39:27	1064106	12928340	hadcm3n_o2ft_1940_40_007267387_1	77,760	115,996	1.4917
10 Jun 2011 03:59:37	1064106	12928340	hadcm3n_o2ft_1940_40_007267387_1	51,840	77,417	1.4934
09 Jun 2011 17:35:02	1064106	12928340	hadcm3n_o2ft_1940_40_007267387_1	25,920	38,326	1.4786
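
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below recomputes it from the rows above as a sanity check. The names TRICKLES, seconds_per_timestep and recomputed_averages are illustrative only and are not part of BOINC or the CPDN software; rounding to four decimals is an assumption based on how the table is displayed.

```python
# Trickle rows copied from the table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
TRICKLES = [
    (181_440, 275_198, 1.5167),
    (155_520, 235_633, 1.5151),
    (129_600, 195_415, 1.5078),
    (103_680, 155_592, 1.5007),
    (77_760, 115_996, 1.4917),
    (51_840, 77_417, 1.4934),
    (25_920, 38_326, 1.4786),
]


def seconds_per_timestep(cpu_seconds, timestep):
    """Cumulative CPU seconds divided by cumulative timesteps."""
    return cpu_seconds / timestep


def recomputed_averages(rows):
    """Recompute the sec/TS column, rounded to 4 decimals (assumed display precision)."""
    return [round(seconds_per_timestep(cpu, ts), 4) for ts, cpu, _ in rows]


if __name__ == "__main__":
    for (ts, cpu, reported), calc in zip(TRICKLES, recomputed_averages(TRICKLES)):
        print(f"{ts:>7} {cpu:>7} reported={reported:.4f} recomputed={calc:.4f}")
```

In this data each trickle is 25,920 timesteps after the previous one, and the recomputed averages match the reported column for every row.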