Task 12995447

Name	hadcm3n_o4cc_1940_40_007301705_1
Workunit	7499129
Created	22 Jun 2011, 14:37:20 UTC
Sent	22 Jun 2011, 14:37:30 UTC
Report deadline	21 Sep 2011, 22:04:41 UTC
Received	10 Jul 2011, 2:01:15 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	960478
Run time	12 days 3 hours 36 min 58 sec
CPU time	12 days 3 hours 2 min 50 sec
Validate state	Invalid
Credit	8,087.04
Device peak FLOPS	2.55 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:09:29 (2080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2144, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
09 Jul 2011 11:26:15	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	673,920	1,044,266	1.5495
08 Jul 2011 22:40:33	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	648,000	1,001,249	1.5451
08 Jul 2011 11:03:12	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	622,080	958,286	1.5405
07 Jul 2011 21:59:54	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	596,160	915,329	1.5354
07 Jul 2011 15:40:21	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	570,240	872,349	1.5298
07 Jul 2011 15:39:21	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	544,320	838,814	1.5410
07 Jul 2011 15:39:21	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	518,400	804,705	1.5523
07 Jul 2011 15:39:21	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	492,480	770,263	1.5640
05 Jul 2011 23:57:45	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	466,560	734,602	1.5745
05 Jul 2011 23:57:45	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	440,640	697,609	1.5832
04 Jul 2011 23:49:04	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	414,720	658,286	1.5873
04 Jul 2011 09:38:48	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	388,800	619,206	1.5926
03 Jul 2011 23:01:40	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	362,880	580,203	1.5989
03 Jul 2011 11:46:07	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	336,960	541,793	1.6079
03 Jul 2011 01:22:01	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	311,040	503,499	1.6188
02 Jul 2011 23:19:49	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	285,120	464,882	1.6305
02 Jul 2011 02:01:35	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	259,200	425,946	1.6433
02 Jul 2011 00:00:06	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	233,280	386,557	1.6571
01 Jul 2011 02:21:00	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	207,360	344,739	1.6625
30 Jun 2011 13:33:08	960478	12995447	hadcm3n_o4cc_1940_40_007301705_1	181,440	301,079	1.6594
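
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The page itself does not state this; it is an assumption inferred from the figures. The short Python sketch below checks that reading against three rows copied from the table. The names `trickles`, `sec_per_timestep` and `matches_reported` are made up for this illustration and are not part of BOINC or CPDN.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
trickles = [
    (673_920, 1_044_266, 1.5495),  # 09 Jul 2011 11:26:15
    (648_000, 1_001_249, 1.5451),  # 08 Jul 2011 22:40:33
    (181_440, 301_079, 1.6594),    # 30 Jun 2011 13:33:08
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


def matches_reported(rows, tol: float = 1e-4) -> list[bool]:
    """Compare the recomputed average with the reported value, row by row."""
    return [
        abs(sec_per_timestep(cpu, ts) - reported) <= tol
        for ts, cpu, reported in rows
    ]


if __name__ == "__main__":
    for (ts, cpu, reported), ok in zip(trickles, matches_reported(trickles)):
        print(f"{ts:>9,} TS  {sec_per_timestep(cpu, ts):.4f} vs {reported:.4f}  ok={ok}")
```

If the assumption holds, every row agrees with the reported value to four decimal places, and the slowly falling average reflects later timesteps running slightly faster than earlier ones.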