Task 13949604

Name	hadcm3n_y91f_1900_40_007523858_4
Workunit	7721333
Created	20 Jan 2012, 23:53:46 UTC
Sent	20 Jan 2012, 23:53:50 UTC
Report deadline	21 Apr 2012, 7:21:01 UTC
Received	15 Feb 2012, 23:12:10 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1291906
Run time	17 days 11 hours 50 min 5 sec
CPU time	16 days 18 hours 30 min 56 sec
Validate state	Invalid
Credit	9,642.24
Device peak FLOPS	2.66 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:05:58 (4852): No heartbeat from core client for 30 sec - exiting
16:05:59 (4852): No heartbeat from core client for 30 sec - exiting
16:06:00 (4852): No heartbeat from core client for 30 sec - exiting
16:06:01 (4852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
15 Feb 2012 09:01:52	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	803,520	1,419,298	1.7664
15 Feb 2012 09:01:52	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	777,600	1,382,211	1.7775
15 Feb 2012 09:01:52	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	751,680	1,336,946	1.7786
11 Feb 2012 21:35:25	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	725,760	1,288,857	1.7759
10 Feb 2012 22:12:20	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	699,840	1,240,988	1.7732
09 Feb 2012 21:10:34	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	673,920	1,192,993	1.7702
09 Feb 2012 08:35:30	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	648,000	1,145,024	1.7670
08 Feb 2012 22:03:58	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	622,080	1,097,089	1.7636
08 Feb 2012 22:03:58	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	596,160	1,049,253	1.7600
07 Feb 2012 22:05:03	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	570,240	1,005,315	1.7630
06 Feb 2012 22:04:51	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	544,320	957,483	1.7590
06 Feb 2012 22:04:51	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	518,400	909,688	1.7548
05 Feb 2012 22:03:02	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	492,480	861,600	1.7495
03 Feb 2012 22:05:02	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	466,560	813,700	1.7440
03 Feb 2012 08:47:36	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	440,640	779,126	1.7682
02 Feb 2012 22:01:22	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	414,720	731,439	1.7637
01 Feb 2012 10:01:16	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	388,800	682,430	1.7552
31 Jan 2012 17:58:21	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	362,880	660,040	1.8189
31 Jan 2012 17:58:20	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	336,960	613,357	1.8203
30 Jan 2012 22:04:41	1131234	13949604	hadcm3n_y91f_1900_40_007523858_4	311,040	569,594	1.8313
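
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against three rows copied from the table. The function name `seconds_per_timestep` and the four-decimal rounding are assumptions made for illustration, not something the project documents here.

```python
# Rows copied from the trickle table above:
# (timestep, CPU time in seconds, reported average in sec/TS)
rows = [
    (803_520, 1_419_298, 1.7664),
    (777_600, 1_382_211, 1.7775),
    (311_040, 569_594, 1.8313),
]


def seconds_per_timestep(cpu_seconds, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals (assumed)."""
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in rows:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    # Print the computed value next to the reported one for comparison.
    print(f"{timestep:>9,}  computed={computed:.4f}  reported={reported:.4f}")
```

Because the average is cumulative, a change between consecutive rows reflects the speed of the most recent block of timesteps only in diluted form.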