Task 16201234

Name	hadcm3n_o91u_1900_40_008466853_3
Workunit	8617692
Created	3 Jan 2014, 18:25:41 UTC
Sent	3 Jan 2014, 18:26:01 UTC
Report deadline	5 Apr 2014, 1:53:12 UTC
Received	17 Feb 2014, 19:58:08 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1308532
Run time	11 days 3 hours 39 min 38 sec
CPU time	10 days 22 hours 17 min 8 sec
Validate state	Invalid
Credit	11,819.52
Device peak FLOPS	3.80 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.2.33</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4704, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4580, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
18 Feb 2014 12:02:36	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	984,960	943,633	0.9580
16 Feb 2014 17:16:56	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	959,040	919,048	0.9583
15 Feb 2014 23:47:18	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	933,120	894,441	0.9585
15 Feb 2014 16:54:55	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	907,200	869,823	0.9588
15 Feb 2014 10:01:58	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	881,280	845,176	0.9590
12 Feb 2014 18:44:57	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	855,360	820,381	0.9591
09 Feb 2014 22:45:16	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	829,440	795,701	0.9593
09 Feb 2014 10:20:56	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	803,520	770,986	0.9595
08 Feb 2014 16:30:38	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	777,600	745,369	0.9586
07 Feb 2014 23:03:42	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	751,680	720,041	0.9579
05 Feb 2014 20:37:58	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	725,760	695,073	0.9577
02 Feb 2014 22:44:18	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	699,840	670,254	0.9577
02 Feb 2014 12:59:13	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	673,920	645,040	0.9571
01 Feb 2014 19:05:54	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	648,000	619,901	0.9566
01 Feb 2014 11:52:30	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	622,080	594,786	0.9561
30 Jan 2014 18:23:26	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	596,160	570,074	0.9562
27 Jan 2014 19:08:22	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	570,240	545,328	0.9563
25 Jan 2014 14:23:25	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	544,320	520,514	0.9563
24 Jan 2014 18:39:09	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	518,400	495,811	0.9564
21 Jan 2014 21:31:09	1308532	16201234	hadcm3n_o91u_1900_40_008466853_3	492,480	470,932	0.9562