Task 16060647

Name	hadcm3n_o9xo_1900_40_008467999_2
Workunit	8618838
Created	7 Oct 2013, 20:30:05 UTC
Sent	7 Oct 2013, 21:05:35 UTC
Report deadline	7 Jan 2014, 4:32:46 UTC
Received	12 Oct 2013, 22:51:37 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1275614
Run time	4 days 12 hours 51 min 55 sec
CPU time	4 days 9 hours 27 min 21 sec
Validate state	Invalid
Credit	4,354.56
Device peak FLOPS	3.76 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6424, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6424, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6424, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6424, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6424, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6424, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
12 Oct 2013 12:59:39	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	362,880	367,989	1.0141
12 Oct 2013 05:33:41	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	336,960	341,760	1.0142
11 Oct 2013 22:09:29	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	311,040	315,550	1.0145
11 Oct 2013 14:35:43	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	285,120	289,084	1.0139
11 Oct 2013 07:08:41	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	259,200	262,691	1.0135
10 Oct 2013 23:34:59	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	233,280	236,559	1.0141
10 Oct 2013 16:00:05	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	207,360	210,362	1.0145
10 Oct 2013 08:30:54	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	181,440	184,058	1.0144
10 Oct 2013 01:06:57	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	155,520	157,868	1.0151
09 Oct 2013 12:45:26	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	129,600	131,824	1.0172
09 Oct 2013 04:25:51	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	103,680	105,382	1.0164
08 Oct 2013 21:02:03	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	77,760	79,287	1.0196
08 Oct 2013 13:14:56	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	51,840	52,843	1.0193
08 Oct 2013 05:32:11	1275614	16060647	hadcm3n_o9xo_1900_40_008467999_2	25,920	26,504	1.0225