Task 15549976

Name	hadcm3n_n3oc_1880_40_008286910_0
Workunit	8438045
Created	17 Jan 2013, 22:02:16 UTC
Sent	17 Jan 2013, 22:05:30 UTC
Report deadline	19 Apr 2013, 5:32:41 UTC
Received	1 Feb 2013, 5:22:14 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1087370
Run time	14 days 4 hours 18 min 6 sec
CPU time	13 days 6 hours 25 min 45 sec
Validate state	Invalid
Credit	4,976.64
Device peak FLOPS	1.94 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.60</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:20:09 (4860): No heartbeat from core client for 30 sec - exiting
18:20:10 (4860): No heartbeat from core client for 30 sec - exiting
18:20:11 (4860): No heartbeat from core client for 30 sec - exiting
18:20:13 (4860): No heartbeat from core client for 30 sec - exiting
18:20:14 (4860): No heartbeat from core client for 30 sec - exiting
18:20:15 (4860): No heartbeat from core client for 30 sec - exiting
18:20:16 (4860): No heartbeat from core client for 30 sec - exiting
18:20:17 (4860): No heartbeat from core client for 30 sec - exiting
18:20:18 (4860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
31 Jan 2013 22:29:28	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	414,720	1,134,852	2.7364
31 Jan 2013 01:44:27	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	388,800	1,063,959	2.7365
30 Jan 2013 04:52:39	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	362,880	992,982	2.7364
29 Jan 2013 07:24:19	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	336,960	921,986	2.7362
28 Jan 2013 09:54:42	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	311,040	850,625	2.7348
27 Jan 2013 13:21:53	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	285,120	779,578	2.7342
26 Jan 2013 16:50:49	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	259,200	708,604	2.7338
25 Jan 2013 19:50:51	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	233,280	637,718	2.7337
24 Jan 2013 23:13:22	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	207,360	566,626	2.7326
24 Jan 2013 02:19:58	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	181,440	495,406	2.7304
23 Jan 2013 05:37:41	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	155,520	424,658	2.7306
22 Jan 2013 08:15:24	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	129,600	353,445	2.7272
21 Jan 2013 10:56:43	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	103,680	282,699	2.7266
20 Jan 2013 15:05:46	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	77,760	211,821	2.7240
19 Jan 2013 18:06:42	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	51,840	140,815	2.7163
18 Jan 2013 20:56:54	1087370	15549976	hadcm3n_n3oc_1880_40_008286910_0	25,920	70,085	2.7039
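
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against two rows copied from the table. The function name seconds_per_timestep is hypothetical and is not part of BOINC or the CPDN software.

```python
def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Average CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: the table's Average column is cumulative CPU time
    divided by cumulative timesteps.
    """
    return round(cpu_seconds / timestep, 4)


# (timestep, CPU time in seconds) pairs copied from the first and
# last trickles in the table above.
trickles = [
    (25_920, 70_085),
    (414_720, 1_134_852),
]

for timestep, cpu_seconds in trickles:
    print(timestep, seconds_per_timestep(cpu_seconds, timestep))
```

Under that assumption, both rows reproduce the tabulated averages (2.7039 and 2.7364).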