Task 13302621

Name	hadcm3n_o3wn_1980_40_007425962_1
Workunit	7623465
Created	26 Aug 2011, 19:53:22 UTC
Sent	27 Aug 2011, 5:27:05 UTC
Report deadline	26 Nov 2011, 12:54:16 UTC
Received	28 Sep 2011, 14:08:57 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1164126
Run time	6 days 4 hours 51 min 54 sec
CPU time	6 days 3 hours 2 min 3 sec
Validate state	Invalid
Credit	6,531.84
Device peak FLOPS	4.47 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:29:43 (5652): No heartbeat from core client for 30 sec - exiting 21:29:44 (5652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:33:13 (4336): No heartbeat from core client for 30 sec - exiting 11:33:14 (4336): No heartbeat from core client for 30 sec - exiting 11:33:15 (4336): No heartbeat from core client for 30 sec - exiting 11:33:16 (4336): No heartbeat from core client for 30 sec - exiting 11:33:17 (4336): No heartbeat from core client for 30 sec - exiting 11:33:18 (4336): No heartbeat from core client for 30 sec - exiting 11:33:19 (4336): No heartbeat from core client for 30 sec - exiting 11:33:20 (4336): No heartbeat from core client for 30 sec - exiting 11:33:21 (4336): No heartbeat from core client for 30 sec - exiting 11:33:22 (4336): No heartbeat from core client for 30 sec - exiting 11:33:23 (4336): No heartbeat from core client for 30 sec - exiting 11:33:25 (4336): No heartbeat from core client for 30 sec - exiting 11:33:26 (4336): No heartbeat from core client for 30 sec - exiting 11:33:27 (4336): No heartbeat from core client for 30 sec - exiting 11:33:28 (4336): No heartbeat from core client for 30 sec - exiting 11:33:29 (4336): No heartbeat from core client for 30 sec - exiting 11:33:30 (4336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:47:40 (4928): No heartbeat from core client for 30 sec - exiting 11:47:41 (4928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2992, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2992, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2992, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2992, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2992, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2992, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
28 Sep 2011 12:02:09	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	544,320	527,522	0.9691
28 Sep 2011 05:26:56	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	518,400	504,568	0.9733
27 Sep 2011 19:38:16	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	492,480	480,594	0.9759
27 Sep 2011 09:43:43	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	466,560	456,808	0.9791
27 Sep 2011 02:18:43	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	440,640	433,455	0.9837
26 Sep 2011 18:35:45	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	414,720	409,926	0.9884
26 Sep 2011 10:05:35	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	388,800	386,225	0.9934
26 Sep 2011 02:15:34	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	362,880	362,099	0.9978
25 Sep 2011 17:38:12	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	336,960	334,121	0.9916
25 Sep 2011 10:16:33	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	311,040	308,334	0.9913
24 Sep 2011 22:22:04	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	285,120	283,423	0.9940
24 Sep 2011 04:24:04	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	259,200	258,407	0.9969
23 Sep 2011 07:52:07	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	233,280	232,675	0.9974
22 Sep 2011 22:27:16	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	207,360	205,753	0.9923
22 Sep 2011 10:19:02	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	181,440	180,768	0.9963
22 Sep 2011 00:02:10	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	155,520	155,684	1.0011
21 Sep 2011 17:27:09	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	129,600	131,106	1.0116
20 Sep 2011 17:50:54	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	103,680	104,636	1.0092
20 Sep 2011 09:49:04	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	77,760	78,165	1.0052
13 Sep 2011 19:50:49	1164126	13302621	hadcm3n_o3wn_1980_40_007425962_1	51,840	51,899	1.0011