Task 14985041

Name	hadcm3n_yi2y_1980_40_008086093_0
Workunit	8241207
Created	23 Jul 2012, 16:36:44 UTC
Sent	23 Jul 2012, 17:00:24 UTC
Report deadline	23 Oct 2012, 0:27:35 UTC
Received	25 Aug 2012, 20:30:13 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1130678
Run time	19 days 12 hours 58 min 33 sec
CPU time	15 days 18 hours 3 min 59 sec
Validate state	Invalid
Credit	8,709.12
Device peak FLOPS	2.12 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 04:01:05 (11116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:06 (11116): No heartbeat from core client for 30 sec - exiting 04:01:07 (11116): No heartbeat from core client for 30 sec - exiting 04:01:08 (11116): No heartbeat from core client for 30 sec - exiting 04:01:09 (11116): No heartbeat from core client for 30 sec - exiting 04:01:10 (11116): No heartbeat from core client for 30 sec - exiting 04:01:11 (11116): No heartbeat from core client for 30 sec - exiting 04:01:12 (11116): No heartbeat from core client for 30 sec - exiting 04:01:13 (11116): No heartbeat from core client for 30 sec - exiting 04:01:14 (11116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:53:59 (6760): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:54:00 (6760): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Aug 2012 06:59:22	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	725,760	1,335,216	1.8397
24 Aug 2012 17:43:36	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	699,840	1,288,249	1.8408
24 Aug 2012 04:10:15	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	673,920	1,241,142	1.8417
23 Aug 2012 12:02:09	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	648,000	1,194,062	1.8427
22 Aug 2012 15:58:10	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	622,080	1,146,374	1.8428
22 Aug 2012 02:39:51	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	596,160	1,098,555	1.8427
21 Aug 2012 07:19:36	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	570,240	1,050,559	1.8423
20 Aug 2012 17:58:52	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	544,320	1,003,696	1.8439
20 Aug 2012 03:48:27	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	518,400	956,692	1.8455
19 Aug 2012 14:00:30	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	492,480	909,668	1.8471
19 Aug 2012 00:12:28	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	466,560	862,717	1.8491
18 Aug 2012 10:04:21	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	440,640	815,985	1.8518
15 Aug 2012 21:43:09	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	414,720	769,033	1.8543
14 Aug 2012 07:21:13	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	388,800	722,029	1.8571
13 Aug 2012 08:39:19	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	362,880	675,386	1.8612
12 Aug 2012 11:36:05	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	336,960	629,060	1.8669
11 Aug 2012 03:00:53	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	311,040	581,896	1.8708
08 Aug 2012 12:57:01	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	285,120	534,753	1.8755
07 Aug 2012 17:29:17	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	259,200	487,412	1.8804
06 Aug 2012 10:14:03	1130678	14985041	hadcm3n_yi2y_1980_40_008086093_0	233,280	439,749	1.8851