Task 13650669

Name	hadcm3n_yei9_1900_40_007517980_2
Workunit	7715455
Created	21 Nov 2011, 6:48:21 UTC
Sent	21 Nov 2011, 7:29:13 UTC
Report deadline	20 Feb 2012, 14:56:24 UTC
Received	28 Dec 2011, 21:11:09 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1132123
Run time	11 days 9 hours 13 min 46 sec
CPU time	11 days 5 hours 10 min 26 sec
Validate state	Invalid
Credit	8,398.08
Device peak FLOPS	3.15 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:16:06 (1516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:18:05 (3904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:00:19 (1492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:06:15 (2612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:06:16 (2612): No heartbeat from core client for 30 sec - exiting
20:06:17 (2612): No heartbeat from core client for 30 sec - exiting
20:06:18 (2612): No heartbeat from core client for 30 sec - exiting
20:06:19 (2612): No heartbeat from core client for 30 sec - exiting
20:06:20 (2612): No heartbeat from core client for 30 sec - exiting
20:06:21 (2612): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
13 Dec 2011 09:11:39	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	699,840	943,843	1.3487
12 Dec 2011 23:52:40	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	673,920	910,851	1.3516
12 Dec 2011 14:35:14	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	648,000	877,587	1.3543
12 Dec 2011 04:02:10	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	622,080	844,382	1.3574
11 Dec 2011 18:36:48	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	596,160	811,049	1.3605
11 Dec 2011 07:51:35	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	570,240	776,871	1.3624
10 Dec 2011 20:43:22	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	544,320	740,295	1.3600
10 Dec 2011 11:51:44	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	518,400	706,886	1.3636
10 Dec 2011 02:04:37	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	492,480	673,943	1.3685
09 Dec 2011 16:54:34	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	466,560	641,337	1.3746
09 Dec 2011 07:42:22	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	440,640	608,640	1.3813
08 Dec 2011 22:25:00	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	414,720	575,543	1.3878
08 Dec 2011 13:04:16	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	388,800	542,516	1.3954
08 Dec 2011 03:51:39	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	362,880	509,681	1.4045
07 Dec 2011 18:41:03	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	336,960	476,549	1.4143
06 Dec 2011 20:46:06	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	311,040	443,253	1.4251
06 Dec 2011 11:35:42	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	285,120	409,751	1.4371
03 Dec 2011 21:01:41	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	259,200	374,777	1.4459
03 Dec 2011 10:06:34	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	233,280	336,715	1.4434
02 Dec 2011 23:53:58	1132123	13650669	hadcm3n_yei9_1900_40_007517980_2	207,360	298,860	1.4413
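
The page does not say how the Average (sec/TS) column is derived. The values are consistent with cumulative CPU time divided by the timestep count, rounded to four decimals; that reading is an inference from the numbers, not something the page states. A minimal sketch that checks it against three rows copied from the table above (the names `TRICKLES` and `average_sec_per_timestep` are made up for this illustration):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
TRICKLES = [
    (699_840, 943_843, 1.3487),
    (673_920, 910_851, 1.3516),
    (207_360, 298_860, 1.4413),
]


def average_sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Assumed formula: cumulative CPU seconds per model timestep, 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


def check_rows(rows):
    """Return (timestep, computed, reported) for each row so they can be compared."""
    return [
        (ts, average_sec_per_timestep(ts, cpu), reported)
        for ts, cpu, reported in rows
    ]


if __name__ == "__main__":
    for ts, computed, reported in check_rows(TRICKLES):
        print(f"timestep {ts}: computed {computed:.4f}, reported {reported:.4f}")
```

Under this reading the average falls from 1.4413 to 1.3487 sec/TS across the listed trickles, so the later timesteps took less CPU time each than the earlier ones.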