Task 15929203

Name	hadcm3n_o123_1980_40_008407489_0
Workunit	8558345
Created	20 Aug 2013, 12:06:03 UTC
Sent	20 Aug 2013, 18:55:00 UTC
Report deadline	20 Nov 2013, 2:22:11 UTC
Received	23 Sep 2013, 10:21:01 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1254500
Run time	32 days 1 hours 15 min 33 sec
CPU time	24 days 8 hours 51 min 49 sec
Validate state	Invalid
Credit	7,464.96
Device peak FLOPS	2.61 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> デバイスがコマンドを認識できません。 (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 05:30:50 (4532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:03:07 (87692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24548, iMonCtr=1 Model crash detected, will try to restart... 10:28:16 (5816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:29:28 (67200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:05:18 (2348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:23:20 (134768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:23:21 (134768): No heartbeat from core client for 30 sec - exiting 19:23:22 (134768): No heartbeat from core client for 30 sec - exiting 19:23:23 (134768): No heartbeat from core client for 30 sec - exiting 19:23:24 (134768): No heartbeat from core client for 30 sec - exiting 19:23:25 (134768): No heartbeat from core client for 30 sec - exiting 19:23:26 (134768): No heartbeat from core client for 30 sec - exiting 19:23:27 (134768): No heartbeat from core client for 30 sec - exiting 19:23:28 (134768): No heartbeat from core client for 30 sec - exiting 19:23:29 (134768): No heartbeat from core client for 30 sec - exiting 19:23:30 (134768): No heartbeat from core client for 30 sec - exiting 19:23:31 (134768): No heartbeat from core client for 30 sec - exiting 01:42:04 (145368): No heartbeat from core client for 30 sec - exiting 01:42:05 (145368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=460, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=460, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=460, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=460, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=460, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=460, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish 07:17:11 (4868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
23 Sep 2013 10:24:46	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	622,080	2,095,815	3.3690
20 Sep 2013 12:56:05	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	596,160	1,964,381	3.2951
18 Sep 2013 12:24:52	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	570,240	1,821,562	3.1944
17 Sep 2013 08:02:48	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	544,320	1,739,605	3.1959
16 Sep 2013 12:58:11	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	518,400	1,686,915	3.2541
15 Sep 2013 08:13:40	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	492,480	1,619,925	3.2893
14 Sep 2013 15:49:46	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	466,560	1,564,638	3.3536
13 Sep 2013 13:22:07	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	440,640	1,481,597	3.3624
12 Sep 2013 06:43:45	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	414,720	1,381,232	3.3305
11 Sep 2013 13:57:24	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	388,800	1,324,979	3.4079
10 Sep 2013 21:17:45	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	362,880	1,268,977	3.4970
10 Sep 2013 04:12:37	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	336,960	1,212,386	3.5980
09 Sep 2013 09:28:24	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	311,040	1,154,603	3.7121
08 Sep 2013 03:09:59	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	285,120	1,101,333	3.8627
06 Sep 2013 07:26:43	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	259,200	973,756	3.7568
04 Sep 2013 12:48:14	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	233,280	892,394	3.8254
02 Sep 2013 10:44:55	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	207,360	742,163	3.5791
31 Aug 2013 06:38:50	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	181,440	633,717	3.4927
29 Aug 2013 10:09:19	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	155,520	535,369	3.4424
28 Aug 2013 08:28:10	1254500	15929203	hadcm3n_o123_1980_40_008407489_0	129,600	480,116	3.7046