Task 15902576

Name	hadcm3n_n0ao_1880_40_008403058_0
Workunit	8553914
Created	23 Jul 2013, 10:55:07 UTC
Sent	23 Jul 2013, 14:16:33 UTC
Report deadline	22 Oct 2013, 21:43:44 UTC
Received	14 Aug 2013, 15:50:26 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1221759
Run time	6 days 11 hours 15 min 10 sec
CPU time	6 days 8 hours 23 min 11 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	3.80 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> El dispositivo no reconoce el comando. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish 08:28:35 (4284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:29:31 (5104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:30:21 (3184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:31:58 (2828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:54:28 (2164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:10:49 (4840): No heartbeat from core client for 30 sec - exiting 10:10:50 (4840): No heartbeat from core client for 30 sec - exiting 10:10:51 (4840): No heartbeat from core client for 30 sec - exiting 10:10:52 (4840): No heartbeat from core client for 30 sec - exiting 10:10:54 (4840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:24:05 (6408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: C I/O Error feof - Unit 110 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 110 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 110 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 110 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 110 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 110 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
14 Aug 2013 15:58:56	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	518,400	551,859	1.0645
14 Aug 2013 15:58:56	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	492,480	530,050	1.0763
14 Aug 2013 15:58:56	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	466,560	507,949	1.0887
14 Aug 2013 15:58:56	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	440,640	486,607	1.1043
14 Aug 2013 15:58:56	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	414,720	464,733	1.1206
14 Aug 2013 15:58:56	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	388,800	443,410	1.1405
14 Aug 2013 15:58:56	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	362,880	421,828	1.1624
14 Aug 2013 15:58:55	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	336,960	400,889	1.1897
30 Jul 2013 10:23:22	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	311,040	380,889	1.2246
30 Jul 2013 10:23:22	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	285,120	360,855	1.2656
30 Jul 2013 10:23:22	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	259,200	341,517	1.3176
29 Jul 2013 14:33:43	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	233,280	325,385	1.3948
29 Jul 2013 14:33:43	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	207,360	309,882	1.4944
29 Jul 2013 14:33:43	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	181,440	294,392	1.6225
29 Jul 2013 14:33:43	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	155,520	278,238	1.7891
25 Jul 2013 08:14:48	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	129,600	139,851	1.0791
25 Jul 2013 00:15:54	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	103,680	111,829	1.0786
24 Jul 2013 15:23:45	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	77,760	83,999	1.0802
24 Jul 2013 07:29:46	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	51,840	56,192	1.0840
23 Jul 2013 22:34:27	1221759	15902576	hadcm3n_n0ao_1880_40_008403058_0	25,920	28,239	1.0895