Task 11596085

Name	famous_uo7b_1799_200_006664210_5
Workunit	6867582
Created	3 Jul 2010, 14:01:09 UTC
Sent	3 Jul 2010, 15:46:32 UTC
Report deadline	2 Oct 2010, 23:13:43 UTC
Received	28 Jan 2013, 14:56:56 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	877610
Run time	2 days 0 hours 57 min 51 sec
CPU time	2 days 0 hours 57 min 51 sec
Validate state	Invalid
Credit	1,544.17
Device peak FLOPS	2.76 GFLOPS
Application version	UK Met Office FAMOUS v6.11 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
18:01:14 (3388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
16:11:20 (380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
18:19:55 (4156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
12:15:13 (5588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
20:34:53 (5200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
12:00:54 (5096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy
Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy
Sorry, too many model crashes! :-(
10:23:44 (4000): called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
22 Jul 2010 19:15:58	877610	11596085	famous_uo7b_1799_200_006664210_5	468,026	173,412	0.3705
22 Jul 2010 18:01:31	877610	11596085	famous_uo7b_1799_200_006664210_5	458,666	169,975	0.3706
22 Jul 2010 16:56:55	877610	11596085	famous_uo7b_1799_200_006664210_5	449,306	166,533	0.3706
22 Jul 2010 16:00:44	877610	11596085	famous_uo7b_1799_200_006664210_5	439,946	163,055	0.3706
20 Jul 2010 22:29:19	877610	11596085	famous_uo7b_1799_200_006664210_5	430,586	159,626	0.3707
20 Jul 2010 22:01:54	877610	11596085	famous_uo7b_1799_200_006664210_5	421,226	158,323	0.3759
20 Jul 2010 20:04:31	877610	11596085	famous_uo7b_1799_200_006664210_5	411,866	152,866	0.3712
20 Jul 2010 18:39:35	877610	11596085	famous_uo7b_1799_200_006664210_5	402,506	149,500	0.3714
20 Jul 2010 17:36:01	877610	11596085	famous_uo7b_1799_200_006664210_5	393,146	146,060	0.3715
20 Jul 2010 16:35:27	877610	11596085	famous_uo7b_1799_200_006664210_5	383,786	142,645	0.3717
18 Jul 2010 20:04:12	877610	11596085	famous_uo7b_1799_200_006664210_5	374,426	139,266	0.3719
18 Jul 2010 19:02:06	877610	11596085	famous_uo7b_1799_200_006664210_5	365,066	135,856	0.3721
18 Jul 2010 18:00:16	877610	11596085	famous_uo7b_1799_200_006664210_5	355,706	132,450	0.3724
18 Jul 2010 16:57:43	877610	11596085	famous_uo7b_1799_200_006664210_5	346,346	129,086	0.3727
18 Jul 2010 15:55:37	877610	11596085	famous_uo7b_1799_200_006664210_5	336,986	125,714	0.3731
18 Jul 2010 14:52:35	877610	11596085	famous_uo7b_1799_200_006664210_5	327,626	122,268	0.3732
18 Jul 2010 13:34:59	877610	11596085	famous_uo7b_1799_200_006664210_5	318,266	118,814	0.3733
18 Jul 2010 11:45:51	877610	11596085	famous_uo7b_1799_200_006664210_5	308,906	115,421	0.3736
18 Jul 2010 10:42:30	877610	11596085	famous_uo7b_1799_200_006664210_5	299,546	112,023	0.3740
18 Jul 2010 09:42:31	877610	11596085	famous_uo7b_1799_200_006664210_5	290,186	108,639	0.3744
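
The Average (sec/TS) column in the table above appears to be the cumulative CPU time divided by the timestep count. The following is a minimal Python sketch that checks this reading against four rows copied from the table. The function name `sec_per_timestep` and the `ROWS` list are hypothetical helpers introduced for illustration, and the formula is an assumption inferred from the numbers, not something the page states.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
ROWS = [
    (468026, 173412, 0.3705),
    (458666, 169975, 0.3706),
    (421226, 158323, 0.3759),
    (290186, 108639, 0.3744),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in ROWS:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Compare the recomputed value with the value shown on the page.
    print(f"TS={timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

Under this assumption the recomputed values agree with the reported column to four decimal places for the rows sampled.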