Task 11777620

Name	famous_vdqk_1799_200_006703570_1
Workunit	6906823
Created	26 Aug 2010, 16:48:17 UTC
Sent	28 Nov 2010, 18:50:58 UTC
Report deadline	28 Feb 2011, 2:18:09 UTC
Received	31 Dec 2010, 19:51:08 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1119500
Run time	5 days 18 hours 48 min 45 sec
CPU time	5 days 4 hours 28 min 46 sec
Validate state	Invalid
Credit	463.31
Device peak FLOPS	0.36 GFLOPS
Application version	UK Met Office FAMOUS v6.11 i686-pc-linux-gnu
Stderr	<core_client_version>6.10.17</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... (1015): called boinc_finish CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... (1040): No heartbeat from core client for 30 sec - exiting (1040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (1040): No heartbeat from core client for 30 sec - exiting (1040): No heartbeat from core client for 30 sec - exiting (1040): No heartbeat from core client for 30 sec - exiting (1040): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4827, iMonCtr=1 Model crash detected, will try to restart... (4827): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4827, iMonCtr=1 (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting Model crash detected, will try to restart... CPDN Monitor - No 'heartbeat' from BOINC... (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4886): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (4886): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1 Model crash detected, will try to restart... (4924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (4924): No heartbeat from core client for 30 sec - exiting (4924): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5035, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5035, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( (5035): called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
31 Dec 2010 05:03:25	1119500	11777620	famous_vdqk_1799_200_006703570_1	140,426	421,849	3.0041
22 Dec 2010 16:01:34	1119500	11777620	famous_vdqk_1799_200_006703570_1	131,066	393,809	3.0047
21 Dec 2010 19:07:35	1119500	11777620	famous_vdqk_1799_200_006703570_1	121,706	365,383	3.0022
17 Dec 2010 11:06:36	1119500	11777620	famous_vdqk_1799_200_006703570_1	112,346	337,085	3.0004
12 Dec 2010 11:21:22	1119500	11777620	famous_vdqk_1799_200_006703570_1	102,986	308,810	2.9986
11 Dec 2010 11:38:33	1119500	11777620	famous_vdqk_1799_200_006703570_1	93,626	280,556	2.9966
10 Dec 2010 16:11:40	1119500	11777620	famous_vdqk_1799_200_006703570_1	84,266	252,252	2.9935
09 Dec 2010 18:55:00	1119500	11777620	famous_vdqk_1799_200_006703570_1	74,906	224,123	2.9921
08 Dec 2010 20:20:32	1119500	11777620	famous_vdqk_1799_200_006703570_1	65,546	196,324	2.9952
01 Dec 2010 20:42:25	1119500	11777620	famous_vdqk_1799_200_006703570_1	56,186	168,424	2.9976
01 Dec 2010 11:14:22	1119500	11777620	famous_vdqk_1799_200_006703570_1	46,826	140,345	2.9972
01 Dec 2010 08:51:36	1119500	11777620	famous_vdqk_1799_200_006703570_1	37,466	112,289	2.9971
30 Nov 2010 19:41:04	1119500	11777620	famous_vdqk_1799_200_006703570_1	28,106	84,222	2.9966
30 Nov 2010 10:45:36	1119500	11777620	famous_vdqk_1799_200_006703570_1	18,746	56,171	2.9964
30 Nov 2010 07:14:52	1119500	11777620	famous_vdqk_1799_200_006703570_1	9,386	28,090	2.9928