Task 12428844

Name	famous_xr89_1499_200_007097120_0
Workunit	7300420
Created	18 Dec 2010, 17:25:25 UTC
Sent	21 Dec 2010, 14:38:43 UTC
Report deadline	22 Mar 2011, 22:05:54 UTC
Received	22 Dec 2010, 17:34:28 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1109904
Run time	12 hours 52 min 46 sec
CPU time	12 hours 16 min 4 sec
Validate state	Invalid
Credit	247.14
Device peak FLOPS	2.81 GFLOPS
Application version	UK Met Office FAMOUS v6.11 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 03:37:53 (748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:37:55 (748): No heartbeat from core client for 30 sec - exiting 03:39:18 (4172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:39:19 (4172): No heartbeat from core client for 30 sec - exiting 03:39:20 (4172): No heartbeat from core client for 30 sec - exiting 03:39:21 (4172): No heartbeat from core client for 30 sec - exiting 04:30:49 (6920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 04:32:38 (5776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... 08:59:06 (5752): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7948, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 08:59:11 (7460): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7948, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 08:59:15 (5508): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7948, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 08:59:20 (7356): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7948, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 08:59:24 (632): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7948, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 08:59:28 (4596): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7948, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( 08:59:33 (7948): called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
22 Dec 2010 12:50:16	1109904	12428844	famous_xr89_1499_200_007097120_0	74,906	40,617	0.5422
22 Dec 2010 11:21:21	1109904	12428844	famous_xr89_1499_200_007097120_0	65,546	35,584	0.5429
22 Dec 2010 11:21:21	1109904	12428844	famous_xr89_1499_200_007097120_0	56,186	30,478	0.5424
22 Dec 2010 11:21:21	1109904	12428844	famous_xr89_1499_200_007097120_0	46,826	25,555	0.5457
22 Dec 2010 11:21:21	1109904	12428844	famous_xr89_1499_200_007097120_0	37,466	20,385	0.5441
22 Dec 2010 11:21:21	1109904	12428844	famous_xr89_1499_200_007097120_0	28,106	15,266	0.5432
22 Dec 2010 11:21:21	1109904	12428844	famous_xr89_1499_200_007097120_0	18,746	10,183	0.5432
22 Dec 2010 11:21:21	1109904	12428844	famous_xr89_1499_200_007097120_0	9,386	5,069	0.5401