Name | famous_v3wf_1199_200_006690821_6 |
Workunit | 6894074 |
Created | 6 Dec 2010, 22:12:06 UTC |
Sent | 6 Dec 2010, 22:15:19 UTC |
Report deadline | 8 Mar 2011, 5:42:30 UTC |
Received | 14 Feb 2011, 12:44:14 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1117493 |
Run time | 14 days 0 hours 6 min 53 sec |
CPU time | 12 days 19 hours 1 min 5 sec |
Validate state | Invalid |
Credit | 5,991.12 |
Device peak FLOPS | 2.32 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1244, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( 21:45:33 (3316): called boinc_finish 17:14:34 (4968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:12:58 (2544): Can't acquire lockfile (32) - waiting 35s 10:13:00 (4384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:37:59 (6848): Can't acquire lockfile (32) - waiting 35s 09:38:07 (1696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4684, iMonCtr=1 Model crash detected, will try to restart... Controller:Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 09:35:37 (4984): Can't acquire lockfile (32) - waiting 35s 09:35:54 (5772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:19:57 (5436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7584, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:05:48 (7828): Can't acquire lockfile (32) - waiting 35s 10:06:07 (4372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:46:04 (3580): Can't acquire lockfile (32) - waiting 35s 12:46:31 (1116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=556, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Sorry, too many model crashes! :-( 12:43:12 (4112): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Feb 2011 11:50:06 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,815,866 | 1,102,094 | 0.6069 |
14 Feb 2011 10:40:14 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,806,506 | 1,097,932 | 0.6078 |
14 Feb 2011 09:24:47 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,797,146 | 1,093,785 | 0.6086 |
13 Feb 2011 22:19:41 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,787,786 | 1,088,789 | 0.6090 |
13 Feb 2011 21:02:42 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,778,426 | 1,084,619 | 0.6099 |
13 Feb 2011 19:35:39 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,769,066 | 1,080,299 | 0.6107 |
13 Feb 2011 18:31:24 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,759,706 | 1,076,196 | 0.6116 |
13 Feb 2011 00:15:21 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,750,346 | 1,071,826 | 0.6124 |
12 Feb 2011 22:32:55 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,740,986 | 1,066,781 | 0.6127 |
12 Feb 2011 21:09:10 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,731,626 | 1,062,602 | 0.6136 |
12 Feb 2011 12:23:39 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,722,266 | 1,057,514 | 0.6140 |
11 Feb 2011 15:19:41 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,712,906 | 1,053,398 | 0.6150 |
11 Feb 2011 14:01:53 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,703,546 | 1,049,367 | 0.6160 |
11 Feb 2011 13:19:31 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,694,186 | 1,045,338 | 0.6170 |
11 Feb 2011 13:19:31 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,684,826 | 1,041,277 | 0.6180 |
11 Feb 2011 13:19:31 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,675,466 | 1,037,257 | 0.6191 |
11 Feb 2011 13:19:31 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,666,106 | 1,033,272 | 0.6202 |
10 Feb 2011 16:16:35 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,656,746 | 1,029,050 | 0.6211 |
10 Feb 2011 14:57:06 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,647,386 | 1,025,067 | 0.6222 |
10 Feb 2011 13:35:12 | 1117493 | 12366134 | famous_v3wf_1199_200_006690821_6 | 1,638,026 | 1,021,086 | 0.6234 |
©2024 cpdn.org