Name | famous_uf3i_1999_200_006719114_1 |
Workunit | 6922367 |
Created | 2 Oct 2010, 21:39:37 UTC |
Sent | 3 Oct 2010, 17:04:49 UTC |
Report deadline | 3 Jan 2011, 0:32:00 UTC |
Received | 22 Oct 2010, 5:17:25 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1105122 |
Run time | 9 days 1 hour 56 min 12 sec |
CPU time | 8 days 13 hours 31 min 45 sec |
Validate state | Invalid |
Credit | 5,991.12 |
Device peak FLOPS | 3.41 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:32:39 (2960): Can't acquire lockfile (32) - waiting 35s
07:32:58 (3768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
01:00:17 (4008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:59:34 (4452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 11 received, exiting...
17:10:51 (4060): called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Model crashed: ATM_DYN : INVALID THETA DETECTED.
tmp/pipe_dummy
Model crashed: ERR in FNZTOP - ITERATION HASN'T CONVERGED
Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.
tmp/pipe_dummy
Model crashed: ATM_DYN : INVALID THETA DETECTED.
tmp/pipe_dummy
Model crashed: SUBROUTINE POTTEM STOPPING - PRESSURE OUT OF RANGE
Sorry, too many model crashes! :-(
17:11:10 (4428): called boinc_finish
</stderr_txt>
]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
21 Oct 2010 22:03:43 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,815,866 | 739,494 | 0.4072 |
20 Oct 2010 21:16:54 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,806,506 | 735,815 | 0.4073 |
20 Oct 2010 20:10:39 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,797,146 | 732,154 | 0.4074 |
20 Oct 2010 19:03:49 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,787,786 | 728,503 | 0.4075 |
20 Oct 2010 18:02:00 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,778,426 | 724,855 | 0.4076 |
20 Oct 2010 16:56:27 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,769,066 | 721,223 | 0.4077 |
20 Oct 2010 15:50:23 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,759,706 | 717,566 | 0.4078 |
20 Oct 2010 14:49:15 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,750,346 | 713,946 | 0.4079 |
20 Oct 2010 13:38:05 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,740,986 | 710,353 | 0.4080 |
20 Oct 2010 12:36:08 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,731,626 | 706,756 | 0.4081 |
20 Oct 2010 11:34:10 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,722,266 | 703,253 | 0.4083 |
20 Oct 2010 10:33:01 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,712,906 | 699,767 | 0.4085 |
19 Oct 2010 23:59:57 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,703,546 | 696,198 | 0.4087 |
19 Oct 2010 22:58:55 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,694,186 | 692,704 | 0.4089 |
19 Oct 2010 21:57:52 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,684,826 | 689,221 | 0.4091 |
19 Oct 2010 20:56:33 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,675,466 | 685,801 | 0.4093 |
19 Oct 2010 19:58:31 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,666,106 | 682,421 | 0.4096 |
19 Oct 2010 18:59:38 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,656,746 | 679,060 | 0.4099 |
19 Oct 2010 17:57:57 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,647,386 | 675,703 | 0.4102 |
19 Oct 2010 17:00:21 | 1105122 | 11922025 | famous_uf3i_1999_200_006719114_1 | 1,638,026 | 672,351 | 0.4105 |
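
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against the newest and oldest rows shown; the function name `average_sec_per_timestep` is hypothetical and the formula is an assumption inferred from the table, not something the page states.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) copied from the newest and oldest trickle rows
rows = [
    (1_815_866, 739_494),  # 21 Oct 2010 22:03:43
    (1_638_026, 672_351),  # 19 Oct 2010 17:00:21
]

for timestep, cpu_time_sec in rows:
    print(timestep, average_sec_per_timestep(cpu_time_sec, timestep))
```

Under that assumption the two rows give 0.4072 and 0.4105, matching the table. Consecutive rows also differ by 9,360 timesteps, which suggests a trickle is sent at a fixed timestep interval.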