Name | famous_w5pt_599_200_006754097_2 |
Workunit | 6957413 |
Created | 18 Dec 2010, 15:16:23 UTC |
Sent | 29 Dec 2010, 22:54:06 UTC |
Report deadline | 31 Mar 2011, 6:21:17 UTC |
Received | 7 Jan 2011, 16:16:57 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1081342 |
Run time | 4 days 20 hours 0 min 54 sec |
CPU time | 4 days 7 hours 14 min 44 sec |
Validate state | Invalid |
Credit | 3,891.17 |
Device peak FLOPS | 3.69 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:03:06 (1308): Can't acquire lockfile (32) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 06:24:59 (3524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4552, selfPID=4552, iMonCtr=1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:11:15 (9408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:36:46 (5024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:36:31 (6036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:36:32 (6036): No heartbeat from core client for 30 sec - exiting 14:36:33 (6036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 16:16:07 (6728): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:16:09 (6728): No heartbeat from core client for 30 sec - exiting 16:18:10 (8852): No heartbeat from core client for 30 sec - exiting 16:18:11 (8852): No heartbeat from core client for 30 sec - exiting 16:18:12 (8852): No heartbeat from core client for 30 sec - exiting 16:18:13 (8852): No heartbeat from core client for 30 sec - exiting 16:18:14 (8852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:18:15 (8852): No heartbeat from core client for 30 sec - exiting 16:56:28 (9028): No heartbeat from core client for 30 sec - exiting 16:56:30 (9028): No heartbeat from core client for 30 sec - exiting 16:56:32 (9028): No heartbeat from core client for 30 sec - exiting 16:56:33 (9028): No heartbeat from core client for 30 sec - exiting 16:56:34 (9028): No heartbeat from core client for 30 sec - exiting 16:56:35 (9028): No heartbeat from core client for 30 sec - exiting 16:56:36 (9028): No heartbeat from core client for 30 sec - exiting 16:56:37 (9028): No heartbeat from core client for 30 sec - exiting 16:56:38 (9028): No heartbeat from core client for 30 sec - exiting 16:56:39 (9028): No heartbeat from core client for 30 sec - exiting 16:56:41 (9028): No heartbeat from core client for 30 sec - exiting 16:56:42 (9028): No heartbeat from core client for 30 sec - exiting 16:56:43 (9028): No heartbeat from core client for 30 sec - exiting 16:56:44 (9028): No heartbeat from core client for 30 sec - exiting 16:56:45 (9028): No heartbeat from core client for 30 sec - exiting 16:56:46 (9028): No heartbeat from core client for 30 sec - exiting 16:56:47 (9028): No heartbeat from core client for 30 sec - exiting 16:56:48 (9028): No heartbeat from core client for 30 sec - exiting 16:56:49 (9028): No heartbeat from core client for 30 sec - exiting 16:56:51 (9028): No heartbeat from core client for 30 sec - exiting 16:56:52 (9028): No heartbeat from core client for 30 sec - exiting 16:56:53 (9028): No heartbeat from core client for 30 sec - exiting 16:56:54 (9028): No heartbeat from core client for 30 sec - exiting 16:56:55 (9028): No heartbeat from core client for 30 sec - exiting 16:56:56 (9028): No heartbeat from core client for 30 sec - exiting 16:56:57 (9028): No heartbeat from core client for 30 sec - exiting 16:56:58 (9028): No heartbeat from core client for 30 sec - exiting 16:56:59 (9028): No heartbeat from core client for 30 sec - exiting 16:57:00 (9028): No heartbeat from core client for 30 sec - exiting 16:57:03 (9028): No heartbeat from core client for 30 sec - exiting 16:57:04 (9028): No heartbeat from core client for 30 sec - exiting 16:57:05 (9028): No heartbeat from core client for 30 sec - exiting 16:57:06 (9028): No heartbeat from core client for 30 sec - exiting 16:57:07 (9028): No heartbeat from core client for 30 sec - exiting 16:57:08 (9028): No heartbeat from core client for 30 sec - exiting 16:57:09 (9028): No heartbeat from core client for 30 sec - exiting 16:57:10 (9028): No heartbeat from core client for 30 sec - exiting 16:57:11 (9028): No heartbeat from core client for 30 sec - exiting 16:57:12 (9028): No heartbeat from core client for 30 sec - exiting 16:57:15 (9028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... 17:06:54 (9052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:06:55 (9052): No heartbeat from core client for 30 sec - exiting 17:06:56 (9052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 22:11:59 (5308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Sorry, too many model crashes! :-( 11:13:36 (5968): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
07 Jan 2011 15:29:13 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,179,386 | 369,095 | 0.3130 |
07 Jan 2011 14:40:15 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,170,026 | 366,220 | 0.3130 |
07 Jan 2011 14:00:42 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,160,666 | 363,317 | 0.3130 |
07 Jan 2011 12:58:30 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,151,306 | 360,431 | 0.3131 |
07 Jan 2011 12:14:56 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,141,946 | 357,591 | 0.3131 |
06 Jan 2011 20:26:38 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,132,586 | 354,698 | 0.3132 |
06 Jan 2011 19:35:54 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,123,226 | 351,676 | 0.3131 |
06 Jan 2011 18:45:02 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,113,866 | 348,580 | 0.3129 |
06 Jan 2011 18:34:47 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,104,506 | 345,483 | 0.3128 |
06 Jan 2011 16:51:31 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,095,146 | 342,568 | 0.3128 |
06 Jan 2011 15:50:43 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,085,786 | 339,692 | 0.3129 |
06 Jan 2011 14:32:04 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,076,426 | 336,490 | 0.3126 |
06 Jan 2011 13:38:20 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,067,066 | 333,662 | 0.3127 |
06 Jan 2011 12:42:26 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,057,706 | 330,827 | 0.3128 |
06 Jan 2011 11:50:33 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,048,346 | 328,030 | 0.3129 |
06 Jan 2011 11:22:38 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,038,986 | 325,168 | 0.3130 |
06 Jan 2011 11:22:38 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,029,626 | 322,302 | 0.3130 |
06 Jan 2011 11:22:38 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,020,266 | 319,440 | 0.3131 |
06 Jan 2011 11:22:38 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,010,906 | 316,552 | 0.3131 |
06 Jan 2011 11:22:38 | 1081342 | 12414223 | famous_w5pt_599_200_006754097_2 | 1,001,546 | 313,655 | 0.3132 |
©2024 climateprediction.net