Name | famous_unbm_1399_200_006663069_2 |
Workunit | 6866441 |
Created | 10 Jun 2010, 15:24:53 UTC |
Sent | 5 Jul 2010, 15:55:02 UTC |
Report deadline | 4 Oct 2010, 23:22:13 UTC |
Received | 3 Aug 2010, 10:49:12 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1000415 |
Run time | 14 days 17 hours 55 min 52 sec |
CPU time | 13 days 6 hours 59 min 56 sec |
Validate state | Invalid |
Credit | 5,219.08 |
Device peak FLOPS | 1.75 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2880, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3180, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3180, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3180, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3916, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7328, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5272, iMonCtr=1 Model crash detected, will try to restart... 11:16:54 (5700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6040, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3340, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5564, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3828, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4964, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4808, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5084, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5084, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5176, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4992, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1688, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5884, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4520, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5292, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3376, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3880, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( 10:50:33 (5288): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Aug 2010 09:35:43 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,581,866 | 1,147,297 | 0.7253 |
02 Aug 2010 13:58:55 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,572,506 | 1,140,801 | 0.7255 |
02 Aug 2010 12:02:47 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,563,146 | 1,134,182 | 0.7256 |
02 Aug 2010 10:04:42 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,553,786 | 1,127,473 | 0.7256 |
01 Aug 2010 21:18:19 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,544,426 | 1,120,851 | 0.7257 |
01 Aug 2010 19:27:14 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,535,066 | 1,114,252 | 0.7259 |
01 Aug 2010 17:25:23 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,525,706 | 1,107,640 | 0.7260 |
01 Aug 2010 15:31:59 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,516,346 | 1,101,017 | 0.7261 |
01 Aug 2010 11:45:47 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,506,986 | 1,094,370 | 0.7262 |
01 Aug 2010 09:46:35 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,497,626 | 1,087,642 | 0.7262 |
31 Jul 2010 22:37:17 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,488,266 | 1,081,093 | 0.7264 |
31 Jul 2010 20:41:37 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,478,906 | 1,074,813 | 0.7268 |
31 Jul 2010 18:40:55 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,469,546 | 1,068,513 | 0.7271 |
31 Jul 2010 16:38:26 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,460,186 | 1,061,821 | 0.7272 |
31 Jul 2010 11:40:44 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,450,826 | 1,055,204 | 0.7273 |
31 Jul 2010 09:37:57 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,441,466 | 1,048,601 | 0.7275 |
31 Jul 2010 07:48:22 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,432,106 | 1,042,016 | 0.7276 |
30 Jul 2010 20:40:12 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,422,746 | 1,035,403 | 0.7277 |
30 Jul 2010 18:33:37 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,413,386 | 1,028,709 | 0.7278 |
30 Jul 2010 17:22:20 | 1000415 | 11566203 | famous_unbm_1399_200_006663069_2 | 1,404,026 | 1,022,013 | 0.7279 |
©2024 cpdn.org