Name | famous_vdqk_1799_200_006703570_1 |
Workunit | 6906823 |
Created | 26 Aug 2010, 16:48:17 UTC |
Sent | 28 Nov 2010, 18:50:58 UTC |
Report deadline | 28 Feb 2011, 2:18:09 UTC |
Received | 31 Dec 2010, 19:51:08 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1119500 |
Run time | 5 days 18 hours 48 min 45 sec |
CPU time | 5 days 4 hours 28 min 46 sec |
Validate state | Invalid |
Credit | 463.31 |
Device peak FLOPS | 0.36 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.17</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 15 received, exiting... (1015): called boinc_finish CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... (1040): No heartbeat from core client for 30 sec - exiting (1040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (1040): No heartbeat from core client for 30 sec - exiting (1040): No heartbeat from core client for 30 sec - exiting (1040): No heartbeat from core client for 30 sec - exiting (1040): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4827, iMonCtr=1 Model crash detected, will try to restart... (4827): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4827, iMonCtr=1 (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting Model crash detected, will try to restart... CPDN Monitor - No 'heartbeat' from BOINC... (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4827): No heartbeat from core client for 30 sec - exiting (4886): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (4886): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1 Model crash detected, will try to restart... (4924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (4924): No heartbeat from core client for 30 sec - exiting (4924): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5035, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5035, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( (5035): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
31 Dec 2010 05:03:25 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 140,426 | 421,849 | 3.0041 |
22 Dec 2010 16:01:34 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 131,066 | 393,809 | 3.0047 |
21 Dec 2010 19:07:35 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 121,706 | 365,383 | 3.0022 |
17 Dec 2010 11:06:36 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 112,346 | 337,085 | 3.0004 |
12 Dec 2010 11:21:22 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 102,986 | 308,810 | 2.9986 |
11 Dec 2010 11:38:33 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 93,626 | 280,556 | 2.9966 |
10 Dec 2010 16:11:40 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 84,266 | 252,252 | 2.9935 |
09 Dec 2010 18:55:00 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 74,906 | 224,123 | 2.9921 |
08 Dec 2010 20:20:32 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 65,546 | 196,324 | 2.9952 |
01 Dec 2010 20:42:25 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 56,186 | 168,424 | 2.9976 |
01 Dec 2010 11:14:22 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 46,826 | 140,345 | 2.9972 |
01 Dec 2010 08:51:36 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 37,466 | 112,289 | 2.9971 |
30 Nov 2010 19:41:04 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 28,106 | 84,222 | 2.9966 |
30 Nov 2010 10:45:36 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 18,746 | 56,171 | 2.9964 |
30 Nov 2010 07:14:52 | 1119500 | 11777620 | famous_vdqk_1799_200_006703570_1 | 9,386 | 28,090 | 2.9928 |
©2024 cpdn.org