climateprediction.net home page
Task 11777620

Task 11777620

Name famous_vdqk_1799_200_006703570_1
Workunit 6906823
Created 26 Aug 2010, 16:48:17 UTC
Sent 28 Nov 2010, 18:50:58 UTC
Report deadline 28 Feb 2011, 2:18:09 UTC
Received 31 Dec 2010, 19:51:08 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1119500
Run time 5 days 18 hours 48 min 45 sec
CPU time 5 days 4 hours 28 min 46 sec
Validate state Invalid
Credit 463.31
Device peak FLOPS 0.36 GFLOPS
Application version UK Met Office FAMOUS v6.11
i686-pc-linux-gnu
Stderr
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 15 received, exiting...
 (1015): called boinc_finish
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
 (1040): No heartbeat from core client for 30 sec - exiting
 (1040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (1040): No heartbeat from core client for 30 sec - exiting
 (1040): No heartbeat from core client for 30 sec - exiting
 (1040): No heartbeat from core client for 30 sec - exiting
 (1040): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4827, iMonCtr=1
Model crash detected, will try to restart...
 (4827): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4827, iMonCtr=1
 (4827): No heartbeat from core client for 30 sec - exiting
 (4827): No heartbeat from core client for 30 sec - exiting
 (4827): No heartbeat from core client for 30 sec - exiting
Model crash detected, will try to restart...
CPDN Monitor - No 'heartbeat' from BOINC...
 (4827): No heartbeat from core client for 30 sec - exiting
 (4827): No heartbeat from core client for 30 sec - exiting
 (4827): No heartbeat from core client for 30 sec - exiting
 (4827): No heartbeat from core client for 30 sec - exiting
 (4827): No heartbeat from core client for 30 sec - exiting
 (4827): No heartbeat from core client for 30 sec - exiting
 (4827): No heartbeat from core client for 30 sec - exiting
 (4827): No heartbeat from core client for 30 sec - exiting
 (4886): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (4886): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
 (4924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (4924): No heartbeat from core client for 30 sec - exiting
 (4924): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5035, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5035, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
 (5035): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Dec 2010 05:03:25 1119500 11777620 famous_vdqk_1799_200_006703570_1 140,426 421,849 3.0041
22 Dec 2010 16:01:34 1119500 11777620 famous_vdqk_1799_200_006703570_1 131,066 393,809 3.0047
21 Dec 2010 19:07:35 1119500 11777620 famous_vdqk_1799_200_006703570_1 121,706 365,383 3.0022
17 Dec 2010 11:06:36 1119500 11777620 famous_vdqk_1799_200_006703570_1 112,346 337,085 3.0004
12 Dec 2010 11:21:22 1119500 11777620 famous_vdqk_1799_200_006703570_1 102,986 308,810 2.9986
11 Dec 2010 11:38:33 1119500 11777620 famous_vdqk_1799_200_006703570_1 93,626 280,556 2.9966
10 Dec 2010 16:11:40 1119500 11777620 famous_vdqk_1799_200_006703570_1 84,266 252,252 2.9935
09 Dec 2010 18:55:00 1119500 11777620 famous_vdqk_1799_200_006703570_1 74,906 224,123 2.9921
08 Dec 2010 20:20:32 1119500 11777620 famous_vdqk_1799_200_006703570_1 65,546 196,324 2.9952
01 Dec 2010 20:42:25 1119500 11777620 famous_vdqk_1799_200_006703570_1 56,186 168,424 2.9976
01 Dec 2010 11:14:22 1119500 11777620 famous_vdqk_1799_200_006703570_1 46,826 140,345 2.9972
01 Dec 2010 08:51:36 1119500 11777620 famous_vdqk_1799_200_006703570_1 37,466 112,289 2.9971
30 Nov 2010 19:41:04 1119500 11777620 famous_vdqk_1799_200_006703570_1 28,106 84,222 2.9966
30 Nov 2010 10:45:36 1119500 11777620 famous_vdqk_1799_200_006703570_1 18,746 56,171 2.9964
30 Nov 2010 07:14:52 1119500 11777620 famous_vdqk_1799_200_006703570_1 9,386 28,090 2.9928


©2024 cpdn.org