climateprediction.net home page
Task 11651974

Task 11651974

Name famous_ub66_1899_200_006647321_5
Workunit 6850693
Created 11 Aug 2010, 15:39:56 UTC
Sent 11 Aug 2010, 16:11:21 UTC
Report deadline 10 Nov 2010, 23:38:32 UTC
Received 22 Aug 2010, 18:19:18 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1045993
Run time 10 days 21 hours 7 min 42 sec
CPU time 8 days 21 hours 59 min 14 sec
Validate state Invalid
Credit 4,477.92
Device peak FLOPS 2.23 GFLOPS
Application version UK Met Office FAMOUS v6.11
i686-pc-linux-gnu
Stderr
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Signal 15 received, exiting...
Signal 15 received, exiting...
CPDN Monitor - Quit request from BOINC...
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
 (2956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
 (16041): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16127): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16147): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16147): No heartbeat from core client for 30 sec - exiting
 (16147): No heartbeat from core client for 30 sec - exiting
 (16147): No heartbeat from core client for 30 sec - exiting
 (16201): No heartbeat from core client for 30 sec - exiting
 (16201): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16201): No heartbeat from core client for 30 sec - exiting
 (16235): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16260): No heartbeat from core client for 30 sec - exiting
 (16301): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16301): No heartbeat from core client for 30 sec - exiting
 (16301): No heartbeat from core client for 30 sec - exiting
 (16301): No heartbeat from core client for 30 sec - exiting
 (16301): No heartbeat from core client for 30 sec - exiting
 (16301): No heartbeat from core client for 30 sec - exiting
 (16301): No heartbeat from core client for 30 sec - exiting
 (16317): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16317): No heartbeat from core client for 30 sec - exiting
 (16317): No heartbeat from core client for 30 sec - exiting
 (16317): No heartbeat from core client for 30 sec - exiting
 (16333): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16359): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16375): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16391): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16391): No heartbeat from core client for 30 sec - exiting
 (16391): No heartbeat from core client for 30 sec - exiting
 (16391): No heartbeat from core client for 30 sec - exiting
 (16417): No heartbeat from core client for 30 sec - exiting
 (16417): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16417): No heartbeat from core client for 30 sec - exiting
 (16417): No heartbeat from core client for 30 sec - exiting
 (16417): No heartbeat from core client for 30 sec - exiting
 (16437): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16453): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16453): No heartbeat from core client for 30 sec - exiting
 (16453): No heartbeat from core client for 30 sec - exiting
 (16453): No heartbeat from core client for 30 sec - exiting
 (16469): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16469): No heartbeat from core client for 30 sec - exiting
 (16486): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16502): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16502): No heartbeat from core client for 30 sec - exiting
 (16502): No heartbeat from core client for 30 sec - exiting
 (16502): No heartbeat from core client for 30 sec - exiting
 (16528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16544): No heartbeat from core client for 30 sec - exiting
 (16544): No heartbeat from core client for 30 sec - exiting
 (16560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16560): No heartbeat from core client for 30 sec - exiting
 (16560): No heartbeat from core client for 30 sec - exiting
 (16560): No heartbeat from core client for 30 sec - exiting
 (16582): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16598): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16598): No heartbeat from core client for 30 sec - exiting
 (16598): No heartbeat from core client for 30 sec - exiting
 (16614): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16630): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16646): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16646): No heartbeat from core client for 30 sec - exiting
 (16680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16680): No heartbeat from core client for 30 sec - exiting
 (16704): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16704): No heartbeat from core client for 30 sec - exiting
 (16732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16732): No heartbeat from core client for 30 sec - exiting
 (16732): No heartbeat from core client for 30 sec - exiting
 (16732): No heartbeat from core client for 30 sec - exiting
 (16748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16748): No heartbeat from core client for 30 sec - exiting
 (16748): No heartbeat from core client for 30 sec - exiting
 (16748): No heartbeat from core client for 30 sec - exiting
 (16765): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16765): No heartbeat from core client for 30 sec - exiting
 (16799): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16799): No heartbeat from core client for 30 sec - exiting
 (16799): No heartbeat from core client for 30 sec - exiting
 (16799): No heartbeat from core client for 30 sec - exiting
 (16799): No heartbeat from core client for 30 sec - exiting
 (16799): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16815, iMonCtr=1
Model crash detected, will try to restart...
 (16815): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16835): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16835): No heartbeat from core client for 30 sec - exiting
 (16835): No heartbeat from core client for 30 sec - exiting
 (16835): No heartbeat from core client for 30 sec - exiting
 (16851): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16851): No heartbeat from core client for 30 sec - exiting
 (16868): No heartbeat from core client for 30 sec - exiting
 (16868): No heartbeat from core client for 30 sec - exiting
 (16868): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16884, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16884, iMonCtr=1
Model crash detected, will try to restart...
 (16884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16920, iMonCtr=1
Model crash detected, will try to restart...
 (16920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16920): No heartbeat from core client for 30 sec - exiting
 (16968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16985): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (16985): No heartbeat from core client for 30 sec - exiting
 (17001): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (17001): No heartbeat from core client for 30 sec - exiting
 (17001): No heartbeat from core client for 30 sec - exiting
 (17001): No heartbeat from core client for 30 sec - exiting
 (17001): No heartbeat from core client for 30 sec - exiting
 (17001): No heartbeat from core client for 30 sec - exiting
 (17025): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (17041): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (17041): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17057, iMonCtr=1
Model crash detected, will try to restart...
 (17057): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17091, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
 (17091): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
22 Aug 2010 08:42:22 1045993 11651974 famous_ub66_1899_200_006647321_5 1,357,226 767,696 0.5656
22 Aug 2010 06:57:01 1045993 11651974 famous_ub66_1899_200_006647321_5 1,347,866 762,351 0.5656
22 Aug 2010 05:19:42 1045993 11651974 famous_ub66_1899_200_006647321_5 1,338,506 757,020 0.5656
22 Aug 2010 03:50:03 1045993 11651974 famous_ub66_1899_200_006647321_5 1,329,146 751,695 0.5655
22 Aug 2010 02:40:43 1045993 11651974 famous_ub66_1899_200_006647321_5 1,319,786 746,366 0.5655
22 Aug 2010 00:38:25 1045993 11651974 famous_ub66_1899_200_006647321_5 1,310,426 741,048 0.5655
21 Aug 2010 22:49:01 1045993 11651974 famous_ub66_1899_200_006647321_5 1,301,066 735,729 0.5655
21 Aug 2010 21:11:59 1045993 11651974 famous_ub66_1899_200_006647321_5 1,291,706 730,420 0.5655
21 Aug 2010 19:32:47 1045993 11651974 famous_ub66_1899_200_006647321_5 1,282,346 725,101 0.5654
21 Aug 2010 17:56:36 1045993 11651974 famous_ub66_1899_200_006647321_5 1,272,986 719,791 0.5654
21 Aug 2010 16:18:41 1045993 11651974 famous_ub66_1899_200_006647321_5 1,263,626 714,476 0.5654
21 Aug 2010 14:52:57 1045993 11651974 famous_ub66_1899_200_006647321_5 1,254,266 709,158 0.5654
21 Aug 2010 13:03:52 1045993 11651974 famous_ub66_1899_200_006647321_5 1,244,906 703,836 0.5654
21 Aug 2010 11:23:35 1045993 11651974 famous_ub66_1899_200_006647321_5 1,235,546 698,523 0.5654
21 Aug 2010 09:51:43 1045993 11651974 famous_ub66_1899_200_006647321_5 1,226,186 693,210 0.5653
21 Aug 2010 08:14:16 1045993 11651974 famous_ub66_1899_200_006647321_5 1,216,826 687,897 0.5653
21 Aug 2010 06:32:41 1045993 11651974 famous_ub66_1899_200_006647321_5 1,207,466 682,593 0.5653
21 Aug 2010 05:01:15 1045993 11651974 famous_ub66_1899_200_006647321_5 1,198,106 677,275 0.5653
21 Aug 2010 03:40:09 1045993 11651974 famous_ub66_1899_200_006647321_5 1,188,746 671,975 0.5653
21 Aug 2010 03:40:09 1045993 11651974 famous_ub66_1899_200_006647321_5 1,179,386 666,687 0.5653


©2024 climateprediction.net