climateprediction.net home page
Task 11740866

Task 11740866

Name famous_v82d_1999_200_006696219_2
Workunit 6899472
Created 26 Aug 2010, 16:26:05 UTC
Sent 10 Dec 2010, 11:45:56 UTC
Report deadline 11 Mar 2011, 19:13:07 UTC
Received 24 Jul 2011, 17:57:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1055955
Run time 14 days 20 hours 21 min 45 sec
CPU time 13 days 17 hours 24 min 5 sec
Validate state Invalid
Credit 4,385.28
Device peak FLOPS 1.07 GFLOPS
Application version UK Met Office FAMOUS v6.11
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
14:37:47 (3220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2176, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3596, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2960, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2356, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3244, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3172, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2488, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2356, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3096, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
09:32:34 (2308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3620, iMonCtr=1
Model crash detected, will try to restart...
14:12:34 (2208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CCPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2716, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2164, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3000, iMonCtr=1
Model crash detected, will try to restart...
12:08:48 (4016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
11:30:55 (3200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1132, iMonCtr=1
Model crash detected, will try to restart...
08:35:33 (2848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
C
</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Jul 2011 16:51:44 1055955 11740866 famous_v82d_1999_200_006696219_2 1,329,146 1,180,644 0.8883
26 Jul 2011 16:51:43 1055955 11740866 famous_v82d_1999_200_006696219_2 1,319,786 1,172,220 0.8882
26 Jul 2011 16:51:43 1055955 11740866 famous_v82d_1999_200_006696219_2 1,310,426 1,163,988 0.8883
26 Jul 2011 16:51:43 1055955 11740866 famous_v82d_1999_200_006696219_2 1,301,066 1,155,807 0.8884
25 Jul 2011 19:41:25 1055955 11740866 famous_v82d_1999_200_006696219_2 1,291,706 1,147,512 0.8884
25 Jul 2011 18:01:45 1055955 11740866 famous_v82d_1999_200_006696219_2 1,282,346 1,139,167 0.8883
25 Jul 2011 17:15:48 1055955 11740866 famous_v82d_1999_200_006696219_2 1,272,986 1,130,796 0.8883
25 Jul 2011 17:15:48 1055955 11740866 famous_v82d_1999_200_006696219_2 1,263,626 1,122,497 0.8883
25 Jul 2011 17:15:47 1055955 11740866 famous_v82d_1999_200_006696219_2 1,254,266 1,114,243 0.8884
25 Jul 2011 17:15:47 1055955 11740866 famous_v82d_1999_200_006696219_2 1,244,906 1,106,080 0.8885
25 Jul 2011 17:15:47 1055955 11740866 famous_v82d_1999_200_006696219_2 1,235,546 1,097,894 0.8886
25 Jul 2011 17:15:47 1055955 11740866 famous_v82d_1999_200_006696219_2 1,226,186 1,089,626 0.8886
25 Jul 2011 17:15:47 1055955 11740866 famous_v82d_1999_200_006696219_2 1,216,826 1,081,353 0.8887
25 Jul 2011 17:15:47 1055955 11740866 famous_v82d_1999_200_006696219_2 1,207,466 1,073,108 0.8887
25 Jul 2011 17:15:47 1055955 11740866 famous_v82d_1999_200_006696219_2 1,198,106 1,064,836 0.8888
08 Jul 2011 14:29:19 1055955 11740866 famous_v82d_1999_200_006696219_2 1,188,746 1,056,517 0.8888
08 Jul 2011 05:17:25 1055955 11740866 famous_v82d_1999_200_006696219_2 1,179,386 1,048,247 0.8888
08 Jul 2011 05:17:25 1055955 11740866 famous_v82d_1999_200_006696219_2 1,170,026 1,040,090 0.8889
08 Jul 2011 05:17:25 1055955 11740866 famous_v82d_1999_200_006696219_2 1,160,666 1,031,807 0.8890
30 Jun 2011 17:37:54 1055955 11740866 famous_v82d_1999_200_006696219_2 1,151,306 1,023,553 0.8890


©2024 cpdn.org