climateprediction.net home page
Task 11704305

Name famous_v2f9_1199_200_006688907_1
Workunit 6892160
Created 26 Aug 2010, 15:48:57 UTC
Sent 4 Sep 2010, 8:08:26 UTC
Report deadline 4 Dec 2010, 15:35:37 UTC
Received 6 Sep 2010, 13:20:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 904604
Run time
CPU time 15 hours 19 min 14 sec
Validate state Invalid
Credit 988.30
Device peak FLOPS 2.17 GFLOPS
Application version UK Met Office FAMOUS v6.11 i686-apple-darwin
Stderr
<core_client_version>6.2.18</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
 (12743): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (12901): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (12901): No heartbeat from core client for 30 sec - exiting
 (12901): No heartbeat from core client for 30 sec - exiting
 (12901): No heartbeat from core client for 30 sec - exiting
 (12901): No heartbeat from core client for 30 sec - exiting
 (12909): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (13424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (13727): No heartbeat from core client for 30 sec - exiting
 (13727): No heartbeat from core client for 30 sec - exiting
 (13727): No heartbeat from core client for 30 sec - exiting
 (13727): No heartbeat from core client for 30 sec - exiting
 (13727): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (13727): No heartbeat from core client for 30 sec - exiting

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
 (13782): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (13782): No heartbeat from core client for 30 sec - exiting
 (13782): No heartbeat from core client for 30 sec - exiting
 (13782): No heartbeat from core client for 30 sec - exiting
 (13782): No heartbeat from core client for 30 sec - exiting
 (13782): No heartbeat from core client for 30 sec - exiting
 (13782): No heartbeat from core client for 30 sec - exiting
 (13782): No heartbeat from core client for 30 sec - exiting
 (13782): No heartbeat from core client for 30 sec - exiting
 (13782): No heartbeat from core client for 30 sec - exiting
 (13782): No heartbeat from core client for 30 sec - exiting
 (13805): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (13942): No heartbeat from core client for 30 sec - exiting
 (13942): No heartbeat from core client for 30 sec - exiting
 (13942): No heartbeat from core client for 30 sec - exiting
 (13942): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
 (14076): No heartbeat from core client for 30 sec - exiting
 (14076): No heartbeat from core client for 30 sec - exiting
 (14076): No heartbeat from core client for 30 sec - exiting
 (14076): No heartbeat from core client for 30 sec - exiting
 (14076): No heartbeat from core client for 30 sec - exiting
 (14076): No heartbeat from core client for 30 sec - exiting
 (14076): No heartbeat from core client for 30 sec - exiting
 (14076): No heartbeat from core client for 30 sec - exiting
 (14076): No heartbeat from core client for 30 sec - exiting
 (14076): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (14076): No heartbeat from core client for 30 sec - exiting

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
 (14823): No heartbeat from core client for 30 sec - exiting
 (14823): No heartbeat from core client for 30 sec - exiting
 (14823): No heartbeat from core client for 30 sec - exiting
 (14823): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
CPDN Monitor - Quit request from BOINC...
 (20187): No heartbeat from core client for 30 sec - exiting
 (20187): No heartbeat from core client for 30 sec - exiting
 (20187): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
 (1948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
CPDN Monitor - Quit request from BOINC...

Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy

Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy

Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy

Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy

Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy

Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy
Sorry, too many model crashes! :-(
 (2128): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
06 Sep 2010 12:27:15  904604   11704305   famous_v2f9_1199_200_006688907_1  299,546   54,951          0.1834
06 Sep 2010 11:08:09  904604   11704305   famous_v2f9_1199_200_006688907_1  290,186   53,290          0.1836
06 Sep 2010 10:28:14  904604   11704305   famous_v2f9_1199_200_006688907_1  280,826   51,606          0.1838
06 Sep 2010 09:36:21  904604   11704305   famous_v2f9_1199_200_006688907_1  271,466   49,959          0.1840
06 Sep 2010 09:00:21  904604   11704305   famous_v2f9_1199_200_006688907_1  262,106   48,185          0.1838
05 Sep 2010 14:39:16  904604   11704305   famous_v2f9_1199_200_006688907_1  243,386   44,705          0.1837
05 Sep 2010 13:56:46  904604   11704305   famous_v2f9_1199_200_006688907_1  234,026   42,983          0.1837
05 Sep 2010 13:14:28  904604   11704305   famous_v2f9_1199_200_006688907_1  224,666   41,084          0.1829
05 Sep 2010 12:36:10  904604   11704305   famous_v2f9_1199_200_006688907_1  215,306   39,245          0.1823
05 Sep 2010 11:53:30  904604   11704305   famous_v2f9_1199_200_006688907_1  205,946   37,451          0.1818
05 Sep 2010 11:11:06  904604   11704305   famous_v2f9_1199_200_006688907_1  196,586   35,737          0.1818
05 Sep 2010 10:32:58  904604   11704305   famous_v2f9_1199_200_006688907_1  187,226   34,063          0.1819
05 Sep 2010 09:55:29  904604   11704305   famous_v2f9_1199_200_006688907_1  177,866   32,411          0.1822
05 Sep 2010 09:12:35  904604   11704305   famous_v2f9_1199_200_006688907_1  168,506   30,723          0.1823
05 Sep 2010 08:33:55  904604   11704305   famous_v2f9_1199_200_006688907_1  159,146   29,014          0.1823
05 Sep 2010 07:53:14  904604   11704305   famous_v2f9_1199_200_006688907_1  149,786   27,292          0.1822
05 Sep 2010 07:10:23  904604   11704305   famous_v2f9_1199_200_006688907_1  140,426   25,549          0.1819
05 Sep 2010 06:28:22  904604   11704305   famous_v2f9_1199_200_006688907_1  131,066   23,821          0.1817
05 Sep 2010 05:51:06  904604   11704305   famous_v2f9_1199_200_006688907_1  121,706   22,093          0.1815
05 Sep 2010 05:09:06  904604   11704305   famous_v2f9_1199_200_006688907_1  112,346   20,351          0.1811

