climateprediction.net home page
Task 11924573

Task 11924573

Name famous_s3am_999_200_006727857_0
Workunit 6931198
Created 9 Oct 2010, 21:49:27 UTC
Sent 30 Oct 2010, 8:46:10 UTC
Report deadline 29 Jan 2011, 16:13:21 UTC
Received 13 Nov 2010, 17:42:28 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1075728
Run time 2 days 16 hours 54 min 21 sec
CPU time 1 days 18 hours 47 min 1 sec
Validate state Invalid
Credit 1,111.83
Device peak FLOPS 2.58 GFLOPS
Application version UK Met Office FAMOUS v6.11
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4572, selfPID=4572, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6556, selfPID=6556, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6728, selfPID=6728, iMonCtr=1
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4980, selfPID=4980, iMonCtr=1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6732, selfPID=6732, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4376, selfPID=4376, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6088, selfPID=6088, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3288, selfPID=3288, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6300, selfPID=6300, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6724, selfPID=6724, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1208, selfPID=1208, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5216, selfPID=5216, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6612, selfPID=6612, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6140, selfPID=6140, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4308, selfPID=4308, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5864, selfPID=5864, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3564, selfPID=3564, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5680, selfPID=5680, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2196, selfPID=2196, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5188, selfPID=5188, iMonCtr=1
19:01:23 (4716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:01:24 (4716): No heartbeat from core client for 30 sec - exiting
19:01:25 (4716): No heartbeat from core client for 30 sec - exiting
19:01:26 (4716): No heartbeat from core client for 30 sec - exiting
19:01:27 (4716): No heartbeat from core client for 30 sec - exiting
19:01:28 (4716): No heartbeat from core client for 30 sec - exiting
19:01:29 (4716): No heartbeat from core client for 30 sec - exiting
19:01:30 (4716): No heartbeat from core client for 30 sec - exiting
19:01:31 (4716): No heartbeat from core client for 30 sec - exiting
19:01:32 (4716): No heartbeat from core client for 30 sec - exiting
19:01:33 (4716): No heartbeat from core client for 30 sec - exiting
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2260, selfPID=2260, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2236, selfPID=2236, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4732, selfPID=4732, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2108, selfPID=2108, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2740, selfPID=2740, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5096, selfPID=5096, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5556, selfPID=5556, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1372, selfPID=1372, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1532, selfPID=1532, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4324, selfPID=4324, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2092, selfPID=2092, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5264, selfPID=5264, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1308, selfPID=1308, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1484, selfPID=1484, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2552, selfPID=2552, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5352, selfPID=5352, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5708, selfPID=5708, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2516, selfPID=2516, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4316, selfPID=4316, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5244, selfPID=5244, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4788, selfPID=4788, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2336, selfPID=2336, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4800, selfPID=4800, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4892, selfPID=4892, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3280, selfPID=3280, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5984, selfPID=5984, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7032, selfPID=7032, iMonCtr=1
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6720, selfPID=6720, iMonCtr=1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5908, iMonCtr=1
Model crash detected, will try to restart...
16:26:06 (4580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:26:08 (4580): No heartbeat from core client for 30 sec - exiting
16:26:09 (4580): No heartbeat from core client for 30 sec - exiting
16:26:10 (4580): No heartbeat from core client for 30 sec - exiting
16:26:11 (4580): No heartbeat from core client for 30 sec - exiting
16:26:12 (4580): No heartbeat from core client for 30 sec - exiting
16:26:13 (4580): No heartbeat from core client for 30 sec - exiting
16:26:14 (4580): No heartbeat from core client for 30 sec - exiting
16:26:15 (4580): No heartbeat from core client for 30 sec - exiting
16:26:16 (4580): No heartbeat from core client for 30 sec - exiting
16:26:17 (4580): No heartbeat from core client for 30 sec - exiting
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1128, selfPID=1128, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1800, selfPID=1800, iMonCtr=1
16:44:04 (3636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:44:05 (3636): No heartbeat from core client for 30 sec - exiting
16:44:07 (3636): No heartbeat from core client for 30 sec - exiting
16:44:08 (3636): No heartbeat from core client for 30 sec - exiting
16:44:09 (3636): No heartbeat from core client for 30 sec - exiting
16:44:10 (3636): No heartbeat from core client for 30 sec - exiting
16:44:11 (3636): No heartbeat from core client for 30 sec - exiting
16:44:12 (3636): No heartbeat from core client for 30 sec - exiting
16:44:13 (3636): No heartbeat from core client for 30 sec - exiting
16:44:14 (3636): No heartbeat from core client for 30 sec - exiting
16:44:15 (3636): No heartbeat from core client for 30 sec - exiting
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4144, selfPID=4144, iMonCtr=1
17:22:50 (3720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:22:52 (3720): No heartbeat from core client for 30 sec - exiting
17:22:53 (3720): No heartbeat from core client for 30 sec - exiting
17:22:54 (3720): No heartbeat from core client for 30 sec - exiting
17:22:55 (3720): No heartbeat from core client for 30 sec - exiting
17:22:56 (3720): No heartbeat from core client for 30 sec - exiting
17:22:57 (3720): No heartbeat from core client for 30 sec - exiting
17:22:58 (3720): No heartbeat from core client for 30 sec - exiting
17:22:59 (3720): No heartbeat from core client for 30 sec - exiting
17:23:01 (3720): No heartbeat from core client for 30 sec - exiting
17:23:02 (3720): No heartbeat from core client for 30 sec - exiting
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4268, selfPID=4268, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2920, selfPID=2920, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5356, selfPID=5356, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4304, selfPID=4304, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=496, selfPID=496, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5516, selfPID=5516, iMonCtr=1

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 Nov 2010 01:03:05 1075728 11924573 famous_s3am_999_200_006727857_0 336,986 150,476 0.4465
12 Nov 2010 22:32:42 1075728 11924573 famous_s3am_999_200_006727857_0 327,626 146,618 0.4475
12 Nov 2010 19:16:27 1075728 11924573 famous_s3am_999_200_006727857_0 318,266 142,513 0.4478
12 Nov 2010 00:13:20 1075728 11924573 famous_s3am_999_200_006727857_0 308,906 138,278 0.4476
11 Nov 2010 20:16:48 1075728 11924573 famous_s3am_999_200_006727857_0 299,546 134,162 0.4479
11 Nov 2010 00:09:00 1075728 11924573 famous_s3am_999_200_006727857_0 290,186 129,725 0.4470
10 Nov 2010 19:49:10 1075728 11924573 famous_s3am_999_200_006727857_0 280,826 125,528 0.4470
09 Nov 2010 23:36:47 1075728 11924573 famous_s3am_999_200_006727857_0 271,466 121,367 0.4471
09 Nov 2010 19:19:38 1075728 11924573 famous_s3am_999_200_006727857_0 262,106 116,740 0.4454
09 Nov 2010 00:01:38 1075728 11924573 famous_s3am_999_200_006727857_0 252,746 112,736 0.4460
08 Nov 2010 19:35:09 1075728 11924573 famous_s3am_999_200_006727857_0 243,386 108,580 0.4461
08 Nov 2010 17:11:51 1075728 11924573 famous_s3am_999_200_006727857_0 234,026 104,188 0.4452
07 Nov 2010 22:29:36 1075728 11924573 famous_s3am_999_200_006727857_0 224,666 100,279 0.4463
07 Nov 2010 17:48:09 1075728 11924573 famous_s3am_999_200_006727857_0 215,306 95,932 0.4456
07 Nov 2010 16:38:39 1075728 11924573 famous_s3am_999_200_006727857_0 205,946 91,690 0.4452
07 Nov 2010 16:38:39 1075728 11924573 famous_s3am_999_200_006727857_0 196,586 87,295 0.4441
06 Nov 2010 17:41:47 1075728 11924573 famous_s3am_999_200_006727857_0 187,226 83,281 0.4448
06 Nov 2010 14:40:24 1075728 11924573 famous_s3am_999_200_006727857_0 177,866 79,091 0.4447
06 Nov 2010 11:46:11 1075728 11924573 famous_s3am_999_200_006727857_0 168,506 74,905 0.4445
06 Nov 2010 11:41:03 1075728 11924573 famous_s3am_999_200_006727857_0 159,146 70,466 0.4428


©2024 cpdn.org