climateprediction.net home page
Task 11429601

Task 11429601

Name famous_u290_999_200_006635759_1
Workunit 6839131
Created 10 Jun 2010, 11:25:55 UTC
Sent 18 Jul 2010, 6:30:54 UTC
Report deadline 17 Oct 2010, 13:58:05 UTC
Received 3 Aug 2010, 12:44:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1051426
Run time 8 days 11 hours 48 min 31 sec
CPU time 7 days 9 hours 5 min 26 sec
Validate state Invalid
Credit 2,532.38
Device peak FLOPS 0.75 GFLOPS
Application version UK Met Office FAMOUS v6.11
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4828, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3808, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6332, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2832, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7148, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7164, iMonCtr=1
Model crash detected, will try to restart...
06:25:16 (2344): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:03:04 (3780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1424, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7216, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:53:30 (4824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:45:32 (5440): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
06:45:34 (5440): No heartbeat from core client for 30 sec - exiting
06:45:35 (5440): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
15:54:01 (892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:54:02 (892): No heartbeat from core client for 30 sec - exiting

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8092, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
03 Aug 2010 05:56:42 1051426 11429601 famous_u290_999_200_006635759_1 767,546 634,004 0.8260
02 Aug 2010 14:40:02 1051426 11429601 famous_u290_999_200_006635759_1 758,186 627,307 0.8274
02 Aug 2010 10:58:23 1051426 11429601 famous_u290_999_200_006635759_1 748,826 620,246 0.8283
02 Aug 2010 08:51:42 1051426 11429601 famous_u290_999_200_006635759_1 739,466 613,209 0.8293
02 Aug 2010 06:06:02 1051426 11429601 famous_u290_999_200_006635759_1 730,106 606,385 0.8305
02 Aug 2010 03:19:41 1051426 11429601 famous_u290_999_200_006635759_1 720,746 599,402 0.8316
02 Aug 2010 00:10:37 1051426 11429601 famous_u290_999_200_006635759_1 711,386 592,509 0.8329
01 Aug 2010 21:02:18 1051426 11429601 famous_u290_999_200_006635759_1 702,026 585,802 0.8344
01 Aug 2010 15:32:11 1051426 11429601 famous_u290_999_200_006635759_1 692,666 579,270 0.8363
01 Aug 2010 12:55:32 1051426 11429601 famous_u290_999_200_006635759_1 683,306 572,540 0.8379
01 Aug 2010 10:02:01 1051426 11429601 famous_u290_999_200_006635759_1 673,946 565,947 0.8398
31 Jul 2010 19:25:18 1051426 11429601 famous_u290_999_200_006635759_1 664,586 558,019 0.8396
31 Jul 2010 14:06:48 1051426 11429601 famous_u290_999_200_006635759_1 655,226 550,233 0.8398
31 Jul 2010 08:35:34 1051426 11429601 famous_u290_999_200_006635759_1 645,866 542,719 0.8403
31 Jul 2010 05:06:09 1051426 11429601 famous_u290_999_200_006635759_1 636,506 534,672 0.8400
31 Jul 2010 02:58:08 1051426 11429601 famous_u290_999_200_006635759_1 627,146 526,588 0.8397
31 Jul 2010 00:34:52 1051426 11429601 famous_u290_999_200_006635759_1 617,786 518,893 0.8399
30 Jul 2010 15:13:41 1051426 11429601 famous_u290_999_200_006635759_1 608,426 511,891 0.8413
30 Jul 2010 12:31:46 1051426 11429601 famous_u290_999_200_006635759_1 599,066 503,820 0.8410
30 Jul 2010 09:53:38 1051426 11429601 famous_u290_999_200_006635759_1 589,706 495,700 0.8406


©2024 climateprediction.net