Name | famous_vdrj_1599_200_006703605_1 |
Workunit | 6906858 |
Created | 26 Aug 2010, 16:48:21 UTC |
Sent | 28 Nov 2010, 17:36:42 UTC |
Report deadline | 28 Feb 2011, 1:03:53 UTC |
Received | 17 Jan 2011, 20:23:09 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1117239 |
Run time | 7 days 15 hours 39 min 55 sec |
CPU time | 5 days 16 hours 44 min 4 sec |
Validate state | Invalid |
Credit | 3,026.49 |
Device peak FLOPS | 1.96 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
17:42:45 (6952): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:42:46 (6952): No heartbeat from core client for 30 sec - exiting
17:42:47 (6952): No heartbeat from core client for 30 sec - exiting
17:42:48 (6952): No heartbeat from core client for 30 sec - exiting
17:42:49 (6952): No heartbeat from core client for 30 sec - exiting
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4912, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=288, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:40:30 (4716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
15:05:59 (1508): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
15:06:00 (1508): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4348, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5312, iMonCtr=1
Model crash detected, will try to restart...
</stderr_txt>
]]> |
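
The exit status in this record is shown in two forms, -226 and 0xFFFFFF1E. These are the same value: the hex form is the signed status reinterpreted as an unsigned 32-bit integer (two's complement). The short sketch below shows the conversion in both directions. The helper names `status_to_hex` and `hex_to_status` are hypothetical and chosen for illustration; they are not part of BOINC.

```python
# Hypothetical helpers (not part of BOINC) for converting between the
# signed and unsigned 32-bit forms of an exit status.

def status_to_hex(status: int) -> str:
    """Format a signed exit status as an unsigned 32-bit hex string."""
    return f"0x{status & 0xFFFFFFFF:08X}"


def hex_to_status(value: int) -> int:
    """Interpret an unsigned 32-bit value as a signed 32-bit integer."""
    return value - 0x1_0000_0000 if value & 0x8000_0000 else value


print(status_to_hex(-226))        # 0xFFFFFF1E
print(hex_to_status(0xFFFFFF1E))  # -226
```
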
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
16 Jan 2011 18:07:23 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 917,306 | 488,291 | 0.5323 |
15 Jan 2011 20:29:40 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 907,946 | 483,455 | 0.5325 |
15 Jan 2011 18:42:06 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 898,586 | 478,269 | 0.5322 |
15 Jan 2011 17:09:45 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 889,226 | 473,441 | 0.5324 |
15 Jan 2011 15:31:30 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 879,866 | 468,627 | 0.5326 |
15 Jan 2011 14:02:17 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 870,506 | 463,807 | 0.5328 |
15 Jan 2011 13:14:01 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 861,146 | 458,980 | 0.5330 |
15 Jan 2011 13:14:01 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 851,786 | 454,196 | 0.5332 |
15 Jan 2011 13:14:01 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 842,426 | 449,496 | 0.5336 |
15 Jan 2011 13:11:42 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 833,066 | 444,852 | 0.5340 |
14 Jan 2011 22:12:51 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 823,706 | 439,951 | 0.5341 |
14 Jan 2011 19:58:50 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 814,346 | 435,334 | 0.5346 |
13 Jan 2011 21:39:09 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 804,986 | 430,566 | 0.5349 |
13 Jan 2011 19:59:48 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 795,626 | 425,712 | 0.5351 |
12 Jan 2011 21:05:52 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 786,266 | 420,855 | 0.5353 |
12 Jan 2011 19:21:53 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 776,906 | 416,261 | 0.5358 |
11 Jan 2011 12:33:49 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 767,546 | 411,203 | 0.5357 |
11 Jan 2011 11:26:55 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 758,186 | 406,300 | 0.5359 |
10 Jan 2011 20:43:38 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 748,826 | 401,499 | 0.5362 |
10 Jan 2011 19:10:48 | 1117239 | 11777795 | famous_vdrj_1599_200_006703605_1 | 739,466 | 396,640 | 0.5364 |
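
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count; the rows above are consistent with that reading, but it is an assumption rather than something the page states. The sketch below recomputes the average for the three most recent trickles and also shows the timestep gap between consecutive trickles. The function name `trickle_average` is hypothetical.

```python
# Three most recent trickles, copied from the table above:
# (timestep, cpu_seconds, reported_average)
rows = [
    (917_306, 488_291, 0.5323),
    (907_946, 483_455, 0.5325),
    (898_586, 478_269, 0.5322),
]


def trickle_average(cpu_seconds: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in rows:
    computed = round(trickle_average(cpu_seconds, timestep), 4)
    print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f}")

# Timesteps completed between consecutive trickles (rows are newest first).
steps = [rows[i][0] - rows[i + 1][0] for i in range(len(rows) - 1)]
print(steps)  # [9360, 9360]
```
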