|
Name | famous_uhhg_1599_200_006655503_3 |
Workunit | 6858875 |
Created | 10 Jun 2010, 14:18:10 UTC |
Sent | 24 Aug 2010, 21:58:39 UTC |
Report deadline | 24 Nov 2010, 5:25:50 UTC |
Received | 13 Sep 2010, 10:16:04 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 981175 |
Run time | 13 days 6 hours 3 min 35 sec |
CPU time | 10 days 3 hours 28 min 23 sec |
Validate state | Invalid |
Credit | 5,157.32 |
Device peak FLOPS | 2.03 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:17:43 (4900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5904, iMonCtr=1 Model crash detected, will try to restart... 21:17:52 (5904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO tmp/pipe_dummy Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO tmp/pipe_dummy Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO tmp/pipe_dummy Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO tmp/pipe_dummy Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO tmp/pipe_dummy Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO tmp/pipe_dummy Sorry, too many model crashes! :-( 12:15:00 (5128): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
13 Sep 2010 07:58:50 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,563,146 | 873,462 | 0.5588 |
13 Sep 2010 06:32:36 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,553,786 | 868,570 | 0.5590 |
13 Sep 2010 05:05:29 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,544,426 | 863,713 | 0.5592 |
13 Sep 2010 03:40:19 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,535,066 | 858,865 | 0.5595 |
13 Sep 2010 02:27:37 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,525,706 | 854,022 | 0.5598 |
13 Sep 2010 01:03:29 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,516,346 | 849,141 | 0.5600 |
13 Sep 2010 00:20:08 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,506,986 | 844,247 | 0.5602 |
12 Sep 2010 21:46:36 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,497,626 | 839,332 | 0.5604 |
12 Sep 2010 20:16:28 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,488,266 | 834,472 | 0.5607 |
12 Sep 2010 18:45:43 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,478,906 | 829,584 | 0.5609 |
12 Sep 2010 17:14:35 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,469,546 | 824,732 | 0.5612 |
12 Sep 2010 15:38:01 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,460,186 | 819,864 | 0.5615 |
12 Sep 2010 14:06:50 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,450,826 | 814,949 | 0.5617 |
12 Sep 2010 12:32:54 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,441,466 | 810,045 | 0.5620 |
12 Sep 2010 10:46:12 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,432,106 | 805,151 | 0.5622 |
12 Sep 2010 09:04:39 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,422,746 | 800,353 | 0.5625 |
12 Sep 2010 07:29:57 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,413,386 | 795,511 | 0.5628 |
12 Sep 2010 06:05:15 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,404,026 | 790,730 | 0.5632 |
12 Sep 2010 04:39:50 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,394,666 | 785,939 | 0.5635 |
12 Sep 2010 03:20:49 | 981175 | 11528359 | famous_uhhg_1599_200_006655503_3 | 1,385,306 | 781,182 | 0.5639 |
©2024 climateprediction.net