climateprediction.net home page
Task 11960337

Task 11960337

Name famous_vmib_1599_200_006714937_5
Workunit 6918190
Created 2 Nov 2010, 14:14:32 UTC
Sent 2 Nov 2010, 14:18:14 UTC
Report deadline 1 Feb 2011, 21:45:25 UTC
Received 14 Nov 2010, 20:52:18 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1111355
Run time 9 days 11 hours 2 min 18 sec
CPU time 8 days 23 hours 21 min
Validate state Invalid
Credit 5,126.44
Device peak FLOPS 2.49 GFLOPS
Application version UK Met Office FAMOUS v6.11
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3604, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
09:22:05 (4200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:22:06 (4200): No heartbeat from core client for 30 sec - exiting
09:22:07 (4200): No heartbeat from core client for 30 sec - exiting
09:22:08 (4200): No heartbeat from core client for 30 sec - exiting
09:22:10 (4200): No heartbeat from core client for 30 sec - exiting
09:22:11 (4200): No heartbeat from core client for 30 sec - exiting
09:22:12 (4200): No heartbeat from core client for 30 sec - exiting
10:39:11 (3776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:39:13 (3776): No heartbeat from core client for 30 sec - exiting
10:39:14 (3776): No heartbeat from core client for 30 sec - exiting
10:39:15 (3776): No heartbeat from core client for 30 sec - exiting
10:39:16 (3776): No heartbeat from core client for 30 sec - exiting
10:39:17 (3776): No heartbeat from core client for 30 sec - exiting
10:39:18 (3776): No heartbeat from core client for 30 sec - exiting
10:39:19 (3776): No heartbeat from core client for 30 sec - exiting
10:39:20 (3776): No heartbeat from core client for 30 sec - exiting
10:39:21 (3776): No heartbeat from core client for 30 sec - exiting
10:39:22 (3776): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3960, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:31:08 (3712): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
15:51:14 (4516): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Nov 2010 19:39:21 1111355 11960337 famous_vmib_1599_200_006714937_5 1,553,786 772,317 0.4971
14 Nov 2010 18:16:33 1111355 11960337 famous_vmib_1599_200_006714937_5 1,544,426 767,602 0.4970
14 Nov 2010 16:49:09 1111355 11960337 famous_vmib_1599_200_006714937_5 1,535,066 762,874 0.4970
14 Nov 2010 13:42:41 1111355 11960337 famous_vmib_1599_200_006714937_5 1,525,706 758,129 0.4969
14 Nov 2010 12:25:45 1111355 11960337 famous_vmib_1599_200_006714937_5 1,516,346 753,383 0.4968
14 Nov 2010 11:00:15 1111355 11960337 famous_vmib_1599_200_006714937_5 1,506,986 748,642 0.4968
14 Nov 2010 09:44:04 1111355 11960337 famous_vmib_1599_200_006714937_5 1,497,626 743,908 0.4967
14 Nov 2010 08:21:51 1111355 11960337 famous_vmib_1599_200_006714937_5 1,488,266 739,167 0.4967
14 Nov 2010 08:21:38 1111355 11960337 famous_vmib_1599_200_006714937_5 1,478,906 734,402 0.4966
14 Nov 2010 08:21:38 1111355 11960337 famous_vmib_1599_200_006714937_5 1,469,546 729,671 0.4965
14 Nov 2010 08:21:38 1111355 11960337 famous_vmib_1599_200_006714937_5 1,460,186 724,927 0.4965
14 Nov 2010 08:21:38 1111355 11960337 famous_vmib_1599_200_006714937_5 1,450,826 720,184 0.4964
14 Nov 2010 08:21:38 1111355 11960337 famous_vmib_1599_200_006714937_5 1,441,466 715,461 0.4963
13 Nov 2010 23:55:18 1111355 11960337 famous_vmib_1599_200_006714937_5 1,432,106 710,703 0.4963
13 Nov 2010 22:42:46 1111355 11960337 famous_vmib_1599_200_006714937_5 1,422,746 706,535 0.4966
13 Nov 2010 21:30:18 1111355 11960337 famous_vmib_1599_200_006714937_5 1,413,386 702,637 0.4971
13 Nov 2010 20:23:32 1111355 11960337 famous_vmib_1599_200_006714937_5 1,404,026 698,740 0.4977
13 Nov 2010 19:16:20 1111355 11960337 famous_vmib_1599_200_006714937_5 1,394,666 694,849 0.4982
13 Nov 2010 18:14:02 1111355 11960337 famous_vmib_1599_200_006714937_5 1,385,306 690,956 0.4988
13 Nov 2010 17:06:15 1111355 11960337 famous_vmib_1599_200_006714937_5 1,375,946 687,053 0.4993


©2024 cpdn.org