climateprediction.net home page
Task 11691253

Task 11691253

Name famous_v0ei_599_200_006686296_4
Workunit 6889549
Created 26 Aug 2010, 15:39:54 UTC
Sent 30 Aug 2010, 4:39:19 UTC
Report deadline 29 Nov 2010, 12:06:30 UTC
Received 19 Sep 2010, 18:48:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1086791
Run time 12 days 6 hours 44 min 26 sec
CPU time 11 days 15 hours 58 min 2 sec
Validate state Invalid
Credit 4,416.16
Device peak FLOPS 2.60 GFLOPS
Application version UK Met Office FAMOUS v6.11
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Het station kan een bepaald gebied of spoor op de schijf niet vinden. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5108, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4404, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5008, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3812, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2560, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4892, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4968, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4564, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
00:43:36 (4240): called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=1
Model crash detected, will try to restart...
00:54:00 (2716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:54:01 (2716): No heartbeat from core client for 30 sec - exiting
00:54:02 (2716): No heartbeat from core client for 30 sec - exiting
00:54:03 (2716): No heartbeat from core client for 30 sec - exiting
00:54:04 (2716): No heartbeat from core client for 30 sec - exiting
00:54:05 (2716): No heartbeat from core client for 30 sec - exiting
00:54:06 (2716): No heartbeat from core client for 30 sec - exiting
00:54:07 (2716): No heartbeat from core client for 30 sec - exiting
00:54:08 (2716): No heartbeat from core client for 30 sec - exiting
00:54:10 (2716): No heartbeat from core client for 30 sec - exiting
00:54:11 (2716): No heartbeat from core client for 30 sec - exiting
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=612, selfPID=612, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4864, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4500, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4664, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4196, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4196, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=416, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=1
Model crash detected, will try to restart...
20:47:30 (4272): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Sep 2010 21:52:57 1086791 11691253 famous_v0ei_599_200_006686296_4 1,338,506 619,266 0.4627
11 Sep 2010 20:38:01 1086791 11691253 famous_v0ei_599_200_006686296_4 1,329,146 614,798 0.4626
11 Sep 2010 19:16:49 1086791 11691253 famous_v0ei_599_200_006686296_4 1,319,786 610,342 0.4625
11 Sep 2010 17:55:32 1086791 11691253 famous_v0ei_599_200_006686296_4 1,310,426 605,908 0.4624
11 Sep 2010 16:41:06 1086791 11691253 famous_v0ei_599_200_006686296_4 1,301,066 601,478 0.4623
11 Sep 2010 15:22:09 1086791 11691253 famous_v0ei_599_200_006686296_4 1,291,706 596,855 0.4621
11 Sep 2010 14:02:09 1086791 11691253 famous_v0ei_599_200_006686296_4 1,282,346 592,319 0.4619
11 Sep 2010 12:41:35 1086791 11691253 famous_v0ei_599_200_006686296_4 1,272,986 587,875 0.4618
11 Sep 2010 11:15:45 1086791 11691253 famous_v0ei_599_200_006686296_4 1,263,626 583,457 0.4617
11 Sep 2010 01:57:53 1086791 11691253 famous_v0ei_599_200_006686296_4 1,254,266 579,007 0.4616
11 Sep 2010 00:36:21 1086791 11691253 famous_v0ei_599_200_006686296_4 1,244,906 574,585 0.4615
11 Sep 2010 00:07:04 1086791 11691253 famous_v0ei_599_200_006686296_4 1,235,546 570,162 0.4615
10 Sep 2010 21:37:32 1086791 11691253 famous_v0ei_599_200_006686296_4 1,226,186 565,697 0.4613
10 Sep 2010 20:23:21 1086791 11691253 famous_v0ei_599_200_006686296_4 1,216,826 561,246 0.4612
10 Sep 2010 19:01:48 1086791 11691253 famous_v0ei_599_200_006686296_4 1,207,466 556,767 0.4611
10 Sep 2010 17:39:56 1086791 11691253 famous_v0ei_599_200_006686296_4 1,198,106 552,245 0.4609
10 Sep 2010 16:03:22 1086791 11691253 famous_v0ei_599_200_006686296_4 1,188,746 547,824 0.4608
10 Sep 2010 13:56:53 1086791 11691253 famous_v0ei_599_200_006686296_4 1,179,386 543,405 0.4608
10 Sep 2010 12:39:50 1086791 11691253 famous_v0ei_599_200_006686296_4 1,170,026 538,949 0.4606
10 Sep 2010 11:18:29 1086791 11691253 famous_v0ei_599_200_006686296_4 1,160,666 534,386 0.4604


©2024 cpdn.org