Task 11837720

Name famous_vmz7_1999_200_006715545_4
Workunit 6918798
Created 26 Aug 2010, 17:55:09 UTC
Sent 29 Sep 2010, 23:19:02 UTC
Report deadline 30 Dec 2010, 6:46:13 UTC
Received 28 Oct 2010, 13:03:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1017601
Run time 8 days 7 hours 20 min 23 sec
CPU time 5 days 15 hours 28 min 23 sec
Validate state Invalid
Credit 2,872.08
Device peak FLOPS 1.90 GFLOPS
Application version UK Met Office FAMOUS v6.11 windows_intelx86
Stderr
<core_client_version>6.6.20</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4304, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:49:07 (9520): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
00:49:08 (9520): No heartbeat from core client for 30 sec - exiting
00:49:09 (9520): No heartbeat from core client for 30 sec - exiting

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:57:56 (2120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:08:57 (944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:25:42 (2192): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:00:43 (6276): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:08:11 (8032): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
03:24:55 (8032): No heartbeat from core client for 30 sec - exiting

Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO                                                                                                                                                                                                      tmp/pipe_dummy                                                                  

Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO                                                                                                                                                                                                      tmp/pipe_dummy                                                                  

Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO                                                                                                                                                                                                      tmp/pipe_dummy                                                                  

Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO                                                                                                                                                                                                      tmp/pipe_dummy                                                                  

Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO                                                                                                                                                                                                      tmp/pipe_dummy                                                                  

Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO                                                                                                                                                                                                      tmp/pipe_dummy                                                                  
Sorry, too many model crashes! :-(
09:03:50 (6748): called boinc_finish

</stderr_txt>
]]>
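The stderr output above repeats a small set of message types, so a quick tally can make the sequence easier to follow. The following is a minimal sketch, not part of the BOINC or CPDN tooling: the `PATTERNS` labels, the `tally` helper, and the `sample` excerpt (a few lines copied from the log above) are illustrative assumptions.

```python
import re
from collections import Counter

# Hypothetical labels for the message types seen in the stderr text above.
PATTERNS = {
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "buffin_read_failed": re.compile(r"BUFFIN: Read Failed"),
    "model_crashed": re.compile(r"Model crashed:"),
}


def tally(stderr_text):
    """Count how many lines match each labelled pattern (first match wins)."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
    return counts


# A short excerpt copied from the log above, used only as sample input.
sample = """
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:49:07 (9520): No heartbeat from core client for 30 sec - exiting
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
Model crashed: READHIST: Read ERROR on history file for namelist NLIHISTO
Sorry, too many model crashes! :-(
"""

for label, count in sorted(tally(sample).items()):
    print(f"{label}: {count}")
```

Applied to the full stderr text, a count like this would pick up the six `Model crashed` lines that precede the final "too many model crashes" message.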
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
26 Oct 2010 23:16:42  1017601  11837720   famous_vmz7_1999_200_006715545_4  870,506   484,721         0.5568
26 Oct 2010 20:26:51  1017601  11837720   famous_vmz7_1999_200_006715545_4  861,146   479,899         0.5573
26 Oct 2010 18:04:34  1017601  11837720   famous_vmz7_1999_200_006715545_4  851,786   474,996         0.5576
26 Oct 2010 15:00:33  1017601  11837720   famous_vmz7_1999_200_006715545_4  842,426   469,944         0.5578
26 Oct 2010 09:22:32  1017601  11837720   famous_vmz7_1999_200_006715545_4  833,066   465,122         0.5583
26 Oct 2010 06:25:58  1017601  11837720   famous_vmz7_1999_200_006715545_4  823,706   460,322         0.5588
26 Oct 2010 03:07:45  1017601  11837720   famous_vmz7_1999_200_006715545_4  814,346   455,332         0.5591
25 Oct 2010 17:03:55  1017601  11837720   famous_vmz7_1999_200_006715545_4  804,986   450,553         0.5597
25 Oct 2010 14:10:20  1017601  11837720   famous_vmz7_1999_200_006715545_4  795,626   445,713         0.5602
25 Oct 2010 10:06:37  1017601  11837720   famous_vmz7_1999_200_006715545_4  786,266   440,668         0.5605
25 Oct 2010 07:02:26  1017601  11837720   famous_vmz7_1999_200_006715545_4  776,906   435,843         0.5610
25 Oct 2010 03:55:03  1017601  11837720   famous_vmz7_1999_200_006715545_4  767,546   430,992         0.5615
25 Oct 2010 03:20:59  1017601  11837720   famous_vmz7_1999_200_006715545_4  758,186   425,683         0.5614
24 Oct 2010 22:12:50  1017601  11837720   famous_vmz7_1999_200_006715545_4  748,826   420,219         0.5612
24 Oct 2010 20:15:59  1017601  11837720   famous_vmz7_1999_200_006715545_4  739,466   414,835         0.5610
24 Oct 2010 18:18:30  1017601  11837720   famous_vmz7_1999_200_006715545_4  730,106   409,430         0.5608
24 Oct 2010 11:18:22  1017601  11837720   famous_vmz7_1999_200_006715545_4  720,746   403,990         0.5605
24 Oct 2010 08:47:06  1017601  11837720   famous_vmz7_1999_200_006715545_4  711,386   398,623         0.5603
24 Oct 2010 06:29:56  1017601  11837720   famous_vmz7_1999_200_006715545_4  702,026   393,439         0.5604
23 Oct 2010 19:45:17  1017601  11837720   famous_vmz7_1999_200_006715545_4  692,666   388,151         0.5604
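
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That is an inference from the figures in the table, not something the page states. A minimal sketch that checks it against three rows copied from the table (the function name `seconds_per_timestep` is hypothetical):

```python
def seconds_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds divided by the number of timesteps completed."""
    return cpu_time_sec / timestep


# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
rows = [
    (870_506, 484_721, 0.5568),
    (861_146, 479_899, 0.5573),
    (692_666, 388_151, 0.5604),
]

for timestep, cpu_time_sec, reported in rows:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    # Compare at the four-decimal precision shown on the page.
    agrees = abs(computed - reported) < 5e-5
    print(f"{timestep:>9,}  computed={computed:.4f}  reported={reported:.4f}  agrees={agrees}")
```

In the rows shown, consecutive trickles are 9,360 timesteps apart, so the same division could be applied to the differences between rows to estimate the cost of each interval rather than the running average.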