climateprediction.net home page
Task 12366134

Task 12366134

Name famous_v3wf_1199_200_006690821_6
Workunit 6894074
Created 6 Dec 2010, 22:12:06 UTC
Sent 6 Dec 2010, 22:15:19 UTC
Report deadline 8 Mar 2011, 5:42:30 UTC
Received 14 Feb 2011, 12:44:14 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1117493
Run time 14 days 0 hours 6 min 53 sec
CPU time 12 days 19 hours 1 min 5 sec
Validate state Invalid
Credit 5,991.12
Device peak FLOPS 2.32 GFLOPS
Application version UK Met Office FAMOUS v6.11
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1244, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
21:45:33 (3316): called boinc_finish
17:14:34 (4968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:12:58 (2544): Can't acquire lockfile (32) - waiting 35s
10:13:00 (4384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:37:59 (6848): Can't acquire lockfile (32) - waiting 35s
09:38:07 (1696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4684, iMonCtr=1
Model crash detected, will try to restart...
Controller:Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:35:37 (4984): Can't acquire lockfile (32) - waiting 35s
09:35:54 (5772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:19:57 (5436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7584, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:05:48 (7828): Can't acquire lockfile (32) - waiting 35s
10:06:07 (4372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:46:04 (3580): Can't acquire lockfile (32) - waiting 35s
12:46:31 (1116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=556, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  
Sorry, too many model crashes! :-(
12:43:12 (4112): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Feb 2011 11:50:06 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,815,866 1,102,094 0.6069
14 Feb 2011 10:40:14 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,806,506 1,097,932 0.6078
14 Feb 2011 09:24:47 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,797,146 1,093,785 0.6086
13 Feb 2011 22:19:41 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,787,786 1,088,789 0.6090
13 Feb 2011 21:02:42 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,778,426 1,084,619 0.6099
13 Feb 2011 19:35:39 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,769,066 1,080,299 0.6107
13 Feb 2011 18:31:24 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,759,706 1,076,196 0.6116
13 Feb 2011 00:15:21 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,750,346 1,071,826 0.6124
12 Feb 2011 22:32:55 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,740,986 1,066,781 0.6127
12 Feb 2011 21:09:10 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,731,626 1,062,602 0.6136
12 Feb 2011 12:23:39 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,722,266 1,057,514 0.6140
11 Feb 2011 15:19:41 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,712,906 1,053,398 0.6150
11 Feb 2011 14:01:53 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,703,546 1,049,367 0.6160
11 Feb 2011 13:19:31 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,694,186 1,045,338 0.6170
11 Feb 2011 13:19:31 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,684,826 1,041,277 0.6180
11 Feb 2011 13:19:31 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,675,466 1,037,257 0.6191
11 Feb 2011 13:19:31 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,666,106 1,033,272 0.6202
10 Feb 2011 16:16:35 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,656,746 1,029,050 0.6211
10 Feb 2011 14:57:06 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,647,386 1,025,067 0.6222
10 Feb 2011 13:35:12 1117493 12366134 famous_v3wf_1199_200_006690821_6 1,638,026 1,021,086 0.6234


©2024 cpdn.org