Task 11992300

Name: famous_viu2_999_200_006710176_5
Workunit: 6913429
Created: 11 Nov 2010, 8:21:05 UTC
Sent: 11 Nov 2010, 8:22:59 UTC
Report deadline: 10 Feb 2011, 15:50:10 UTC
Received: 14 Dec 2010, 9:41:52 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 22 (0x00000016) Unknown error code
Computer ID: 1112786
Run time: 8 days 7 hours 13 min 54 sec
CPU time: 7 days 22 hours 10 min 54 sec
Validate state: Invalid
Credit: 5,249.96
Device peak FLOPS: 2.90 GFLOPS
Application version: UK Met Office FAMOUS v6.11 windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
03:01:51 (3224): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:13:58 (6908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
17:40:15 (7752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
00:34:05 (4388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
12:06:51 (8664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
16:11:03 (8476): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6072, iMonCtr=1
Model crash detected, will try to restart...
12:06:51 (5648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:46:46 (1436): Can't acquire lockfile (32) - waiting 35s
12:46:53 (4796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:47:52 (1436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:31:14 (984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:31:15 (984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
21:48:33 (6992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:51:15 (5992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5688, iMonCtr=1
Model crash detected, will try to restart...
23:27:08 (5396): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:03:57 (524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:07:59 (5980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:08:02 (5980): No heartbeat from core client for 30 sec - exiting
02:08:03 (5980): No heartbeat from core client for 30 sec - exiting
02:09:09 (4532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:59:37 (1432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:00:46 (4888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:03:32 (2056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:04:17 (2116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:05:55 (4336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:08:31 (356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:12:38 (5040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:14:28 (716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:26:51 (5808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFOUT: Write Failed: Invalid argument
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy
Signal 11 received, exiting...
04:03:41 (3952): called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5216, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
04:03:47 (392): called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5216, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
04:03:58 (3384): called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5216, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
04:04:08 (1964): called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5216, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
04:04:14 (4752): called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5216, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
04:04:18 (5216): called boinc_finish

</stderr_txt>
]]>
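The stderr output above is dominated by a few repeated message types. A minimal sketch of how they could be tallied follows; the category names and the `tally` helper are hypothetical, and the `sample` list holds only a handful of lines copied from the log, not the whole output.

```python
import re
from collections import Counter

# Hypothetical category names; each pattern matches message text seen in the log.
PATTERNS = {
    "quit_request": re.compile(r"Quit request from BOINC"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "model_crash": re.compile(r"Model crash detected"),
    "signal_11": re.compile(r"Signal 11 received"),
}


def tally(lines):
    """Count how many lines fall into each category (first matching pattern wins)."""
    counts = Counter()
    for line in lines:
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
                break
    return counts


# A few lines copied from the stderr text, used here only as sample input.
sample = [
    "CPDN Monitor - Quit request from BOINC...",
    "CPDN Monitor - Quit request from BOINC...",
    "03:01:51 (3224): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "Model crash detected, will try to restart...",
    "Signal 11 received, exiting...",
]

counts = tally(sample)
for name, count in sorted(counts.items()):
    print(f"{name}: {count}")
```

The quoted `No 'heartbeat' from BOINC` monitor line deliberately matches no pattern, so each heartbeat loss is counted once, from the timestamped core-client line.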
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
14 Dec 2010 09:47:18 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,591,226 | 683,905 | 0.4298
14 Dec 2010 09:47:18 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,581,866 | 680,228 | 0.4300
14 Dec 2010 09:47:18 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,572,506 | 675,863 | 0.4298
13 Dec 2010 23:44:07 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,563,146 | 671,346 | 0.4295
13 Dec 2010 22:21:18 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,553,786 | 666,887 | 0.4292
13 Dec 2010 21:01:34 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,544,426 | 662,383 | 0.4289
13 Dec 2010 19:39:50 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,535,066 | 657,897 | 0.4286
13 Dec 2010 18:28:38 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,525,706 | 653,483 | 0.4283
13 Dec 2010 16:53:08 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,516,346 | 649,174 | 0.4281
13 Dec 2010 15:24:37 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,506,986 | 644,709 | 0.4278
13 Dec 2010 13:59:42 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,497,626 | 640,315 | 0.4276
13 Dec 2010 12:41:49 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,488,266 | 635,849 | 0.4272
13 Dec 2010 11:50:46 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,478,906 | 631,456 | 0.4270
13 Dec 2010 11:50:46 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,469,546 | 626,984 | 0.4267
13 Dec 2010 11:50:46 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,460,186 | 622,514 | 0.4263
13 Dec 2010 11:50:46 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,450,826 | 618,079 | 0.4260
13 Dec 2010 11:50:46 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,441,466 | 613,640 | 0.4257
13 Dec 2010 11:50:46 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,432,106 | 609,275 | 0.4254
13 Dec 2010 11:50:46 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,422,746 | 604,850 | 0.4251
12 Dec 2010 23:24:45 | 1112786 | 11992300 | famous_viu2_999_200_006710176_5 | 1,413,386 | 600,640 | 0.4250
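
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The page does not state this, so the relationship is an assumption; the sketch below checks it against the three most recent rows. The helper name `sec_per_timestep` is hypothetical.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
rows = [
    (1_591_226, 683_905, 0.4298),
    (1_581_866, 680_228, 0.4300),
    (1_572_506, 675_863, 0.4298),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep, to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in rows:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f}")
```

For these three rows the computed value agrees with the reported one to four decimal places, which is consistent with the assumed definition but does not prove it.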


©2024 climateprediction.net