climateprediction.net home page
Task 12733961

Task 12733961

Name hadcm3n_o0th_1900_40_007196392_0
Workunit 7394672
Created 28 Mar 2011, 13:58:14 UTC
Sent 2 Apr 2011, 16:04:12 UTC
Report deadline 2 Jul 2011, 23:31:23 UTC
Received 20 Jun 2011, 3:29:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1022218
Run time 16 days 18 hours 24 min 57 sec
CPU time 12 days 22 hours 34 min 40 sec
Validate state Invalid
Credit 8,398.08
Device peak FLOPS 2.77 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.26</core_client_version>
<![CDATA[
<message>
Das Laufwerk kann einen bestimmten Bereich oder eine bestimmte Spur nicht finden. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5088, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
15:14:20 (4520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
20:00:38 (5524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:03:12 (4644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4060, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5660, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o0thko.pjc7c10
Error converting file to netcdf: dataout/o0thko.pic7c10
Error converting file to netcdf: dataout/o0thko.pfc7c10
Error converting file to netcdf: dataout/o0thka.phc7c10
Error converting file to netcdf: dataout/o0thka.pgc7c10
Error converting file to netcdf: dataout/o0thka.pec7c10
Error converting file to netcdf: dataout/o0thka.pdc7c10
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Jun 2011 13:16:55 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 699,840 1,093,159 1.5620
14 Jun 2011 15:44:38 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 673,920 1,051,482 1.5602
13 Jun 2011 10:34:44 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 648,000 1,009,860 1.5584
11 Jun 2011 08:05:20 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 622,080 965,394 1.5519
10 Jun 2011 17:44:22 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 596,160 921,351 1.5455
08 Jun 2011 17:39:59 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 570,240 880,398 1.5439
06 Jun 2011 17:09:08 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 544,320 840,733 1.5446
06 Jun 2011 06:19:04 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 518,400 802,086 1.5472
05 Jun 2011 20:14:24 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 492,480 764,110 1.5516
04 Jun 2011 08:07:48 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 466,560 725,663 1.5553
01 Jun 2011 17:58:33 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 440,640 686,171 1.5572
27 May 2011 17:14:48 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 414,720 643,828 1.5524
24 May 2011 20:57:14 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 388,800 601,519 1.5471
22 May 2011 19:46:31 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 362,880 559,626 1.5422
18 May 2011 19:46:39 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 336,960 518,294 1.5381
16 May 2011 18:26:17 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 311,040 477,597 1.5355
15 May 2011 18:33:34 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 285,120 436,947 1.5325
13 May 2011 08:44:17 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 259,200 395,886 1.5273
11 May 2011 18:29:30 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 233,280 355,139 1.5224
06 May 2011 19:32:03 1022218 12733961 hadcm3n_o0th_1900_40_007196392_0 207,360 315,365 1.5209


©2024 cpdn.org