climateprediction.net home page
Task 13651415

Name hadcm3n_yea2_1900_40_007518117_4
Workunit 7715592
Created 21 Nov 2011, 14:36:41 UTC
Sent 21 Nov 2011, 14:52:27 UTC
Report deadline 20 Feb 2012, 22:19:38 UTC
Received 30 Jan 2012, 10:03:53 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1175714
Run time 21 days 9 hours 13 min 23 sec
CPU time 20 days 16 hours 4 min 28 sec
Validate state Invalid
Credit 9,953.28
Device peak FLOPS 2.50 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
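
The run time, CPU time, and exit status fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original page: the helper name `duration_to_seconds` and the unit table are assumptions made for illustration, and the input strings are copied from the fields above.

```python
import re

# Seconds per unit, keyed by the unit words used in the task summary.
UNIT_SECONDS = {"days": 86400, "hours": 3600, "min": 60, "sec": 1}


def duration_to_seconds(text: str) -> int:
    """Convert a string such as '21 days 9 hours 13 min 23 sec' to seconds.

    Hypothetical helper; it only understands the four unit words above.
    """
    total = 0
    for value, unit in re.findall(r"(\d+)\s+(days|hours|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total


run_time = duration_to_seconds("21 days 9 hours 13 min 23 sec")
cpu_time = duration_to_seconds("20 days 16 hours 4 min 28 sec")

# Fraction of wall-clock run time during which the CPU was busy.
cpu_fraction = cpu_time / run_time

# Exit status 25 written as the 8-digit hex value shown on the page.
exit_status_hex = f"0x{25:08x}"

print(run_time, cpu_time, round(cpu_fraction, 3), exit_status_hex)
```

Under these assumptions the CPU time is slightly above the 1,779,828 seconds reported in the most recent trickle, which is what one would expect if the task ran a little longer after that trickle was sent.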
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3572, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1748, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:00:35 (4008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3832, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2936, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3388, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4072, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4092, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3016, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3536, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3164, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/yea2ko.pjb7c10
Error converting file to netcdf: dataout/yea2ko.pib7c10
Error converting file to netcdf: dataout/yea2ko.pfb7c10
Error converting file to netcdf: dataout/yea2ka.phb7c10
Error converting file to netcdf: dataout/yea2ka.pgb7c10
Error converting file to netcdf: dataout/yea2ka.peb7c10
Error converting file to netcdf: dataout/yea2ka.pdb7c10
Suspended CPDN Monitor - Suspend request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3512, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
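
A stderr dump like the one above is easier to read once the repeated messages are tallied. The sketch below is an illustration only: the labels in `PATTERNS` and the function name `tally` are assumptions, and the substrings are taken from messages that appear in the log. The `sample` text is a short excerpt rather than the full log, so the counts it yields describe the excerpt alone.

```python
from collections import Counter

# Hypothetical labels mapped to substrings that occur in the stderr output.
PATTERNS = {
    "restart": "Model crash detected, will try to restart",
    "quit": "Quit request from BOINC",
    "suspend": "Suspend request from BOINC",
    "buffin_error": "BUFFIN: C I/O Error",
    "netcdf_error": "Error converting file to netcdf",
}


def tally(stderr_text: str) -> Counter:
    """Count how many lines contain each known message substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
    return counts


# Short excerpt copied from the log above.
sample = """\
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
Error converting file to netcdf: dataout/yea2ko.pjb7c10
Called boinc_finish
"""

for label, count in sorted(tally(sample).items()):
    print(f"{label}: {count}")
```

Passing the full contents of the `<stderr_txt>` element to `tally` would give the totals for the whole task.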
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
30 Jan 2012 06:15:30  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  829,440   1,779,828       2.1458
29 Jan 2012 16:00:27  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  803,520   1,728,990       2.1518
29 Jan 2012 00:43:46  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  777,600   1,678,149       2.1581
28 Jan 2012 09:28:31  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  751,680   1,627,221       2.1648
27 Jan 2012 15:22:00  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  725,760   1,576,575       2.1723
26 Jan 2012 11:00:30  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  699,840   1,526,273       2.1809
24 Jan 2012 16:41:56  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  673,920   1,475,973       2.1901
24 Jan 2012 00:45:44  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  648,000   1,425,686       2.2001
20 Jan 2012 14:51:16  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  622,080   1,375,000       2.2103
19 Jan 2012 08:06:23  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  596,160   1,322,574       2.2185
17 Jan 2012 13:44:21  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  570,240   1,270,402       2.2278
16 Jan 2012 12:21:02  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  544,320   1,218,249       2.2381
15 Jan 2012 19:24:56  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  518,400   1,165,678       2.2486
15 Jan 2012 04:44:37  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  492,480   1,113,250       2.2605
14 Jan 2012 13:38:11  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  466,560   1,060,499       2.2730
13 Jan 2012 21:12:38  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  440,640   1,007,845       2.2872
12 Jan 2012 12:04:07  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  414,720   955,236         2.3033
10 Jan 2012 17:05:27  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  388,800   902,332         2.3208
09 Jan 2012 11:44:05  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  362,880   849,599         2.3413
08 Jan 2012 13:30:59  1175714  13651415   hadcm3n_yea2_1900_40_007518117_4  336,960   790,675         2.3465
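
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is an inference from the numbers, not something the page states. The sketch below checks it against the newest and oldest rows; the function names are hypothetical.

```python
def parse_int(field: str) -> int:
    """Parse an integer that uses commas as thousands separators."""
    return int(field.replace(",", ""))


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places.

    Assumes the table's average is simply CPU time / timestep.
    """
    return round(cpu_time_sec / timestep, 4)


# (Timestep, CPU Time (sec), Average (sec/TS)) copied from the first and
# last rows of the trickle table.
rows = [
    ("829,440", "1,779,828", 2.1458),
    ("336,960", "790,675", 2.3465),
]

for timestep_field, cpu_field, reported in rows:
    computed = average_sec_per_timestep(
        parse_int(cpu_field), parse_int(timestep_field)
    )
    print(timestep_field, computed, reported)
```

Consecutive rows differ by 25,920 timesteps, so the falling average indicates that the later timesteps were computed somewhat faster than the earlier ones.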