climateprediction.net home page
Task 14844817

Task 14844817

Name hadcm3n_yeri_1980_40_008000272_2
Workunit 8155386
Created 27 Jun 2012, 8:04:52 UTC
Sent 27 Jun 2012, 8:05:02 UTC
Report deadline 26 Sep 2012, 15:32:13 UTC
Received 7 Aug 2012, 11:16:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1156294
Run time 20 days 7 hours 35 min 33 sec
CPU time 17 days 14 hours 16 min 19 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 2.26 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Das Laufwerk kann einen bestimmten Bereich oder eine bestimmte Spur nicht finden. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3484, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4608, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4476, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3232, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4452, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:07:36 (4572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3504, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3252, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4716, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3880, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3672, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5088, iMonCtr=1
Model crash detected, will try to restart...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Aug 2012 12:39:26 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 388,800 1,480,835 3.8087
04 Aug 2012 08:10:27 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 362,880 1,390,137 3.8308
01 Aug 2012 08:15:19 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 336,960 1,292,302 3.8352
29 Jul 2012 16:05:54 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 311,040 1,204,848 3.8736
27 Jul 2012 08:50:13 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 285,120 1,115,176 3.9113
24 Jul 2012 11:14:38 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 259,200 1,018,516 3.9295
22 Jul 2012 11:22:26 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 233,280 929,861 3.9860
20 Jul 2012 10:52:26 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 207,360 852,725 4.1123
18 Jul 2012 10:59:34 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 181,440 774,777 4.2702
16 Jul 2012 04:50:42 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 155,520 695,733 4.4736
13 Jul 2012 18:40:14 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 129,600 615,426 4.7487
11 Jul 2012 13:19:40 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 103,680 542,825 5.2356
03 Jul 2012 17:20:56 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 77,760 223,235 2.8708
02 Jul 2012 11:45:07 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 51,840 145,633 2.8093
29 Jun 2012 06:20:52 1156294 14844817 hadcm3n_yeri_1980_40_008000272_2 25,920 68,805 2.6545


©2024 climateprediction.net