climateprediction.net home page
Task 13819816

Task 13819816

Name hadcm3n_yht1_1900_40_007515932_4
Workunit 7713407
Created 25 Dec 2011, 18:21:26 UTC
Sent 25 Dec 2011, 18:23:18 UTC
Report deadline 26 Mar 2012, 1:50:29 UTC
Received 17 Mar 2012, 2:14:07 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1137859
Run time 17 days 21 hours 50 min 26 sec
CPU time 13 days 10 hours 8 min 4 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.19 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3332, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4424, iMonCtr=1
Model crash detected, will try to restart...
ContController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4092, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3772, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3632, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3608, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3772, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3712, iMonCtr=1
Model crash detected, will try to restart...
CoController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1
Model crash detected, wiController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3744, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3908, iMonCtr=1
Model crash detected, will try to restart...
ControllCCController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3568, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3652, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3584, iMonCtr=1
Model crash detected, will try to restart...
CCController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3780, iMonCtr=1
Model crash detected, will try to restart...
CSuspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CCSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3388, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3572, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4264, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3584, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/yht1ko.pjb5c10
Error converting file to netcdf: dataout/yht1ko.pib5c10
Error converting file to netcdf: dataout/yht1ko.pfb5c10
Error converting file to netcdf: dataout/yht1ka.phb5c10
Error converting file to netcdf: dataout/yht1ka.pgb5c10
Error converting file to netcdf: dataout/yht1ka.peb5c10
Error converting file to netcdf: dataout/yht1ka.pdb5c10
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3560, iMonCtr=1
Model crash detected, will try to restart...
CCSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3728, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3736, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3868, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3896, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
09:22:54 (3980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CSignal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Mar 2012 01:16:20 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 518,400 1,159,672 2.2370
10 Mar 2012 15:09:54 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 492,480 1,101,277 2.2362
05 Mar 2012 04:38:06 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 466,560 1,041,726 2.2328
29 Feb 2012 17:28:33 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 440,640 982,340 2.2293
24 Feb 2012 01:50:57 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 414,720 922,693 2.2249
20 Feb 2012 00:18:19 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 388,800 863,184 2.2201
17 Feb 2012 03:09:25 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 362,880 805,838 2.2207
13 Feb 2012 03:18:46 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 336,960 753,365 2.2358
11 Feb 2012 17:31:40 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 311,040 702,478 2.2585
06 Feb 2012 04:40:09 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 285,120 645,453 2.2638
02 Feb 2012 04:37:47 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 259,200 586,958 2.2645
28 Jan 2012 03:48:06 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 233,280 527,274 2.2603
23 Jan 2012 10:15:35 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 207,360 468,804 2.2608
21 Jan 2012 03:49:25 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 181,440 410,372 2.2618
16 Jan 2012 05:53:59 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 155,520 351,462 2.2599
11 Jan 2012 17:17:03 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 129,600 292,031 2.2533
07 Jan 2012 02:15:54 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 103,680 234,261 2.2595
03 Jan 2012 04:40:19 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 77,760 175,978 2.2631
30 Dec 2011 19:14:50 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 51,840 116,816 2.2534
28 Dec 2011 18:22:11 1137859 13819816 hadcm3n_yht1_1900_40_007515932_4 25,920 58,306 2.2495


©2024 cpdn.org