climateprediction.net home page
Task 13635075

Task 13635075

Name hadcm3n_yc77_1900_40_007519593_2
Workunit 7717068
Created 15 Nov 2011, 18:38:08 UTC
Sent 18 Nov 2011, 14:24:44 UTC
Report deadline 17 Feb 2012, 21:51:55 UTC
Received 6 Dec 2011, 14:54:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1034263
Run time 14 days 13 hours 50 min 1 sec
CPU time 8 days 6 hours 26 min 40 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 1.49 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8624, iMonCtr=1
Model crash detected, will try to restart...
19:25:24 (7324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8604, iMonCtr=1
Model crash detected, will try to restart...
23:15:03 (6844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5380, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7232, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6964, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6944, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7144, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CCPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/yc77ko.pja6c10
Error converting file to netcdf: dataout/yc77ko.pia6c10
Error converting file to netcdf: dataout/yc77ko.pfa6c10
Error converting file to netcdf: dataout/yc77ka.pha6c10
Error converting file to netcdf: dataout/yc77ka.pga6c10
Error converting file to netcdf: dataout/yc77ka.pea6c10
Error converting file to netcdf: dataout/yc77ka.pda6c10
20:38:11 (6920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:38:12 (6920): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13204, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7212, iMonCtr=1
Model crash detected, will try to restart...
02:53:23 (7912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3384, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7892, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7544, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8012, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6852, iMonCtr=1
Model crash detected, will try to restart...
03:48:29 (6876): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:37:36 (1520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Dec 2011 14:57:55 1034263 13635075 hadcm3n_yc77_1900_40_007519593_2 259,200 714,380 2.7561
03 Dec 2011 04:30:19 1034263 13635075 hadcm3n_yc77_1900_40_007519593_2 233,280 641,798 2.7512
01 Dec 2011 14:37:56 1034263 13635075 hadcm3n_yc77_1900_40_007519593_2 207,360 567,975 2.7391
30 Nov 2011 03:24:52 1034263 13635075 hadcm3n_yc77_1900_40_007519593_2 181,440 496,172 2.7346
28 Nov 2011 01:28:23 1034263 13635075 hadcm3n_yc77_1900_40_007519593_2 155,520 426,409 2.7418
26 Nov 2011 08:34:31 1034263 13635075 hadcm3n_yc77_1900_40_007519593_2 129,600 357,547 2.7589
23 Nov 2011 22:18:31 1034263 13635075 hadcm3n_yc77_1900_40_007519593_2 103,680 286,227 2.7607
22 Nov 2011 19:38:29 1034263 13635075 hadcm3n_yc77_1900_40_007519593_2 77,760 218,500 2.8099
21 Nov 2011 13:19:38 1034263 13635075 hadcm3n_yc77_1900_40_007519593_2 51,840 149,144 2.8770
20 Nov 2011 01:06:08 1034263 13635075 hadcm3n_yc77_1900_40_007519593_2 25,920 75,724 2.9215


©2024 cpdn.org