climateprediction.net home page
Task 16046358

Task 16046358

Name hadcm3n_ofu1_1900_40_008475644_0
Workunit 8626483
Created 27 Sep 2013, 10:38:35 UTC
Sent 27 Sep 2013, 12:49:42 UTC
Report deadline 27 Dec 2013, 20:16:53 UTC
Received 18 Nov 2013, 7:30:34 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1092776
Run time 8 days 19 hours 10 min 40 sec
CPU time 8 days 3 hours 11 min 21 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.79 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 193 (0xc1)
</message>
<stderr_txt>
08:06:39 (6380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7828, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5672, iMonCtr=1
Model crash detected, will try to restart...
08:24:25 (7788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7484, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5832, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5832, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1452, iMonCtr=1
Model crash detected, will try to restart...
12:00:18 (7284): No heartbeat from core client for 30 sec - exiting
12:00:19 (7284): No heartbeat from core client for 30 sec - exiting
12:00:20 (7284): No heartbeat from core client for 30 sec - exiting
12:00:21 (7284): No heartbeat from core client for 30 sec - exiting
12:00:22 (7284): No heartbeat from core client for 30 sec - exiting
12:00:23 (7284): No heartbeat from core client for 30 sec - exiting
12:00:24 (7284): No heartbeat from core client for 30 sec - exiting
12:00:25 (7284): No heartbeat from core client for 30 sec - exiting
12:00:26 (7284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6484, iMonCtr=1
Model crash detected, will try to restart...
08:34:58 (7720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8184, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7124, iMonCtr=1
Model crash detected, will try to restart...
08:38:15 (7340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:38:16 (7340): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7496, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7496, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4196, iMonCtr=1
Model crash detected, will try to restart...
C15:33:28 (260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8452, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6392, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6392, iMonCtr=1
Model crash detected, will try to restart...
16:05:16 (7544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:05:17 (7544): No heartbeat from core client for 30 sec - exiting
16:05:18 (7544): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6024, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6312, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6312, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7876, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7756, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7756, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3204, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3204, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Nov 2013 12:46:43 1092776 16046358 hadcm3n_ofu1_1900_40_008475644_0 259,200 702,645 2.7108
12 Nov 2013 14:42:05 1092776 16046358 hadcm3n_ofu1_1900_40_008475644_0 233,280 633,095 2.7139
08 Nov 2013 10:42:47 1092776 16046358 hadcm3n_ofu1_1900_40_008475644_0 207,360 562,239 2.7114
06 Nov 2013 07:22:06 1092776 16046358 hadcm3n_ofu1_1900_40_008475644_0 181,440 492,348 2.7136
31 Oct 2013 09:24:20 1092776 16046358 hadcm3n_ofu1_1900_40_008475644_0 155,520 420,475 2.7037
25 Oct 2013 12:20:30 1092776 16046358 hadcm3n_ofu1_1900_40_008475644_0 129,600 351,142 2.7094
23 Oct 2013 09:06:48 1092776 16046358 hadcm3n_ofu1_1900_40_008475644_0 103,680 281,029 2.7105
18 Oct 2013 11:05:56 1092776 16046358 hadcm3n_ofu1_1900_40_008475644_0 77,760 211,147 2.7154
16 Oct 2013 06:38:56 1092776 16046358 hadcm3n_ofu1_1900_40_008475644_0 51,840 140,703 2.7142
02 Oct 2013 09:14:44 1092776 16046358 hadcm3n_ofu1_1900_40_008475644_0 25,920 70,856 2.7336


©2024 cpdn.org