climateprediction.net home page
Task 15686299

Task 15686299

Name hadcm3n_u7dd_2020_40_008337568_1
Workunit 8488429
Created 27 Mar 2013, 3:23:56 UTC
Sent 27 Mar 2013, 3:24:10 UTC
Report deadline 26 Jun 2013, 10:51:21 UTC
Received 13 May 2013, 10:38:16 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1146914
Run time 11 days 19 hours 1 min 24 sec
CPU time 11 days 12 hours 38 min 14 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.42 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5144, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5144, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5144, iMonCtr=1
Model crash detected, will try to restart...
13:08:02 (2608): No heartbeat from core client for 30 sec - exiting
13:08:03 (2608): No heartbeat from core client for 30 sec - exiting
13:08:04 (2608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6120, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5432, iMonCtr=1
Model crash detected, will try to restart...
09:58:58 (5924): No heartbeat from core client for 30 sec - exiting
09:58:59 (5924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4328, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4752, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4752, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5100, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1
Model crash detected, will try to restart...
11:04:02 (3588): No heartbeat from core client for 30 sec - exiting
11:04:04 (3588): No heartbeat from core client for 30 sec - exiting
11:04:05 (3588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5692, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5788, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 May 2013 07:33:06 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 518,400 995,889 1.9211
10 May 2013 11:07:07 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 492,480 947,970 1.9249
09 May 2013 09:21:13 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 466,560 896,248 1.9210
08 May 2013 00:21:41 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 440,640 845,614 1.9191
06 May 2013 03:42:09 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 414,720 795,834 1.9190
01 May 2013 03:07:23 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 388,800 744,713 1.9154
29 Apr 2013 05:56:53 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 362,880 694,510 1.9139
26 Apr 2013 07:22:28 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 336,960 639,450 1.8977
25 Apr 2013 05:30:58 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 311,040 588,370 1.8916
23 Apr 2013 02:32:01 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 285,120 536,944 1.8832
20 Apr 2013 06:48:14 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 259,200 488,370 1.8841
19 Apr 2013 03:01:43 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 233,280 439,274 1.8830
18 Apr 2013 01:53:03 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 207,360 389,527 1.8785
16 Apr 2013 01:27:07 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 181,440 341,395 1.8816
11 Apr 2013 06:23:15 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 155,520 292,152 1.8785
08 Apr 2013 03:40:11 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 129,600 247,572 1.9103
06 Apr 2013 05:07:11 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 103,680 194,702 1.8779
03 Apr 2013 07:04:47 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 77,760 139,854 1.7985
01 Apr 2013 11:35:30 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 51,840 92,156 1.7777
30 Mar 2013 00:31:38 1146914 15686299 hadcm3n_u7dd_2020_40_008337568_1 25,920 45,940 1.7724


©2024 cpdn.org