Task 13353159

Name hadcm3n_o255_1940_40_007444824_1
Workunit 7642327
Created 9 Sep 2011, 11:28:47 UTC
Sent 9 Sep 2011, 11:29:53 UTC
Report deadline 9 Dec 2011, 18:57:04 UTC
Received 25 Oct 2011, 15:28:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1166047
Run time 12 days 14 hours 38 min 46 sec
CPU time 11 days 20 hours 16 min 3 sec
Validate state Invalid
Credit 5,909.76
Device peak FLOPS 2.46 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin
Stderr
<core_client_version>6.12.35</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:39:57 (1294): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:37:00 (49853): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:48:36 (59542): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:19:14 (11387): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:08:27 (11594): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:08:28 (11594): No heartbeat from core client for 30 sec - exiting
18:08:29 (11594): No heartbeat from core client for 30 sec - exiting
18:10:34 (15285): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:12:51 (15379): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1
Model crash detected, will try to restart...
execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 130980) failed!
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8885, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
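The three numbers in the `process exited with code 22 (0x16, -234)` message look like the same exit status written three ways: decimal, hexadecimal, and the value with every bit above the low byte set. A minimal Python sketch of that reading follows. It is an interpretation of the log line, not code taken from the BOINC sources, and `describe_exit_code` is a hypothetical helper name.

```python
def describe_exit_code(code: int) -> str:
    """Format an exit status the way the <message> line appears to."""
    # Assumption: the third number is the code with all bits above the low
    # byte set; for 0 <= code <= 255 this equals code - 256.
    signed = code | ~0xFF
    return f"process exited with code {code} (0x{code:x}, {signed})"


print(describe_exit_code(22))
```

Under that assumption the helper reproduces the message text for exit status 22, which matches the `Exit status 22 (0x00000016)` field reported for the task.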
Latest Trickles Received
Time Sent (UTC)        Host ID   Result ID  Result Name                        Timestep  CPU Time (sec)  Average (sec/TS)
19 Oct 2011 08:16:00   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    492,480       1,006,674            2.0441
13 Oct 2011 10:16:31   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    466,560         953,542            2.0438
12 Oct 2011 09:30:52   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    440,640         900,271            2.0431
11 Oct 2011 08:30:58   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    414,720         847,287            2.0430
10 Oct 2011 07:58:35   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    388,800         794,165            2.0426
09 Oct 2011 15:48:14   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    362,880         741,105            2.0423
09 Oct 2011 00:29:14   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    336,960         687,828            2.0413
08 Oct 2011 09:19:15   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    311,040         634,596            2.0402
07 Oct 2011 08:50:51   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    285,120         581,117            2.0381
06 Oct 2011 08:33:04   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    259,200         527,818            2.0363
05 Oct 2011 05:21:26   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    233,280         474,600            2.0345
04 Oct 2011 04:27:38   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    207,360         422,101            2.0356
03 Oct 2011 03:11:47   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    181,440         369,593            2.0370
02 Oct 2011 11:12:46   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    155,520         316,586            2.0357
01 Oct 2011 20:19:58   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    129,600         263,567            2.0337
01 Oct 2011 03:53:51   1166047   13353159   hadcm3n_o255_1940_40_007444824_1    103,680         211,030            2.0354
30 Sep 2011 02:25:03   1166047   13353159   hadcm3n_o255_1940_40_007444824_1     77,760         158,419            2.0373
29 Sep 2011 01:42:14   1166047   13353159   hadcm3n_o255_1940_40_007444824_1     51,840         105,596            2.0370
27 Sep 2011 06:51:40   1166047   13353159   hadcm3n_o255_1940_40_007444824_1     25,920          52,774            2.0360
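
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. A short Python sketch checks that reading against three rows copied from the table. The function name `average_sec_per_timestep` is illustrative and not part of any CPDN tooling.

```python
# (timestep, cpu_time_sec, reported_average) copied from three trickle rows
ROWS = [
    (492_480, 1_006_674, 2.0441),
    (259_200, 527_818, 2.0363),
    (25_920, 52_774, 2.0360),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in ROWS:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Compare the recomputed ratio with the value shown in the table.
    print(f"{timestep:>7}  computed {computed:.4f}  reported {reported:.4f}")
```

For these rows the recomputed ratio agrees with the reported value to four decimal places, consistent with a steady rate of about 2.04 CPU seconds per timestep over the run.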

