climateprediction.net home page
Task 13115349

Task 13115349

Name hadcm3n_yhnl_1900_40_007355739_1
Workunit 7553169
Created 6 Jul 2011, 14:42:03 UTC
Sent 10 Jul 2011, 3:14:09 UTC
Report deadline 9 Oct 2011, 10:41:20 UTC
Received 19 Aug 2011, 5:04:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1051720
Run time 25 days 4 hours 53 min 38 sec
CPU time 14 days 11 hours 12 min 51 sec
Validate state Invalid
Credit 5,287.68
Device peak FLOPS 2.49 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6200, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:25:27 (452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
05:01:13 (6676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15144, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8036, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8036, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Aug 2011 12:05:11 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 440,640 1,202,449 2.7289
14 Aug 2011 12:31:56 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 414,720 1,132,598 2.7310
12 Aug 2011 10:43:40 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 388,800 1,068,368 2.7479
11 Aug 2011 12:31:33 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 362,880 1,001,956 2.7611
09 Aug 2011 13:27:03 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 336,960 934,528 2.7734
04 Aug 2011 06:09:29 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 311,040 867,439 2.7888
01 Aug 2011 06:37:28 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 285,120 800,832 2.8088
30 Jul 2011 06:27:13 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 259,200 726,963 2.8046
28 Jul 2011 09:29:06 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 233,280 650,126 2.7869
26 Jul 2011 22:40:47 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 207,360 576,731 2.7813
26 Jul 2011 22:40:47 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 181,440 501,268 2.7627
25 Jul 2011 15:55:37 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 155,520 428,566 2.7557
25 Jul 2011 13:04:38 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 129,600 358,796 2.7685
25 Jul 2011 13:04:38 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 103,680 286,968 2.7678
25 Jul 2011 13:04:38 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 77,760 214,970 2.7645
25 Jul 2011 13:04:38 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 51,840 142,801 2.7546
11 Jul 2011 02:05:52 1051720 13115349 hadcm3n_yhnl_1900_40_007355739_1 25,920 71,172 2.7458


©2024 cpdn.org