Task 13593277

Name hadcm3n_yaxj_1900_40_007520521_2
Workunit 7717996
Created 4 Nov 2011, 12:16:49 UTC
Sent 4 Nov 2011, 12:19:32 UTC
Report deadline 3 Feb 2012, 19:46:43 UTC
Received 4 Dec 2011, 5:32:01 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 964976
Run time 15 days 12 hours 39 min 42 sec
CPU time 12 days 21 hours 36 min 22 sec
Validate state Invalid
Credit 8,709.12
Device peak FLOPS 3.22 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.60</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:11:17 (9528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:53:54 (6068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:55:49 (1304): No heartbeat from core client for 30 sec - exiting
12:55:50 (1304): No heartbeat from core client for 30 sec - exiting
12:55:51 (1304): No heartbeat from core client for 30 sec - exiting
12:55:52 (1304): No heartbeat from core client for 30 sec - exiting
12:55:53 (1304): No heartbeat from core client for 30 sec - exiting
12:55:54 (1304): No heartbeat from core client for 30 sec - exiting
12:55:55 (1304): No heartbeat from core client for 30 sec - exiting
12:55:56 (1304): No heartbeat from core client for 30 sec - exiting
12:55:57 (1304): No heartbeat from core client for 30 sec - exiting
12:55:58 (1304): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
12:55:59 (1304): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:03:22 (1184): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
05:11:36 (2956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2656, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2656, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2656, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2656, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2656, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2656, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
03 Dec 2011 05:45:35  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  725,760   1,091,595       1.5041
02 Dec 2011 16:34:03  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  699,840   1,053,980       1.5060
02 Dec 2011 01:09:53  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  673,920   1,017,320       1.5096
01 Dec 2011 06:35:01  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  648,000   978,423         1.5099
30 Nov 2011 13:07:51  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  622,080   940,433         1.5118
29 Nov 2011 07:51:02  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  596,160   902,172         1.5133
28 Nov 2011 18:08:12  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  570,240   862,858         1.5131
26 Nov 2011 23:48:01  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  544,320   821,862         1.5099
25 Nov 2011 01:08:53  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  518,400   783,659         1.5117
24 Nov 2011 04:20:26  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  492,480   743,356         1.5094
23 Nov 2011 15:08:50  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  466,560   703,295         1.5074
23 Nov 2011 02:21:04  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  440,640   664,949         1.5091
21 Nov 2011 22:10:52  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  414,720   626,978         1.5118
20 Nov 2011 11:28:25  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  388,800   588,462         1.5135
19 Nov 2011 22:25:21  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  362,880   549,039         1.5130
18 Nov 2011 23:41:01  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  336,960   508,976         1.5105
18 Nov 2011 09:29:22  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  311,040   468,453         1.5061
17 Nov 2011 20:19:48  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  285,120   428,687         1.5035
17 Nov 2011 06:07:39  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  259,200   388,317         1.4981
16 Nov 2011 15:22:08  964976   13593277   hadcm3n_yaxj_1900_40_007520521_2  233,280   347,778         1.4908
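
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The short Python sketch below checks that reading against the newest and oldest rows of the table. The function name avg_sec_per_timestep is hypothetical, and the formula is inferred from the table values rather than taken from CPDN documentation.

```python
def avg_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this mirrors how the Average (sec/TS) column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, reported average), copied from the table above
rows = [
    (725_760, 1_091_595, 1.5041),  # 03 Dec 2011 05:45:35
    (233_280, 347_778, 1.4908),    # 16 Nov 2011 15:22:08
]

for timestep, cpu_time_sec, reported in rows:
    computed = avg_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep={timestep} computed={computed} reported={reported}")
```

Both computed values match the reported column, which supports the interpretation for these two rows; the remaining rows can be checked the same way.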