climateprediction.net home page
Task 13294609

Task 13294609

Name hadcm3n_p0gh_1940_40_007422613_2
Workunit 7620248
Created 25 Aug 2011, 7:27:50 UTC
Sent 25 Aug 2011, 7:32:33 UTC
Report deadline 24 Nov 2011, 14:59:44 UTC
Received 16 Oct 2011, 1:22:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1158390
Run time 13 days 14 hours 43 min 16 sec
CPU time 12 days 17 hours 58 min 11 sec
Validate state Invalid
Credit 9,020.16
Device peak FLOPS 2.47 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:52:54 (4188): No heartbeat from core client for 30 sec - exiting
22:52:55 (4188): No heartbeat from core client for 30 sec - exiting
22:52:56 (4188): No heartbeat from core client for 30 sec - exiting
22:52:57 (4188): No heartbeat from core client for 30 sec - exiting
22:52:58 (4188): No heartbeat from core client for 30 sec - exiting
22:52:59 (4188): No heartbeat from core client for 30 sec - exiting
22:53:00 (4188): No heartbeat from core client for 30 sec - exiting
22:53:01 (4188): No heartbeat from core client for 30 sec - exiting
22:53:02 (4188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:36:04 (3640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:50:56 (3648): No heartbeat from core client for 30 sec - exiting
13:50:57 (3648): No heartbeat from core client for 30 sec - exiting
13:50:58 (3648): No heartbeat from core client for 30 sec - exiting
13:50:59 (3648): No heartbeat from core client for 30 sec - exiting
13:51:00 (3648): No heartbeat from core client for 30 sec - exiting
13:51:01 (3648): No heartbeat from core client for 30 sec - exiting
13:51:03 (3648): No heartbeat from core client for 30 sec - exiting
13:51:04 (3648): No heartbeat from core client for 30 sec - exiting
13:51:05 (3648): No heartbeat from core client for 30 sec - exiting
13:51:06 (3648): No heartbeat from core client for 30 sec - exiting
13:51:07 (3648): No heartbeat from core client for 30 sec - exiting
13:51:08 (3648): No heartbeat from core client for 30 sec - exiting
13:51:09 (3648): No heartbeat from core client for 30 sec - exiting
13:51:10 (3648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
11:24:35 (3344): No heartbeat from core client for 30 sec - exiting
11:24:36 (3344): No heartbeat from core client for 30 sec - exiting
11:24:37 (3344): No heartbeat from core client for 30 sec - exiting
11:24:38 (3344): No heartbeat from core client for 30 sec - exiting
11:24:39 (3344): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:27:05 (4012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
07:28:21 (4904): No heartbeat from core client for 30 sec - exiting
07:28:23 (4904): No heartbeat from core client for 30 sec - exiting
07:28:24 (4904): No heartbeat from core client for 30 sec - exiting
07:28:25 (4904): No heartbeat from core client for 30 sec - exiting
07:28:26 (4904): No heartbeat from core client for 30 sec - exiting
07:28:27 (4904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:25:25 (3204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:40:47 (3592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2688, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2688, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2688, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2688, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2688, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2688, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Oct 2011 21:11:37 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 751,680 1,097,914 1.4606
14 Oct 2011 16:01:29 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 725,760 1,059,031 1.4592
13 Oct 2011 19:16:05 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 699,840 1,020,415 1.4581
11 Oct 2011 23:47:13 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 673,920 981,839 1.4569
04 Oct 2011 02:40:53 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 648,000 943,414 1.4559
21 Sep 2011 17:27:47 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 622,080 905,729 1.4560
21 Sep 2011 03:38:27 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 596,160 868,033 1.4560
18 Sep 2011 17:55:11 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 570,240 829,948 1.4554
17 Sep 2011 19:56:32 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 544,320 792,063 1.4551
15 Sep 2011 19:26:48 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 518,400 754,339 1.4551
14 Sep 2011 16:29:39 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 492,480 715,602 1.4531
13 Sep 2011 03:06:42 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 466,560 677,371 1.4518
12 Sep 2011 13:08:47 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 440,640 639,237 1.4507
09 Sep 2011 17:49:50 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 414,720 601,476 1.4503
08 Sep 2011 22:14:36 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 388,800 563,239 1.4487
07 Sep 2011 23:40:19 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 362,880 525,349 1.4477
06 Sep 2011 22:16:30 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 336,960 487,127 1.4457
05 Sep 2011 21:57:19 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 311,040 449,540 1.4453
05 Sep 2011 06:53:13 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 285,120 412,063 1.4452
04 Sep 2011 20:12:54 1158390 13294609 hadcm3n_p0gh_1940_40_007422613_2 259,200 374,609 1.4453


©2024 cpdn.org