climateprediction.net home page
Task 13466273

Task 13466273

Name hadcm3n_t3yz_1940_40_007443458_4
Workunit 7640961
Created 7 Oct 2011, 16:22:21 UTC
Sent 7 Oct 2011, 16:22:33 UTC
Report deadline 6 Jan 2012, 23:49:44 UTC
Received 12 Oct 2011, 14:41:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1158664
Run time 4 days 14 hours 12 min 39 sec
CPU time 4 days 14 hours 11 min 13 sec
Validate state Invalid
Credit 3,732.48
Device peak FLOPS 3.49 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
14:34:51 (2776): No heartbeat from core client for 30 sec - exiting
14:34:52 (2776): No heartbeat from core client for 30 sec - exiting
14:34:53 (2776): No heartbeat from core client for 30 sec - exiting
14:34:54 (2776): No heartbeat from core client for 30 sec - exiting
14:34:55 (2776): No heartbeat from core client for 30 sec - exiting
14:34:56 (2776): No heartbeat from core client for 30 sec - exiting
14:34:57 (2776): No heartbeat from core client for 30 sec - exiting
14:34:58 (2776): No heartbeat from core client for 30 sec - exiting
14:34:59 (2776): No heartbeat from core client for 30 sec - exiting
14:35:00 (2776): No heartbeat from core client for 30 sec - exiting
14:35:01 (2776): No heartbeat from core client for 30 sec - exiting
14:35:02 (2776): No heartbeat from core client for 30 sec - exiting
14:35:03 (2776): No heartbeat from core client for 30 sec - exiting
14:35:04 (2776): No heartbeat from core client for 30 sec - exiting
14:35:05 (2776): No heartbeat from core client for 30 sec - exiting
14:35:06 (2776): No heartbeat from core client for 30 sec - exiting
14:35:07 (2776): No heartbeat from core client for 30 sec - exiting
14:35:08 (2776): No heartbeat from core client for 30 sec - exiting
14:35:09 (2776): No heartbeat from core client for 30 sec - exiting
14:35:10 (2776): No heartbeat from core client for 30 sec - exiting
14:35:11 (2776): No heartbeat from core client for 30 sec - exiting
14:35:12 (2776): No heartbeat from core client for 30 sec - exiting
14:35:13 (2776): No heartbeat from core client for 30 sec - exiting
14:35:14 (2776): No heartbeat from core client for 30 sec - exiting
14:35:15 (2776): No heartbeat from core client for 30 sec - exiting
14:35:16 (2776): No heartbeat from core client for 30 sec - exiting
14:35:17 (2776): No heartbeat from core client for 30 sec - exiting
14:35:18 (2776): No heartbeat from core client for 30 sec - exiting
14:35:19 (2776): No heartbeat from core client for 30 sec - exiting
14:35:20 (2776): No heartbeat from core client for 30 sec - exiting
14:35:21 (2776): No heartbeat from core client for 30 sec - exiting
14:35:22 (2776): No heartbeat from core client for 30 sec - exiting
14:35:23 (2776): No heartbeat from core client for 30 sec - exiting
14:35:24 (2776): No heartbeat from core client for 30 sec - exiting
14:35:25 (2776): No heartbeat from core client for 30 sec - exiting
14:35:26 (2776): No heartbeat from core client for 30 sec - exiting
14:35:27 (2776): No heartbeat from core client for 30 sec - exiting
14:35:28 (2776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:37:46 (1504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=996, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=996, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=996, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=996, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=996, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=996, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Oct 2011 10:57:25 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 311,040 389,007 1.2507
12 Oct 2011 01:48:55 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 285,120 354,651 1.2439
11 Oct 2011 14:41:02 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 259,200 317,350 1.2243
11 Oct 2011 04:19:56 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 233,280 280,335 1.2017
10 Oct 2011 18:51:22 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 207,360 246,475 1.1886
10 Oct 2011 11:02:24 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 181,440 218,290 1.2031
10 Oct 2011 02:43:05 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 155,520 188,523 1.2122
09 Oct 2011 18:39:33 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 129,600 159,433 1.2302
09 Oct 2011 11:18:55 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 103,680 131,063 1.2641
09 Oct 2011 02:35:58 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 77,760 102,053 1.3124
08 Oct 2011 18:53:29 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 51,840 71,427 1.3778
08 Oct 2011 08:43:30 1158664 13466273 hadcm3n_t3yz_1940_40_007443458_4 25,920 36,381 1.4036


©2024 climateprediction.net