climateprediction.net home page
Task 16098494

Task 16098494

Name hadcm3n_oa8g_1900_40_008468387_3
Workunit 8619226
Created 27 Nov 2013, 11:46:41 UTC
Sent 27 Nov 2013, 11:46:53 UTC
Report deadline 26 Feb 2014, 19:14:04 UTC
Received 21 Dec 2013, 20:28:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1286304
Run time 6 days 20 hours 12 min 43 sec
CPU time 6 days 4 hours 30 min 29 sec
Validate state Invalid
Credit 5,598.72
Device peak FLOPS 3.60 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:34:33 (6648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:38:29 (1212): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:03:33 (10804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:06:34 (8004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:09:41 (11052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:12:48 (10364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:15:55 (3168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:25:06 (9456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:28:10 (13112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:31:16 (1676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:34:20 (8036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:37:28 (11020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:40:36 (8824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:43:41 (10452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:50:38 (1508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:53:41 (10788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:59:54 (680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:02:58 (11692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:05:59 (8476): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:09:04 (13084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:12:07 (1060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:15:06 (10336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:21:12 (10872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:24:21 (8284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:27:29 (13148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:30:32 (11040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:33:37 (7880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:39:37 (1508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:42:39 (10872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:48:40 (11316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:51:42 (12396): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:57:42 (12144): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:54:24 (6976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:11:43 (1820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:16:17 (2056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:10:43 (620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:17:15 (7544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:15:13 (8676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:18:13 (6776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6944, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6944, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6944, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6944, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6944, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6944, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Dec 2013 11:01:57 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 466,560 512,850 1.0992
15 Dec 2013 22:30:57 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 440,640 484,529 1.0996
15 Dec 2013 13:42:42 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 414,720 456,161 1.0999
14 Dec 2013 18:37:06 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 388,800 427,411 1.0993
11 Dec 2013 20:28:45 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 362,880 398,529 1.0982
11 Dec 2013 13:29:55 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 336,960 373,082 1.1072
08 Dec 2013 20:18:55 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 311,040 345,506 1.1108
08 Dec 2013 09:35:08 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 285,120 314,993 1.1048
07 Dec 2013 23:34:54 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 259,200 285,159 1.1002
07 Dec 2013 12:42:53 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 233,280 255,343 1.0946
06 Dec 2013 17:28:22 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 207,360 225,288 1.0865
05 Dec 2013 20:47:18 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 181,440 195,708 1.0786
05 Dec 2013 11:48:21 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 155,520 166,116 1.0681
04 Dec 2013 15:11:11 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 129,600 136,522 1.0534
03 Dec 2013 21:21:37 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 103,680 106,855 1.0306
01 Dec 2013 14:59:35 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 77,760 77,947 1.0024
30 Nov 2013 17:59:55 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 51,840 48,825 0.9418
27 Nov 2013 20:44:34 1286304 16098494 hadcm3n_oa8g_1900_40_008468387_3 25,920 20,036 0.7730


©2024 cpdn.org