Task 16183249

Name hadcm3n_occf_1900_40_008471122_1
Workunit 8621961
Created 31 Dec 2013, 17:05:25 UTC
Sent 31 Dec 2013, 17:05:27 UTC
Report deadline 2 Apr 2014, 0:32:38 UTC
Received 17 Feb 2014, 16:25:21 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 980226
Run time 17 days 8 hours 44 min 53 sec
CPU time 12 days 16 hours 13 min 56 sec
Validate state Invalid
Credit 5,909.76
Device peak FLOPS 1.91 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
15:00:34 (2600): No heartbeat from core client for 30 sec - exiting
15:00:35 (2600): No heartbeat from core client for 30 sec - exiting
15:00:36 (2600): No heartbeat from core client for 30 sec - exiting
15:00:37 (2600): No heartbeat from core client for 30 sec - exiting
15:00:38 (2600): No heartbeat from core client for 30 sec - exiting
15:00:40 (2600): No heartbeat from core client for 30 sec - exiting
15:00:41 (2600): No heartbeat from core client for 30 sec - exiting
15:00:42 (2600): No heartbeat from core client for 30 sec - exiting
15:00:43 (2600): No heartbeat from core client for 30 sec - exiting
15:00:44 (2600): No heartbeat from core client for 30 sec - exiting
15:00:45 (2600): No heartbeat from core client for 30 sec - exiting
15:00:52 (2600): No heartbeat from core client for 30 sec - exiting
15:00:55 (2600): No heartbeat from core client for 30 sec - exiting
15:01:07 (2600): No heartbeat from core client for 30 sec - exiting
15:01:10 (2600): No heartbeat from core client for 30 sec - exiting
15:01:13 (2600): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
ControlCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:07:37 (480): No heartbeat from core client for 30 sec - exiting
16:07:40 (480): No heartbeat from core client for 30 sec - exiting
16:07:44 (480): No heartbeat from core client for 30 sec - exiting
16:07:45 (480): No heartbeat from core client for 30 sec - exiting
16:08:19 (480): No heartbeat from core client for 30 sec - exiting
16:08:22 (480): No heartbeat from core client for 30 sec - exiting
16:08:23 (480): No heartbeat from core client for 30 sec - exiting
16:08:26 (480): No heartbeat from core client for 30 sec - exiting
16:08:28 (480): No heartbeat from core client for 30 sec - exiting
16:08:30 (480): No heartbeat from core client for 30 sec - exiting
16:08:33 (480): No heartbeat from core client for 30 sec - exiting
16:08:36 (480): No heartbeat from core client for 30 sec - exiting
16:08:38 (480): No heartbeat from core client for 30 sec - exiting
16:08:39 (480): No heartbeat from core client for 30 sec - exiting
16:08:41 (480): No heartbeat from core client for 30 sec - exiting
16:08:45 (480): No heartbeat from core client for 30 sec - exiting
16:08:51 (480): No heartbeat from core client for 30 sec - exiting
16:09:01 (480): No heartbeat from core client for 30 sec - exiting
16:09:03 (480): No heartbeat from core client for 30 sec - exiting
16:09:06 (480): No heartbeat from core client for 30 sec - exiting
16:12:49 (480): No heartbeat from core client for 30 sec - exiting
16:12:52 (480): No heartbeat from core client for 30 sec - exiting
16:12:54 (480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:57:32 (5120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:57:37 (5120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:27:27 (4756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
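The stderr text above is long and repetitive, so a tally can make the failure sequence easier to read. The following is a minimal sketch, not part of the original task page: the function name `summarize` and the excerpt string are illustrative, and the number in parentheses on each heartbeat line is assumed to be a process ID.

```python
import re
from collections import Counter

# Short excerpt of the stderr text above; paste the full log for a real tally.
STDERR_EXCERPT = """\
15:00:34 (2600): No heartbeat from core client for 30 sec - exiting
16:07:37 (480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

# Matches lines such as "15:00:34 (2600): No heartbeat from core client ...".
# The parenthesised number is assumed to be a process ID.
HEARTBEAT = re.compile(r"^\d{2}:\d{2}:\d{2} \((\d+)\): No heartbeat from core client")


def summarize(log_text: str) -> dict:
    """Count heartbeat losses per process, Signal 11 exits, and restart attempts."""
    heartbeats_by_pid = Counter()
    signal_11 = 0
    restarts = 0
    gave_up = False
    for line in log_text.splitlines():
        match = HEARTBEAT.match(line)
        if match:
            heartbeats_by_pid[match.group(1)] += 1
        elif line.startswith("Signal 11 received"):
            signal_11 += 1
        elif line.startswith("Model crash detected"):
            restarts += 1
        elif line.startswith("Sorry, too many model crashes"):
            gave_up = True
    return {
        "heartbeats_by_pid": dict(heartbeats_by_pid),
        "signal_11": signal_11,
        "restarts": restarts,
        "gave_up": gave_up,
    }


if __name__ == "__main__":
    print(summarize(STDERR_EXCERPT))
```

Run against the full log, the same counters would separate the earlier heartbeat timeouts from the final run of Signal 11 crashes that ends the task.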
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
14 Feb 2014 18:13:59  980226   16183249   hadcm3n_occf_1900_40_008471122_1  492,480   1,077,695       2.1883
12 Feb 2014 19:55:20  980226   16183249   hadcm3n_occf_1900_40_008471122_1  466,560   1,021,061       2.1885
10 Feb 2014 18:43:09  980226   16183249   hadcm3n_occf_1900_40_008471122_1  440,640   964,139         2.1880
07 Feb 2014 19:02:44  980226   16183249   hadcm3n_occf_1900_40_008471122_1  414,720   908,488         2.1906
05 Feb 2014 18:37:22  980226   16183249   hadcm3n_occf_1900_40_008471122_1  388,800   851,612         2.1904
03 Feb 2014 19:06:23  980226   16183249   hadcm3n_occf_1900_40_008471122_1  362,880   796,032         2.1937
30 Jan 2014 21:50:02  980226   16183249   hadcm3n_occf_1900_40_008471122_1  336,960   738,691         2.1922
30 Jan 2014 17:43:03  980226   16183249   hadcm3n_occf_1900_40_008471122_1  311,040   682,300         2.1936
26 Jan 2014 22:25:41  980226   16183249   hadcm3n_occf_1900_40_008471122_1  285,120   625,491         2.1938
24 Jan 2014 14:27:09  980226   16183249   hadcm3n_occf_1900_40_008471122_1  259,200   569,877         2.1986
21 Jan 2014 21:46:13  980226   16183249   hadcm3n_occf_1900_40_008471122_1  233,280   512,433         2.1966
19 Jan 2014 20:22:33  980226   16183249   hadcm3n_occf_1900_40_008471122_1  207,360   455,063         2.1946
16 Jan 2014 21:34:12  980226   16183249   hadcm3n_occf_1900_40_008471122_1  181,440   397,516         2.1909
14 Jan 2014 20:55:56  980226   16183249   hadcm3n_occf_1900_40_008471122_1  155,520   340,917         2.1921
12 Jan 2014 19:09:32  980226   16183249   hadcm3n_occf_1900_40_008471122_1  129,600   284,502         2.1952
10 Jan 2014 21:20:50  980226   16183249   hadcm3n_occf_1900_40_008471122_1  103,680   227,237         2.1917
07 Jan 2014 20:21:56  980226   16183249   hadcm3n_occf_1900_40_008471122_1  77,760    169,741         2.1829
05 Jan 2014 17:23:26  980226   16183249   hadcm3n_occf_1900_40_008471122_1  51,840    113,222         2.1841
02 Jan 2014 19:47:26  980226   16183249   hadcm3n_occf_1900_40_008471122_1  25,920    55,566          2.1438
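
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against three rows copied from the table. The function name `seconds_per_timestep` is illustrative, and the interpretation of the column is an assumption drawn from the numbers rather than something the page states.

```python
# Rows copied from the trickle table above:
# (timestep, CPU time in seconds, reported average in sec/TS).
TRICKLES = [
    (492_480, 1_077_695, 2.1883),
    (259_200, 569_877, 2.1986),
    (51_840, 113_222, 2.1841),
]


def seconds_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps."""
    return cpu_seconds / timestep


if __name__ == "__main__":
    for timestep, cpu_seconds, reported in TRICKLES:
        computed = seconds_per_timestep(timestep, cpu_seconds)
        # The table shows four decimal places, so compare with a small tolerance.
        agrees = abs(computed - reported) < 1e-4
        print(f"{timestep:>7}  computed={computed:.4f}  reported={reported}  agrees={agrees}")
```

The comparison uses a tolerance instead of exact rounding because the page does not say how the reported figure was rounded.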