climateprediction.net home page
Task 17354207

Task 17354207

Name hadcm3n_xae9_1940_40_009149767_0
Workunit 9280103
Created 6 Nov 2014, 14:38:12 UTC
Sent 6 Nov 2014, 14:38:25 UTC
Report deadline 5 Feb 2015, 22:05:36 UTC
Received 7 Dec 2014, 20:15:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1343946
Run time 15 days 20 hours 57 min 38 sec
CPU time 14 days 9 hours 45 min 35 sec
Validate state Invalid
Credit 1,866.24
Device peak FLOPS 2.24 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.4.27</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19524, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:23:43 (3644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:30:12 (4924): No heartbeat from core client for 30 sec - exiting
20:30:13 (4924): No heartbeat from core client for 30 sec - exiting
20:30:14 (4924): No heartbeat from core client for 30 sec - exiting
20:30:15 (4924): No heartbeat from core client for 30 sec - exiting
20:30:16 (4924): No heartbeat from core client for 30 sec - exiting
20:30:17 (4924): No heartbeat from core client for 30 sec - exiting
20:30:18 (4924): No heartbeat from core client for 30 sec - exiting
20:30:19 (4924): No heartbeat from core client for 30 sec - exiting
20:30:20 (4924): No heartbeat from core client for 30 sec - exiting
20:30:21 (4924): No heartbeat from core client for 30 sec - exiting
20:30:22 (4924): No heartbeat from core client for 30 sec - exiting
20:30:23 (4924): No heartbeat from core client for 30 sec - exiting
20:30:24 (4924): No heartbeat from core client for 30 sec - exiting
20:30:25 (4924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5464, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7140, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
22:19:29 (4036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:42:46 (4996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
15:06:23 (5896): No heartbeat from core client for 30 sec - exiting
15:07:21 (5896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1
Model crash detected, will try to restart...
13:57:28 (3008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:57:29 (3008): No heartbeat from core client for 30 sec - exiting
13:57:30 (3008): No heartbeat from core client for 30 sec - exiting
13:57:32 (3008): No heartbeat from core client for 30 sec - exiting
13:57:33 (3008): No heartbeat from core client for 30 sec - exiting
13:57:34 (3008): No heartbeat from core client for 30 sec - exiting
13:57:35 (3008): No heartbeat from core client for 30 sec - exiting
13:57:36 (3008): No heartbeat from core client for 30 sec - exiting
13:57:37 (3008): No heartbeat from core client for 30 sec - exiting
13:57:38 (3008): No heartbeat from core client for 30 sec - exiting
13:57:39 (3008): No heartbeat from core client for 30 sec - exiting
13:57:40 (3008): No heartbeat from core client for 30 sec - exiting
13:57:41 (3008): No heartbeat from core client for 30 sec - exiting
13:57:42 (3008): No heartbeat from core client for 30 sec - exiting
13:57:44 (3008): No heartbeat from core client for 30 sec - exiting
13:57:45 (3008): No heartbeat from core client for 30 sec - exiting
13:57:46 (3008): No heartbeat from core client for 30 sec - exiting
13:57:47 (3008): No heartbeat from core client for 30 sec - exiting
13:57:48 (3008): No heartbeat from core client for 30 sec - exiting
13:58:35 (3340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5124, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5124, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5124, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5124, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
19 Nov 2014 09:16:52 1343946 17354207 hadcm3n_xae9_1940_40_009149767_0 155,520 703,895 4.5261
17 Nov 2014 22:08:35 1343946 17354207 hadcm3n_xae9_1940_40_009149767_0 129,600 585,345 4.5166
16 Nov 2014 11:09:19 1343946 17354207 hadcm3n_xae9_1940_40_009149767_0 103,680 466,821 4.5025
13 Nov 2014 07:24:05 1343946 17354207 hadcm3n_xae9_1940_40_009149767_0 77,760 348,687 4.4841
11 Nov 2014 20:21:34 1343946 17354207 hadcm3n_xae9_1940_40_009149767_0 51,840 231,245 4.4607
10 Nov 2014 10:45:30 1343946 17354207 hadcm3n_xae9_1940_40_009149767_0 25,920 116,892 4.5097


©2024 climateprediction.net