climateprediction.net home page
Task 16585777

Task 16585777

Name hadcm3n_88xb_1980_40_008720986_0
Workunit 8866964
Created 23 Apr 2014, 12:29:53 UTC
Sent 5 May 2014, 13:16:57 UTC
Report deadline 4 Aug 2014, 20:44:08 UTC
Received 28 May 2014, 6:57:17 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1310842
Run time 8 days 11 hours 8 min 46 sec
CPU time 6 days 21 hours 36 min 33 sec
Validate state Invalid
Credit 2,488.32
Device peak FLOPS 1.77 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8844, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4652, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4652, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4652, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4652, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4652, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4364, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
CCController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3184, iMonCtr=1
Model crash detected, will try to restart...
15:57:10 (5760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2252, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2252, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2252, iMonCtr=1
Model crash detected, will try to restart...
19:09:10 (2596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:09:11 (2596): No heartbeat from core client for 30 sec - exiting
19:21:23 (6792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7076, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4056, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4056, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4452, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1276, iMonCtr=1
Model crash detected, will try to restart...
18:55:20 (4700): No heartbeat from core client for 30 sec - exiting
18:55:22 (4700): No heartbeat from core client for 30 sec - exiting
18:55:23 (4700): No heartbeat from core client for 30 sec - exiting
18:55:24 (4700): No heartbeat from core client for 30 sec - exiting
18:55:28 (4700): No heartbeat from core client for 30 sec - exiting
18:55:29 (4700): No heartbeat from core client for 30 sec - exiting
18:55:30 (4700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
forrtl: The requested operation cannot be performed on a file with a user-mapped section open.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=1
Model crash detected, will try to restart...
11:52:50 (684): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2468, iMonCtr=1
Model crash detected, will try to restart...
19:00:35 (4720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:10:57 (4624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 May 2014 13:12:21 1310842 16585777 hadcm3n_88xb_1980_40_008720986_0 207,360 592,130 2.8556
25 May 2014 05:10:52 1310842 16585777 hadcm3n_88xb_1980_40_008720986_0 181,440 510,597 2.8141
20 May 2014 12:30:26 1310842 16585777 hadcm3n_88xb_1980_40_008720986_0 155,520 437,236 2.8114
18 May 2014 09:21:44 1310842 16585777 hadcm3n_88xb_1980_40_008720986_0 129,600 357,329 2.7572
16 May 2014 09:17:02 1310842 16585777 hadcm3n_88xb_1980_40_008720986_0 103,680 279,663 2.6974
13 May 2014 13:10:26 1310842 16585777 hadcm3n_88xb_1980_40_008720986_0 77,760 206,243 2.6523
10 May 2014 12:56:13 1310842 16585777 hadcm3n_88xb_1980_40_008720986_0 51,840 137,876 2.6596
07 May 2014 04:35:28 1310842 16585777 hadcm3n_88xb_1980_40_008720986_0 25,920 64,716 2.4968


©2024 cpdn.org