climateprediction.net home page
Task 13537277

Task 13537277

Name hadcm3n_yhxr_1900_40_007515804_0
Workunit 7713279
Created 28 Oct 2011, 12:43:52 UTC
Sent 23 Nov 2011, 10:29:59 UTC
Report deadline 22 Feb 2012, 17:57:10 UTC
Received 22 Dec 2011, 19:37:28 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 975606
Run time 15 days 3 hours 52 min 58 sec
CPU time 13 days 2 hours 18 min 3 sec
Validate state Invalid
Credit 5,909.76
Device peak FLOPS 2.75 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
forrtl: The requested operation cannot be performed on a file with a user-mapped section open.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3944, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:22:25 (3896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
forrtl: The requested operation cannot be performed on a file with a user-mapped section open.

CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4044, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4044, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4044, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Dec 2011 19:53:43 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 492,480 1,124,062 2.2825
20 Dec 2011 14:10:39 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 466,560 1,065,247 2.2832
18 Dec 2011 19:12:26 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 440,640 1,005,911 2.2828
17 Dec 2011 13:39:56 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 414,720 947,353 2.2843
15 Dec 2011 20:01:19 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 388,800 887,380 2.2824
14 Dec 2011 13:54:19 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 362,880 828,181 2.2822
12 Dec 2011 21:30:21 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 336,960 769,645 2.2841
11 Dec 2011 12:32:41 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 311,040 710,342 2.2838
09 Dec 2011 20:22:42 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 285,120 651,028 2.2833
08 Dec 2011 12:34:09 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 259,200 590,592 2.2785
06 Dec 2011 17:22:47 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 233,280 530,916 2.2759
04 Dec 2011 22:23:19 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 207,360 472,331 2.2778
03 Dec 2011 15:07:47 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 181,440 412,394 2.2729
01 Dec 2011 21:28:50 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 155,520 352,772 2.2683
30 Nov 2011 15:46:03 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 129,600 292,909 2.2601
28 Nov 2011 22:17:26 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 103,680 234,420 2.2610
27 Nov 2011 14:37:14 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 77,760 177,035 2.2767
25 Nov 2011 22:01:32 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 51,840 118,222 2.2805
24 Nov 2011 16:08:33 975606 13537277 hadcm3n_yhxr_1900_40_007515804_0 25,920 59,516 2.2961


©2024 cpdn.org