Task 15935370

Name hadcm3n_4e6s_1980_40_008410884_0
Workunit 8561740
Created 22 Aug 2013, 7:03:21 UTC
Sent 23 Aug 2013, 19:10:31 UTC
Report deadline 23 Nov 2013, 2:37:42 UTC
Received 5 Oct 2013, 13:50:15 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1289224
Run time 38 days 23 hours 47 min 47 sec
CPU time 27 days 19 hours 26 min 24 sec
Validate state Invalid
Credit 5,909.76
Device peak FLOPS 1.25 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
15:13:13 (4176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:00:10 (5080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3500, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2936, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3868, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3868, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
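The stderr text above mixes heartbeat losses, suspend and quit requests, and crash restarts. A minimal sketch of tallying those events, assuming the log has been saved as plain text. The labels in `MARKERS` and the function name `tally_events` are chosen here for illustration and are not part of BOINC or the CPDN application. The excerpt is the final six lines of the log.

```python
from collections import Counter

# Illustrative labels mapped to substrings that appear in the stderr text.
MARKERS = {
    "no_heartbeat": "No heartbeat from core client",
    "suspend_request": "Suspend request from BOINC",
    "quit_request": "Quit request from BOINC",
    "signal_22": "Signal 22 received",
    "crash_detected": "Model crash detected",
    "gave_up": "too many model crashes",
    "boinc_finish": "Called boinc_finish",
}

# Final six lines of the stderr text, copied verbatim.
EXCERPT = """\
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
"""


def tally_events(log_text):
    """Count how many lines contain each marker substring."""
    counts = Counter()
    for line in log_text.splitlines():
        for label, marker in MARKERS.items():
            if marker in line:
                counts[label] += 1
    return counts


if __name__ == "__main__":
    for label, count in sorted(tally_events(EXCERPT).items()):
        print(f"{label}: {count}")
```

Run against the whole stderr text, the same function gives a quick per-event count, which makes it easier to see how many restarts preceded the final "too many model crashes" line.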
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
02 Oct 2013 11:36:44  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   492,480       2,377,876            4.8284
30 Sep 2013 14:33:51  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   466,560       2,252,690            4.8283
28 Sep 2013 17:23:24  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   440,640       2,127,297            4.8277
26 Sep 2013 15:29:16  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   414,720       2,001,906            4.8271
25 Sep 2013 09:21:52  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   388,800       1,875,343            4.8234
23 Sep 2013 10:03:41  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   362,880       1,745,934            4.8113
20 Sep 2013 02:42:06  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   336,960       1,618,582            4.8035
18 Sep 2013 05:41:24  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   311,040       1,492,625            4.7988
16 Sep 2013 08:51:53  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   285,120       1,366,675            4.7933
14 Sep 2013 12:44:04  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   259,200       1,241,376            4.7893
12 Sep 2013 07:08:56  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   233,280       1,114,013            4.7754
10 Sep 2013 01:16:11  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   207,360         986,095            4.7555
07 Sep 2013 18:11:53  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   181,440         860,309            4.7416
05 Sep 2013 16:46:13  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   155,520         736,357            4.7348
03 Sep 2013 14:22:50  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   129,600         613,593            4.7345
01 Sep 2013 05:23:01  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0   103,680         487,818            4.7050
30 Aug 2013 00:22:37  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0    77,760         365,548            4.7010
28 Aug 2013 01:51:03  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0    51,840         245,099            4.7280
26 Aug 2013 05:33:25  1289224  15935370   hadcm3n_4e6s_1980_40_008410884_0    25,920         124,781            4.8141
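
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep reached, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table. The function name `average_sec_per_timestep` is chosen here for illustration, and the formula is inferred from the numbers rather than taken from project documentation.

```python
# Rows copied from the trickle table:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (492_480, 2_377_876, 4.8284),  # 02 Oct 2013
    (259_200, 1_241_376, 4.7893),  # 14 Sep 2013
    (25_920, 124_781, 4.8141),     # 26 Aug 2013
]


def average_sec_per_timestep(cpu_seconds, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_seconds / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_seconds, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_seconds, timestep)
        print(f"timestep {timestep:>7}: computed {computed:.4f}, reported {reported:.4f}")
```

Consecutive rows in the table are 25,920 timesteps apart, so the same division applied to the difference between two rows gives the per-interval rate rather than the cumulative one.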


©2024 climateprediction.net