Task 13407466

Name: hadcm3n_u3ti_1980_40_007458659_1
Workunit: 7656162
Created: 22 Sep 2011, 15:47:08 UTC
Sent: 22 Sep 2011, 15:53:35 UTC
Report deadline: 22 Dec 2011, 23:20:46 UTC
Received: 1 Oct 2011, 9:51:42 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 22 (0x00000016) Unknown error code
Computer ID: 1156610
Run time: 6 days 11 hours 1 min 54 sec
CPU time: 5 days 12 hours 23 min 36 sec
Validate state: Invalid
Credit: 2,488.32
Device peak FLOPS: 1.90 GFLOPS
Application version: UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
16:31:59 (3880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:32:00 (3880): No heartbeat from core client for 30 sec - exiting
18:38:59 (2456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:11:05 (5080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:33:42 (4216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:33:43 (4216): No heartbeat from core client for 30 sec - exiting
00:59:48 (4232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:59:49 (4232): No heartbeat from core client for 30 sec - exiting
00:59:51 (4232): No heartbeat from core client for 30 sec - exiting
09:24:24 (2368): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:18:45 (5084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:18:47 (5084): No heartbeat from core client for 30 sec - exiting
07:18:48 (5084): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
17:13:00 (2792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:13:01 (2792): No heartbeat from core client for 30 sec - exiting
17:13:02 (2792): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
06:42:27 (2092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:47:22 (2332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
16:11:12 (2240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:11:14 (2240): No heartbeat from core client for 30 sec - exiting
16:11:15 (2240): No heartbeat from core client for 30 sec - exiting
16:11:16 (2240): No heartbeat from core client for 30 sec - exiting
16:11:17 (2240): No heartbeat from core client for 30 sec - exiting
16:11:18 (2240): No heartbeat from core client for 30 sec - exiting
16:11:19 (2240): No heartbeat from core client for 30 sec - exiting
16:11:20 (2240): No heartbeat from core client for 30 sec - exiting
16:11:21 (2240): No heartbeat from core client for 30 sec - exiting
16:11:22 (2240): No heartbeat from core client for 30 sec - exiting
16:11:23 (2240): No heartbeat from core client for 30 sec - exiting
16:11:24 (2240): No heartbeat from core client for 30 sec - exiting
16:11:25 (2240): No heartbeat from core client for 30 sec - exiting
16:11:26 (2240): No heartbeat from core client for 30 sec - exiting
16:11:27 (2240): No heartbeat from core client for 30 sec - exiting
16:11:28 (2240): No heartbeat from core client for 30 sec - exiting
16:11:29 (2240): No heartbeat from core client for 30 sec - exiting
16:11:30 (2240): No heartbeat from core client for 30 sec - exiting
16:11:31 (2240): No heartbeat from core client for 30 sec - exiting
16:11:32 (2240): No heartbeat from core client for 30 sec - exiting
16:11:33 (2240): No heartbeat from core client for 30 sec - exiting
16:11:34 (2240): No heartbeat from core client for 30 sec - exiting
16:11:35 (2240): No heartbeat from core client for 30 sec - exiting
16:11:36 (2240): No heartbeat from core client for 30 sec - exiting
16:11:37 (2240): No heartbeat from core client for 30 sec - exiting
16:11:38 (2240): No heartbeat from core client for 30 sec - exiting
16:14:26 (2984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
07:29:24 (5880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:29:25 (5880): No heartbeat from core client for 30 sec - exiting
09:52:08 (2408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:52:09 (2408): No heartbeat from core client for 30 sec - exiting
17:43:57 (2260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:43:58 (2260): No heartbeat from core client for 30 sec - exiting
17:43:59 (2260): No heartbeat from core client for 30 sec - exiting
17:44:00 (2260): No heartbeat from core client for 30 sec - exiting
17:44:01 (2260): No heartbeat from core client for 30 sec - exiting
17:44:02 (2260): No heartbeat from core client for 30 sec - exiting
17:44:03 (2260): No heartbeat from core client for 30 sec - exiting
17:44:04 (2260): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2264, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2264, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2264, iMonCtr=1
Model crash detected, will try to restart...
17:50:27 (2264): No heartbeat from core client for 30 sec - exiting
17:50:28 (2264): No heartbeat from core client for 30 sec - exiting
17:50:29 (2264): No heartbeat from core client for 30 sec - exiting
17:50:30 (2264): No heartbeat from core client for 30 sec - exiting
17:50:31 (2264): No heartbeat from core client for 30 sec - exiting
17:50:32 (2264): No heartbeat from core client for 30 sec - exiting
17:50:33 (2264): No heartbeat from core client for 30 sec - exiting
17:50:34 (2264): No heartbeat from core client for 30 sec - exiting
17:50:35 (2264): No heartbeat from core client for 30 sec - exiting
17:50:36 (2264): No heartbeat from core client for 30 sec - exiting
17:50:37 (2264): No heartbeat from core client for 30 sec - exiting
17:50:38 (2264): No heartbeat from core client for 30 sec - exiting
17:50:39 (2264): No heartbeat from core client for 30 sec - exiting
17:50:40 (2264): No heartbeat from core client for 30 sec - exiting
17:50:41 (2264): No heartbeat from core client for 30 sec - exiting
17:50:42 (2264): No heartbeat from core client for 30 sec - exiting
17:50:43 (2264): No heartbeat from core client for 30 sec - exiting
17:50:44 (2264): No heartbeat from core client for 30 sec - exiting
17:50:45 (2264): No heartbeat from core client for 30 sec - exiting
17:50:46 (2264): No heartbeat from core client for 30 sec - exiting
17:50:47 (2264): No heartbeat from core client for 30 sec - exiting
17:50:48 (2264): No heartbeat from core client for 30 sec - exiting
17:50:49 (2264): No heartbeat from core client for 30 sec - exiting
17:50:50 (2264): No heartbeat from core client for 30 sec - exiting
17:50:51 (2264): No heartbeat from core client for 30 sec - exiting
17:50:52 (2264): No heartbeat from core client for 30 sec - exiting
17:50:53 (2264): No heartbeat from core client for 30 sec - exiting
17:50:54 (2264): No heartbeat from core client for 30 sec - exiting
17:50:55 (2264): No heartbeat from core client for 30 sec - exiting
17:50:56 (2264): No heartbeat from core client for 30 sec - exiting
17:50:57 (2264): No heartbeat from core client for 30 sec - exiting
17:50:58 (2264): No heartbeat from core client for 30 sec - exiting
17:50:59 (2264): No heartbeat from core client for 30 sec - exiting
17:51:01 (2264): No heartbeat from core client for 30 sec - exiting
17:51:02 (2264): No heartbeat from core client for 30 sec - exiting
17:51:03 (2264): No heartbeat from core client for 30 sec - exiting
17:51:04 (2264): No heartbeat from core client for 30 sec - exiting
17:51:05 (2264): No heartbeat from core client for 30 sec - exiting
17:51:06 (2264): No heartbeat from core client for 30 sec - exiting
17:51:07 (2264): No heartbeat from core client for 30 sec - exiting
17:51:08 (2264): No heartbeat from core client for 30 sec - exiting
17:51:09 (2264): No heartbeat from core client for 30 sec - exiting
17:51:10 (2264): No heartbeat from core client for 30 sec - exiting
17:51:11 (2264): No heartbeat from core client for 30 sec - exiting
17:51:12 (2264): No heartbeat from core client for 30 sec - exiting
17:51:13 (2264): No heartbeat from core client for 30 sec - exiting
17:51:14 (2264): No heartbeat from core client for 30 sec - exiting
17:51:15 (2264): No heartbeat from core client for 30 sec - exiting
17:51:16 (2264): No heartbeat from core client for 30 sec - exiting
17:51:17 (2264): No heartbeat from core client for 30 sec - exiting
17:51:18 (2264): No heartbeat from core client for 30 sec - exiting
17:51:19 (2264): No heartbeat from core client for 30 sec - exiting
17:51:20 (2264): No heartbeat from core client for 30 sec - exiting
17:51:21 (2264): No heartbeat from core client for 30 sec - exiting
17:51:22 (2264): No heartbeat from core client for 30 sec - exiting
17:51:23 (2264): No heartbeat from core client for 30 sec - exiting
17:51:24 (2264): No heartbeat from core client for 30 sec - exiting
17:51:25 (2264): No heartbeat from core client for 30 sec - exiting
17:51:26 (2264): No heartbeat from core client for 30 sec - exiting
17:51:27 (2264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:51:28 (2264): No heartbeat from core client for 30 sec - exiting
17:51:29 (2264): No heartbeat from core client for 30 sec - exiting
17:51:30 (2264): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2256, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2256, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
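The stderr output above is long and repetitive, so a tally can make it easier to read. The following is a minimal sketch, not part of the original page: it assumes the number in parentheses on each timestamped line is a process ID, and it runs on a handful of non-contiguous lines copied from the log rather than on the full text. The names `SAMPLE`, `HEARTBEAT`, and `summarize` are hypothetical.

```python
import re
from collections import Counter

# A few non-contiguous lines copied verbatim from the stderr output above.
SAMPLE = """\
16:31:59 (3880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:32:00 (3880): No heartbeat from core client for 30 sec - exiting
18:38:59 (2456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

# Timestamped heartbeat-loss line; the parenthesized number is assumed to be a PID.
HEARTBEAT = re.compile(
    r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)


def summarize(text):
    """Count heartbeat-loss messages per (assumed) PID and model-crash restarts."""
    per_pid = Counter()
    crashes = 0
    for line in text.splitlines():
        match = HEARTBEAT.match(line)
        if match:
            per_pid[int(match.group(2))] += 1
        elif line.startswith("Model crash detected"):
            crashes += 1
    return per_pid, crashes


per_pid, crashes = summarize(SAMPLE)
for pid, count in per_pid.items():
    print(f"pid {pid}: {count} heartbeat-loss message(s)")
print(f"model crash restarts: {crashes}")
```

Pasting the full stderr text into `SAMPLE` should give the per-process counts for the whole run.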
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
01 Oct 2011 09:49:27 | 1156610 | 13407466 | hadcm3n_u3ti_1980_40_007458659_1 | 207,360 | 447,602 | 2.1586
01 Oct 2011 09:49:27 | 1156610 | 13407466 | hadcm3n_u3ti_1980_40_007458659_1 | 181,440 | 385,866 | 2.1267
01 Oct 2011 09:49:27 | 1156610 | 13407466 | hadcm3n_u3ti_1980_40_007458659_1 | 155,520 | 327,815 | 2.1079
01 Oct 2011 09:49:27 | 1156610 | 13407466 | hadcm3n_u3ti_1980_40_007458659_1 | 129,600 | 273,099 | 2.1072
01 Oct 2011 09:49:27 | 1156610 | 13407466 | hadcm3n_u3ti_1980_40_007458659_1 | 103,680 | 219,230 | 2.1145
01 Oct 2011 09:49:27 | 1156610 | 13407466 | hadcm3n_u3ti_1980_40_007458659_1 | 77,760 | 165,102 | 2.1232
01 Oct 2011 09:49:27 | 1156610 | 13407466 | hadcm3n_u3ti_1980_40_007458659_1 | 51,840 | 111,683 | 2.1544
01 Oct 2011 09:49:27 | 1156610 | 13407466 | hadcm3n_u3ti_1980_40_007458659_1 | 25,920 | 58,728 | 2.2657
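The Average (sec/TS) column in the trickle table appears to be the CPU time divided by the timestep count. The sketch below checks that reading against the eight rows; it assumes the CPU Time figures are cumulative seconds at each trickle, and the names `TRICKLES` and `average_sec_per_timestep` are hypothetical.

```python
# (timestep, CPU seconds, reported average) copied from the trickle table.
TRICKLES = [
    (207360, 447602, 2.1586),
    (181440, 385866, 2.1267),
    (155520, 327815, 2.1079),
    (129600, 273099, 2.1072),
    (103680, 219230, 2.1145),
    (77760, 165102, 2.1232),
    (51840, 111683, 2.1544),
    (25920, 58728, 2.2657),
]


def average_sec_per_timestep(cpu_seconds, timestep):
    """CPU seconds spent per model timestep (assumed definition of the column)."""
    return cpu_seconds / timestep


# Recompute each average to four decimals and compare with the reported value.
results = [
    (timestep, round(average_sec_per_timestep(cpu, timestep), 4), reported)
    for timestep, cpu, reported in TRICKLES
]

for timestep, computed, reported in results:
    status = "matches" if computed == reported else "differs"
    print(f"{timestep:>7} timesteps: {computed:.4f} sec/TS ({status} reported {reported:.4f})")
```

Under that assumption, every recomputed value agrees with the table to four decimal places.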

