climateprediction.net home page
Task 16020896

Name hadcm3n_7woe_1980_40_008453633_1
Workunit 8604489
Created 17 Sep 2013, 10:41:58 UTC
Sent 17 Sep 2013, 11:21:07 UTC
Report deadline 17 Dec 2013, 18:48:18 UTC
Received 12 Nov 2013, 7:01:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1237173
Run time 1 days 1 hours 12 min 30 sec
CPU time 17 hours 21 min 15 sec
Validate state Invalid
Credit 311.04
Device peak FLOPS 3.34 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
10:40:54 (72152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:40:55 (72152): No heartbeat from core client for 30 sec - exiting
10:54:19 (71816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:02:12 (74152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:28:02 (74448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:28:04 (74448): No heartbeat from core client for 30 sec - exiting
12:28:05 (74448): No heartbeat from core client for 30 sec - exiting
12:35:17 (74132): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
17:39:39 (5224): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:07:30 (5532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:27:06 (10012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:33:35 (11036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:15:26 (11096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:20:09 (9588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11780, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11780, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11780, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11780, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11780, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11780, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:29:37 (30456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:29:43 (30456): No heartbeat from core client for 30 sec - exiting
16:29:44 (30456): No heartbeat from core client for 30 sec - exiting
16:29:45 (30456): No heartbeat from core client for 30 sec - exiting
16:35:57 (14904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:36:02 (14904): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
19:28:37 (38104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:28:38 (38104): No heartbeat from core client for 30 sec - exiting
19:28:39 (38104): No heartbeat from core client for 30 sec - exiting
19:28:40 (38104): No heartbeat from core client for 30 sec - exiting
19:28:41 (38104): No heartbeat from core client for 30 sec - exiting
19:28:42 (38104): No heartbeat from core client for 30 sec - exiting
19:28:43 (38104): No heartbeat from core client for 30 sec - exiting
19:28:44 (38104): No heartbeat from core client for 30 sec - exiting
21:09:23 (35340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:09:30 (35340): No heartbeat from core client for 30 sec - exiting
21:09:31 (35340): No heartbeat from core client for 30 sec - exiting
21:09:32 (35340): No heartbeat from core client for 30 sec - exiting
21:09:33 (35340): No heartbeat from core client for 30 sec - exiting
21:09:34 (35340): No heartbeat from core client for 30 sec - exiting
21:09:35 (35340): No heartbeat from core client for 30 sec - exiting
21:09:36 (35340): No heartbeat from core client for 30 sec - exiting
21:09:37 (35340): No heartbeat from core client for 30 sec - exiting
21:09:38 (35340): No heartbeat from core client for 30 sec - exiting
21:09:39 (35340): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
21:50:35 (7560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:50:36 (7560): No heartbeat from core client for 30 sec - exiting
21:50:37 (7560): No heartbeat from core client for 30 sec - exiting
21:50:38 (7560): No heartbeat from core client for 30 sec - exiting
21:50:39 (7560): No heartbeat from core client for 30 sec - exiting
21:50:40 (7560): No heartbeat from core client for 30 sec - exiting
21:50:41 (7560): No heartbeat from core client for 30 sec - exiting
22:29:04 (26360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:29:13 (26360): No heartbeat from core client for 30 sec - exiting
22:29:14 (26360): No heartbeat from core client for 30 sec - exiting
22:29:15 (26360): No heartbeat from core client for 30 sec - exiting
22:29:16 (26360): No heartbeat from core client for 30 sec - exiting
22:29:17 (26360): No heartbeat from core client for 30 sec - exiting
22:29:18 (26360): No heartbeat from core client for 30 sec - exiting
22:29:19 (26360): No heartbeat from core client for 30 sec - exiting
23:44:34 (17788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:54:50 (41572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:54:53 (41572): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
00:15:09 (36036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:15:10 (36036): No heartbeat from core client for 30 sec - exiting
00:15:11 (36036): No heartbeat from core client for 30 sec - exiting
00:15:12 (36036): No heartbeat from core client for 30 sec - exiting
00:15:13 (36036): No heartbeat from core client for 30 sec - exiting
00:15:14 (36036): No heartbeat from core client for 30 sec - exiting
00:15:15 (36036): No heartbeat from core client for 30 sec - exiting
00:15:16 (36036): No heartbeat from core client for 30 sec - exiting
00:15:17 (36036): No heartbeat from core client for 30 sec - exiting
00:15:18 (36036): No heartbeat from core client for 30 sec - exiting
00:15:19 (36036): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
01:09:28 (41596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:09:29 (41596): No heartbeat from core client for 30 sec - exiting
01:09:30 (41596): No heartbeat from core client for 30 sec - exiting
01:09:31 (41596): No heartbeat from core client for 30 sec - exiting
01:09:32 (41596): No heartbeat from core client for 30 sec - exiting
01:09:33 (41596): No heartbeat from core client for 30 sec - exiting
01:09:34 (41596): No heartbeat from core client for 30 sec - exiting
01:09:35 (41596): No heartbeat from core client for 30 sec - exiting
01:09:36 (41596): No heartbeat from core client for 30 sec - exiting
01:09:37 (41596): No heartbeat from core client for 30 sec - exiting
01:09:38 (41596): No heartbeat from core client for 30 sec - exiting
02:13:48 (42280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:13:52 (42280): No heartbeat from core client for 30 sec - exiting
02:13:53 (42280): No heartbeat from core client for 30 sec - exiting
02:13:54 (42280): No heartbeat from core client for 30 sec - exiting
02:13:55 (42280): No heartbeat from core client for 30 sec - exiting
02:13:56 (42280): No heartbeat from core client for 30 sec - exiting
02:13:57 (42280): No heartbeat from core client for 30 sec - exiting
02:13:58 (42280): No heartbeat from core client for 30 sec - exiting
02:13:59 (42280): No heartbeat from core client for 30 sec - exiting
02:14:00 (42280): No heartbeat from core client for 30 sec - exiting
02:14:01 (42280): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
02:22:16 (40940): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:22:17 (40940): No heartbeat from core client for 30 sec - exiting
02:22:18 (40940): No heartbeat from core client for 30 sec - exiting
02:22:19 (40940): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
11:15:42 (9480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:15:44 (9480): No heartbeat from core client for 30 sec - exiting
11:29:09 (5052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:34:20 (10144): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
20:37:43 (103392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:41:15 (103624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:41:17 (103624): No heartbeat from core client for 30 sec - exiting
20:41:18 (103624): No heartbeat from core client for 30 sec - exiting
20:41:19 (103624): No heartbeat from core client for 30 sec - exiting
20:41:20 (103624): No heartbeat from core client for 30 sec - exiting
00:41:30 (104420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:47:05 (104992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
23:06:42 (117532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
23:43:14 (108640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:15:26 (114744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=115252, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=115252, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=115252, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=115252, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=115252, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=115252, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
11 Oct 2013 05:28:05  1237173  16020896   hadcm3n_7woe_1980_40_008453633_1  25,920    45,379          1.7507

