climateprediction.net home page
Task 16036595

Task 16036595

Name hadcm3n_o8cl_1900_40_008465944_0
Workunit 8616783
Created 27 Sep 2013, 9:19:33 UTC
Sent 9 Oct 2013, 3:42:58 UTC
Report deadline 8 Jan 2014, 11:10:09 UTC
Received 12 Nov 2013, 7:01:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1237173
Run time 21 hours 27 min 28 sec
CPU time 9 hours 47 min 22 sec
Validate state Invalid
Credit 311.04
Device peak FLOPS 3.30 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
23:06:42 (112244): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:43:14 (105984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:43:18 (105984): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
00:15:27 (114540): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:15:28 (114540): No heartbeat from core client for 30 sec - exiting
00:15:29 (114540): No heartbeat from core client for 30 sec - exiting
00:15:30 (114540): No heartbeat from core client for 30 sec - exiting
00:15:31 (114540): No heartbeat from core client for 30 sec - exiting
00:15:32 (114540): No heartbeat from core client for 30 sec - exiting
00:15:33 (114540): No heartbeat from core client for 30 sec - exiting
00:15:34 (114540): No heartbeat from core client for 30 sec - exiting
00:15:35 (114540): No heartbeat from core client for 30 sec - exiting
00:15:36 (114540): No heartbeat from core client for 30 sec - exiting
00:22:13 (110392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:49:53 (114580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:53:55 (115472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:54:14 (115472): No heartbeat from core client for 30 sec - exiting
00:54:15 (115472): No heartbeat from core client for 30 sec - exiting
00:54:16 (115472): No heartbeat from core client for 30 sec - exiting
00:54:17 (115472): No heartbeat from core client for 30 sec - exiting
00:54:18 (115472): No heartbeat from core client for 30 sec - exiting
00:54:19 (115472): No heartbeat from core client for 30 sec - exiting
00:54:20 (115472): No heartbeat from core client for 30 sec - exiting
00:54:21 (115472): No heartbeat from core client for 30 sec - exiting
00:54:22 (115472): No heartbeat from core client for 30 sec - exiting
00:54:23 (115472): No heartbeat from core client for 30 sec - exiting
01:31:12 (101740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:10:50 (109104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:11:02 (109104): No heartbeat from core client for 30 sec - exiting
04:11:03 (109104): No heartbeat from core client for 30 sec - exiting
04:11:04 (109104): No heartbeat from core client for 30 sec - exiting
04:11:05 (109104): No heartbeat from core client for 30 sec - exiting
04:11:06 (109104): No heartbeat from core client for 30 sec - exiting
04:11:07 (109104): No heartbeat from core client for 30 sec - exiting
04:11:08 (109104): No heartbeat from core client for 30 sec - exiting
04:11:09 (109104): No heartbeat from core client for 30 sec - exiting
04:11:10 (109104): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
09:25:07 (105456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:25:25 (105456): No heartbeat from core client for 30 sec - exiting
09:30:47 (104004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:30:51 (104004): No heartbeat from core client for 30 sec - exiting
09:30:52 (104004): No heartbeat from core client for 30 sec - exiting
09:30:53 (104004): No heartbeat from core client for 30 sec - exiting
09:30:55 (104004): No heartbeat from core client for 30 sec - exiting
09:30:56 (104004): No heartbeat from core client for 30 sec - exiting
09:30:57 (104004): No heartbeat from core client for 30 sec - exiting
09:30:58 (104004): No heartbeat from core client for 30 sec - exiting
09:30:59 (104004): No heartbeat from core client for 30 sec - exiting
09:31:00 (104004): No heartbeat from core client for 30 sec - exiting
09:31:01 (104004): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
09:41:09 (110772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:41:10 (110772): No heartbeat from core client for 30 sec - exiting
09:41:11 (110772): No heartbeat from core client for 30 sec - exiting
09:41:12 (110772): No heartbeat from core client for 30 sec - exiting
13:02:20 (105208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:02:25 (105208): No heartbeat from core client for 30 sec - exiting
13:03:10 (107824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:50:08 (107376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:50:21 (107376): No heartbeat from core client for 30 sec - exiting
14:50:23 (107376): No heartbeat from core client for 30 sec - exiting
14:50:24 (107376): No heartbeat from core client for 30 sec - exiting
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Atmos Hold Restart file rename failed on atmos_restart.hold
16:08:14 (112820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:08:15 (112820): No heartbeat from core client for 30 sec - exiting
16:08:16 (112820): No heartbeat from core client for 30 sec - exiting
16:08:17 (112820): No heartbeat from core client for 30 sec - exiting
16:08:18 (112820): No heartbeat from core client for 30 sec - exiting
16:08:19 (112820): No heartbeat from core client for 30 sec - exiting
16:08:20 (112820): No heartbeat from core client for 30 sec - exiting
16:08:22 (112820): No heartbeat from core client for 30 sec - exiting
16:08:23 (112820): No heartbeat from core client for 30 sec - exiting
16:08:24 (112820): No heartbeat from core client for 30 sec - exiting
16:08:25 (112820): No heartbeat from core client for 30 sec - exiting
16:08:26 (112820): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
16:46:02 (104224): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:46:03 (104224): No heartbeat from core client for 30 sec - exiting
16:46:04 (104224): No heartbeat from core client for 30 sec - exiting
16:46:05 (104224): No heartbeat from core client for 30 sec - exiting
16:46:06 (104224): No heartbeat from core client for 30 sec - exiting
16:46:07 (104224): No heartbeat from core client for 30 sec - exiting
16:46:08 (104224): No heartbeat from core client for 30 sec - exiting
16:46:09 (104224): No heartbeat from core client for 30 sec - exiting
16:46:10 (104224): No heartbeat from core client for 30 sec - exiting
16:46:11 (104224): No heartbeat from core client for 30 sec - exiting
16:46:12 (104224): No heartbeat from core client for 30 sec - exiting
16:46:13 (104224): No heartbeat from core client for 30 sec - exiting
16:46:14 (104224): No heartbeat from core client for 30 sec - exiting
16:46:15 (104224): No heartbeat from core client for 30 sec - exiting
16:46:16 (104224): No heartbeat from core client for 30 sec - exiting
16:53:36 (111276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:53:49 (111276): No heartbeat from core client for 30 sec - exiting
16:56:40 (114336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:03:45 (97004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:04:41 (97004): No heartbeat from core client for 30 sec - exiting
17:04:42 (97004): No heartbeat from core client for 30 sec - exiting
17:04:43 (97004): No heartbeat from core client for 30 sec - exiting
17:04:44 (97004): No heartbeat from core client for 30 sec - exiting
17:04:45 (97004): No heartbeat from core client for 30 sec - exiting
17:04:46 (97004): No heartbeat from core client for 30 sec - exiting
17:04:47 (97004): No heartbeat from core client for 30 sec - exiting
19:07:26 (116200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=107412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=107412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=107412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=107412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=107412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=107412, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Nov 2013 07:06:24 1237173 16036595 hadcm3n_o8cl_1900_40_008465944_0 25,920 28,836 1.1125


©2024 climateprediction.net