climateprediction.net home page
Task 16060113

Task 16060113

Name hadcm3n_oa8o_1900_40_008468395_1
Workunit 8619234
Created 7 Oct 2013, 11:01:20 UTC
Sent 7 Oct 2013, 11:09:42 UTC
Report deadline 6 Jan 2014, 18:36:53 UTC
Received 17 Oct 2013, 15:38:33 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1246737
Run time 5 days 17 hours 2 min 39 sec
CPU time 5 days 0 hours 58 min 3 sec
Validate state Invalid
Credit 1,555.20
Device peak FLOPS 2.24 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
08:27:49 (4264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:29:20 (3532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1568, iMonCtr=1
Model crash detected, will try to restart...
09:39:41 (3808): No heartbeat from core client for 30 sec - exiting
09:39:42 (3808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:39:43 (3808): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:23:04 (3540): No heartbeat from core client for 30 sec - exiting
08:23:06 (3540): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:25:09 (5068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:43:27 (3632): No heartbeat from core client for 30 sec - exiting
08:43:28 (3632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:06:40 (5220): No heartbeat from core client for 30 sec - exiting
09:06:41 (5220): No heartbeat from core client for 30 sec - exiting
09:06:42 (5220): No heartbeat from core client for 30 sec - exiting
09:06:44 (5220): No heartbeat from core client for 30 sec - exiting
09:06:45 (5220): No heartbeat from core client for 30 sec - exiting
09:06:46 (5220): No heartbeat from core client for 30 sec - exiting
09:06:47 (5220): No heartbeat from core client for 30 sec - exiting
09:06:48 (5220): No heartbeat from core client for 30 sec - exiting
09:06:49 (5220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:07:36 (5764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:08:22 (352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:41:59 (5088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:42:01 (5088): No heartbeat from core client for 30 sec - exiting
13:42:02 (5088): No heartbeat from core client for 30 sec - exiting
13:42:03 (5088): No heartbeat from core client for 30 sec - exiting
13:42:04 (5088): No heartbeat from core client for 30 sec - exiting
13:43:40 (4084): No heartbeat from core client for 30 sec - exiting
13:43:42 (4084): No heartbeat from core client for 30 sec - exiting
13:43:43 (4084): No heartbeat from core client for 30 sec - exiting
13:43:44 (4084): No heartbeat from core client for 30 sec - exiting
13:43:45 (4084): No heartbeat from core client for 30 sec - exiting
13:43:46 (4084): No heartbeat from core client for 30 sec - exiting
13:43:47 (4084): No heartbeat from core client for 30 sec - exiting
13:43:48 (4084): No heartbeat from core client for 30 sec - exiting
13:43:49 (4084): No heartbeat from core client for 30 sec - exiting
13:43:50 (4084): No heartbeat from core client for 30 sec - exiting
13:43:51 (4084): No heartbeat from core client for 30 sec - exiting
13:43:52 (4084): No heartbeat from core client for 30 sec - exiting
13:43:53 (4084): No heartbeat from core client for 30 sec - exiting
13:43:54 (4084): No heartbeat from core client for 30 sec - exiting
13:43:55 (4084): No heartbeat from core client for 30 sec - exiting
13:43:56 (4084): No heartbeat from core client for 30 sec - exiting
13:43:57 (4084): No heartbeat from core client for 30 sec - exiting
13:43:58 (4084): No heartbeat from core client for 30 sec - exiting
13:43:59 (4084): No heartbeat from core client for 30 sec - exiting
13:44:00 (4084): No heartbeat from core client for 30 sec - exiting
13:44:01 (4084): No heartbeat from core client for 30 sec - exiting
13:44:02 (4084): No heartbeat from core client for 30 sec - exiting
13:44:03 (4084): No heartbeat from core client for 30 sec - exiting
13:44:04 (4084): No heartbeat from core client for 30 sec - exiting
13:44:05 (4084): No heartbeat from core client for 30 sec - exiting
13:44:06 (4084): No heartbeat from core client for 30 sec - exiting
13:44:07 (4084): No heartbeat from core client for 30 sec - exiting
13:44:08 (4084): No heartbeat from core client for 30 sec - exiting
13:44:09 (4084): No heartbeat from core client for 30 sec - exiting
13:44:10 (4084): No heartbeat from core client for 30 sec - exiting
13:44:11 (4084): No heartbeat from core client for 30 sec - exiting
13:44:12 (4084): No heartbeat from core client for 30 sec - exiting
13:44:13 (4084): No heartbeat from core client for 30 sec - exiting
13:44:14 (4084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:47:01 (2632): No heartbeat from core client for 30 sec - exiting
13:47:03 (2632): No heartbeat from core client for 30 sec - exiting
13:47:04 (2632): No heartbeat from core client for 30 sec - exiting
13:47:05 (2632): No heartbeat from core client for 30 sec - exiting
13:47:06 (2632): No heartbeat from core client for 30 sec - exiting
13:47:07 (2632): No heartbeat from core client for 30 sec - exiting
13:47:08 (2632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:47:10 (2632): No heartbeat from core client for 30 sec - exiting
10:02:35 (4464): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
10:05:15 (1956): No heartbeat from core client for 30 sec - exiting
10:05:16 (1956): No heartbeat from core client for 30 sec - exiting
10:05:17 (1956): No heartbeat from core client for 30 sec - exiting
10:05:18 (1956): No heartbeat from core client for 30 sec - exiting
10:05:19 (1956): No heartbeat from core client for 30 sec - exiting
10:05:20 (1956): No heartbeat from core client for 30 sec - exiting
10:05:21 (1956): No heartbeat from core client for 30 sec - exiting
10:05:22 (1956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:26:58 (5108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:54:25 (3308): No heartbeat from core client for 30 sec - exiting
20:54:26 (3308): No heartbeat from core client for 30 sec - exiting
20:54:27 (3308): No heartbeat from core client for 30 sec - exiting
20:54:28 (3308): No heartbeat from core client for 30 sec - exiting
20:54:29 (3308): No heartbeat from core client for 30 sec - exiting
20:54:30 (3308): No heartbeat from core client for 30 sec - exiting
20:54:31 (3308): No heartbeat from core client for 30 sec - exiting
20:54:32 (3308): No heartbeat from core client for 30 sec - exiting
20:54:33 (3308): No heartbeat from core client for 30 sec - exiting
20:54:34 (3308): No heartbeat from core client for 30 sec - exiting
20:54:36 (3308): No heartbeat from core client for 30 sec - exiting
20:54:37 (3308): No heartbeat from core client for 30 sec - exiting
20:54:38 (3308): No heartbeat from core client for 30 sec - exiting
20:54:39 (3308): No heartbeat from core client for 30 sec - exiting
20:54:40 (3308): No heartbeat from core client for 30 sec - exiting
20:54:41 (3308): No heartbeat from core client for 30 sec - exiting
20:54:42 (3308): No heartbeat from core client for 30 sec - exiting
20:54:43 (3308): No heartbeat from core client for 30 sec - exiting
20:54:44 (3308): No heartbeat from core client for 30 sec - exiting
20:54:45 (3308): No heartbeat from core client for 30 sec - exiting
20:54:46 (3308): No heartbeat from core client for 30 sec - exiting
20:54:48 (3308): No heartbeat from core client for 30 sec - exiting
20:54:49 (3308): No heartbeat from core client for 30 sec - exiting
20:54:50 (3308): No heartbeat from core client for 30 sec - exiting
20:54:51 (3308): No heartbeat from core client for 30 sec - exiting
20:54:52 (3308): No heartbeat from core client for 30 sec - exiting
20:54:53 (3308): No heartbeat from core client for 30 sec - exiting
20:54:54 (3308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:57:04 (4808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3592, iMonCtr=1
Model crash detected, will try to restart...
08:41:53 (2388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:17:20 (4260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:22:24 (3956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:23:45 (2176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:27:08 (5692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:42:31 (3548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Oct 2013 15:51:25 1246737 16060113 hadcm3n_oa8o_1900_40_008468395_1 129,600 397,019 3.0634
13 Oct 2013 22:11:40 1246737 16060113 hadcm3n_oa8o_1900_40_008468395_1 103,680 320,083 3.0872
12 Oct 2013 11:53:40 1246737 16060113 hadcm3n_oa8o_1900_40_008468395_1 77,760 242,673 3.1208
10 Oct 2013 16:35:47 1246737 16060113 hadcm3n_oa8o_1900_40_008468395_1 51,840 163,646 3.1568
08 Oct 2013 17:55:13 1246737 16060113 hadcm3n_oa8o_1900_40_008468395_1 25,920 77,083 2.9739


©2024 climateprediction.net