climateprediction.net home page
Task 15874080

Task 15874080

Name hadcm3n_n408_1920_40_008378166_1
Workunit 8529025
Created 30 Jun 2013, 11:52:39 UTC
Sent 30 Jun 2013, 12:56:38 UTC
Report deadline 29 Sep 2013, 20:23:49 UTC
Received 21 Jul 2013, 11:14:28 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1194844
Run time 2 days 21 hours 22 min 53 sec
CPU time 2 days 19 hours 6 min 25 sec
Validate state Invalid
Credit 2,488.32
Device peak FLOPS 3.61 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:24:56 (13692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:26:29 (14728): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:28:32 (15112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:28:40 (15112): No heartbeat from core client for 30 sec - exiting
18:28:41 (15112): No heartbeat from core client for 30 sec - exiting
18:31:47 (10812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:33:02 (10864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:33:39 (10864): No heartbeat from core client for 30 sec - exiting
18:33:40 (10864): No heartbeat from core client for 30 sec - exiting
18:33:41 (10864): No heartbeat from core client for 30 sec - exiting
18:34:17 (15444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:35:23 (15532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:35:56 (13708): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:37:30 (9080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:38:41 (8604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:38:44 (8604): No heartbeat from core client for 30 sec - exiting
18:38:45 (8604): No heartbeat from core client for 30 sec - exiting
18:40:36 (14392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:40:52 (14392): No heartbeat from core client for 30 sec - exiting
18:40:53 (14392): No heartbeat from core client for 30 sec - exiting
18:40:54 (14392): No heartbeat from core client for 30 sec - exiting
18:41:55 (17156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:43:06 (16980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:44:53 (15080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:45:43 (17252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:45:49 (17252): No heartbeat from core client for 30 sec - exiting
18:45:50 (17252): No heartbeat from core client for 30 sec - exiting
18:47:31 (17356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:28:49 (47532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:33:39 (24760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12660, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Jul 2013 20:33:40 1194844 15874080 hadcm3n_n408_1920_40_008378166_1 207,360 224,976 1.0850
03 Jul 2013 11:25:47 1194844 15874080 hadcm3n_n408_1920_40_008378166_1 181,440 196,944 1.0854
03 Jul 2013 02:03:48 1194844 15874080 hadcm3n_n408_1920_40_008378166_1 155,520 168,782 1.0853
02 Jul 2013 18:02:09 1194844 15874080 hadcm3n_n408_1920_40_008378166_1 129,600 140,469 1.0839
02 Jul 2013 12:07:30 1194844 15874080 hadcm3n_n408_1920_40_008378166_1 103,680 112,053 1.0808
02 Jul 2013 12:02:25 1194844 15874080 hadcm3n_n408_1920_40_008378166_1 77,760 83,720 1.0766
02 Jul 2013 11:49:53 1194844 15874080 hadcm3n_n408_1920_40_008378166_1 51,840 55,529 1.0712
02 Jul 2013 11:24:25 1194844 15874080 hadcm3n_n408_1920_40_008378166_1 25,920 27,635 1.0662


©2024 cpdn.org