climateprediction.net home page
Task 13626442

Task 13626442

Name hadcm3n_o4q0_1940_40_007543329_0
Workunit 7740561
Created 10 Nov 2011, 1:18:41 UTC
Sent 10 Nov 2011, 1:21:21 UTC
Report deadline 9 Feb 2012, 8:48:32 UTC
Received 17 Nov 2011, 18:39:53 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 962214
Run time 5 days 4 hours 55 min 56 sec
CPU time 4 days 20 hours 35 min 23 sec
Validate state Invalid
Credit 3,421.44
Device peak FLOPS 2.84 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.26</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:19:46 (4048): No heartbeat from core client for 30 sec - exiting
15:19:47 (4048): No heartbeat from core client for 30 sec - exiting
15:19:48 (4048): No heartbeat from core client for 30 sec - exiting
15:19:49 (4048): No heartbeat from core client for 30 sec - exiting
15:19:50 (4048): No heartbeat from core client for 30 sec - exiting
15:19:51 (4048): No heartbeat from core client for 30 sec - exiting
15:19:52 (4048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Ocean Restart file copy failed on o4q0ko.dae9bm0
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3436, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Nov 2011 23:56:04 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 285,120 390,888 1.3710
16 Nov 2011 08:19:46 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 259,200 353,948 1.3655
15 Nov 2011 17:26:51 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 233,280 317,081 1.3592
15 Nov 2011 17:26:51 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 207,360 282,067 1.3603
15 Nov 2011 17:26:51 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 181,440 247,216 1.3625
15 Nov 2011 17:26:51 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 155,520 211,765 1.3617
15 Nov 2011 17:26:51 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 129,600 176,929 1.3652
15 Nov 2011 17:26:51 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 103,680 142,315 1.3726
15 Nov 2011 17:26:51 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 77,760 107,278 1.3796
15 Nov 2011 17:26:51 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 51,840 72,599 1.4004
15 Nov 2011 17:26:51 962214 13626442 hadcm3n_o4q0_1940_40_007543329_0 25,920 36,907 1.4239


©2024 cpdn.org