climateprediction.net home page
Task 13658902

Task 13658902

Name hadcm3n_p7bs_1940_40_007422404_2
Workunit 7620039
Created 24 Nov 2011, 13:17:06 UTC
Sent 24 Nov 2011, 13:23:15 UTC
Report deadline 23 Feb 2012, 20:50:26 UTC
Received 6 Dec 2011, 6:25:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1134605
Run time 20 hours 1 min 6 sec
CPU time 19 hours 39 min 19 sec
Validate state Invalid
Credit 311.04
Device peak FLOPS 2.74 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
12:22:32 (8636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:24:31 (6452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:26:20 (884): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
12:38:44 (8252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:38:48 (8252): No heartbeat from core client for 30 sec - exiting
12:38:49 (8252): No heartbeat from core client for 30 sec - exiting
12:38:50 (8252): No heartbeat from core client for 30 sec - exiting
12:38:51 (8252): No heartbeat from core client for 30 sec - exiting
12:38:52 (8252): No heartbeat from core client for 30 sec - exiting
12:38:53 (8252): No heartbeat from core client for 30 sec - exiting
12:38:54 (8252): No heartbeat from core client for 30 sec - exiting
12:54:42 (6340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:54:45 (6340): No heartbeat from core client for 30 sec - exiting
12:54:46 (6340): No heartbeat from core client for 30 sec - exiting
12:54:47 (6340): No heartbeat from core client for 30 sec - exiting
12:54:48 (6340): No heartbeat from core client for 30 sec - exiting
12:54:49 (6340): No heartbeat from core client for 30 sec - exiting
12:54:50 (6340): No heartbeat from core client for 30 sec - exiting
12:54:51 (6340): No heartbeat from core client for 30 sec - exiting
12:54:52 (6340): No heartbeat from core client for 30 sec - exiting
12:54:53 (6340): No heartbeat from core client for 30 sec - exiting
12:54:54 (6340): No heartbeat from core client for 30 sec - exiting
12:56:35 (8512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
13:45:31 (7276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:45:36 (7276): No heartbeat from core client for 30 sec - exiting
13:45:37 (7276): No heartbeat from core client for 30 sec - exiting
13:45:38 (7276): No heartbeat from core client for 30 sec - exiting
13:45:39 (7276): No heartbeat from core client for 30 sec - exiting
13:48:00 (5220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:48:50 (6544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:06:44 (4392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:07:08 (4392): No heartbeat from core client for 30 sec - exiting
17:07:09 (4392): No heartbeat from core client for 30 sec - exiting
17:07:10 (4392): No heartbeat from core client for 30 sec - exiting
17:07:11 (4392): No heartbeat from core client for 30 sec - exiting
17:07:12 (4392): No heartbeat from core client for 30 sec - exiting
17:07:13 (4392): No heartbeat from core client for 30 sec - exiting
17:07:14 (4392): No heartbeat from core client for 30 sec - exiting
17:07:15 (4392): No heartbeat from core client for 30 sec - exiting
17:07:16 (4392): No heartbeat from core client for 30 sec - exiting
17:07:17 (4392): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8940, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8940, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8940, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8940, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8940, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8940, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Dec 2011 01:22:22 1134605 13658902 hadcm3n_p7bs_1940_40_007422404_2 25,920 52,109 2.0104


©2024 cpdn.org