climateprediction.net home page
Task 16064810

Task 16064810

Name hadcm3n_o3o7_1980_40_008400252_2
Workunit 8551108
Created 10 Oct 2013, 20:25:09 UTC
Sent 10 Oct 2013, 20:29:20 UTC
Report deadline 10 Jan 2014, 3:56:31 UTC
Received 30 Jun 2014, 19:22:08 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1095341
Run time 2 days 5 hours 38 min 22 sec
CPU time 2 days 4 hours 49 min 3 sec
Validate state Invalid
Credit 933.12
Device peak FLOPS 2.02 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
12:16:36 (5836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:16:38 (5836): No heartbeat from core client for 30 sec - exiting
12:16:39 (5836): No heartbeat from core client for 30 sec - exiting
12:35:35 (9544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:51:34 (8140): No heartbeat from core client for 30 sec - exiting
12:51:40 (8140): No heartbeat from core client for 30 sec - exiting
12:51:41 (8140): No heartbeat from core client for 30 sec - exiting
12:51:42 (8140): No heartbeat from core client for 30 sec - exiting
12:51:43 (8140): No heartbeat from core client for 30 sec - exiting
12:51:45 (8140): No heartbeat from core client for 30 sec - exiting
12:51:46 (8140): No heartbeat from core client for 30 sec - exiting
12:51:47 (8140): No heartbeat from core client for 30 sec - exiting
12:51:48 (8140): No heartbeat from core client for 30 sec - exiting
12:51:49 (8140): No heartbeat from core client for 30 sec - exiting
12:51:50 (8140): No heartbeat from core client for 30 sec - exiting
12:51:51 (8140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:51:52 (8140): No heartbeat from core client for 30 sec - exiting
12:51:53 (8140): No heartbeat from core client for 30 sec - exiting
12:51:54 (8140): No heartbeat from core client for 30 sec - exiting
13:21:05 (3164): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:21:19 (3164): No heartbeat from core client for 30 sec - exiting
13:21:20 (3164): No heartbeat from core client for 30 sec - exiting
13:21:22 (3164): No heartbeat from core client for 30 sec - exiting
13:21:23 (3164): No heartbeat from core client for 30 sec - exiting
13:21:24 (3164): No heartbeat from core client for 30 sec - exiting
13:21:25 (3164): No heartbeat from core client for 30 sec - exiting
13:21:26 (3164): No heartbeat from core client for 30 sec - exiting
13:21:27 (3164): No heartbeat from core client for 30 sec - exiting
13:21:28 (3164): No heartbeat from core client for 30 sec - exiting
13:21:29 (3164): No heartbeat from core client for 30 sec - exiting
13:31:19 (11176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:31:20 (11176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6260, iMonCtr=1
Model crash detected, will try to restart...
17:30:37 (6260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5176, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5176, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5176, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8280, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8280, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Oct 2013 22:36:27 1095341 16064810 hadcm3n_o3o7_1980_40_008400252_2 77,760 147,042 1.8910
15 Oct 2013 08:20:40 1095341 16064810 hadcm3n_o3o7_1980_40_008400252_2 51,840 97,452 1.8799
14 Oct 2013 18:32:25 1095341 16064810 hadcm3n_o3o7_1980_40_008400252_2 25,920 47,974 1.8508


©2024 cpdn.org