climateprediction.net home page
Task 15862618

Task 15862618

Name hadcm3n_zkpr_1920_40_008335062_4
Workunit 8485923
Created 25 Jun 2013, 3:36:44 UTC
Sent 25 Jun 2013, 3:49:29 UTC
Report deadline 24 Sep 2013, 11:16:40 UTC
Received 28 Jun 2013, 15:16:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1251442
Run time 1 days 20 hours 10 min 42 sec
CPU time 1 days 11 hours 9 min 54 sec
Validate state Invalid
Credit 622.08
Device peak FLOPS 1.95 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7628, iMonCtr=1
Model crash detected, will try to restart...
10:23:20 (2772): No heartbeat from core client for 30 sec - exiting
10:23:21 (2772): No heartbeat from core client for 30 sec - exiting
10:23:22 (2772): No heartbeat from core client for 30 sec - exiting
10:23:23 (2772): No heartbeat from core client for 30 sec - exiting
10:23:24 (2772): No heartbeat from core client for 30 sec - exiting
10:23:25 (2772): No heartbeat from core client for 30 sec - exiting
10:23:26 (2772): No heartbeat from core client for 30 sec - exiting
10:23:27 (2772): No heartbeat from core client for 30 sec - exiting
10:23:28 (2772): No heartbeat from core client for 30 sec - exiting
10:23:29 (2772): No heartbeat from core client for 30 sec - exiting
10:23:31 (2772): No heartbeat from core client for 30 sec - exiting
10:23:32 (2772): No heartbeat from core client for 30 sec - exiting
10:23:33 (2772): No heartbeat from core client for 30 sec - exiting
10:23:34 (2772): No heartbeat from core client for 30 sec - exiting
10:23:35 (2772): No heartbeat from core client for 30 sec - exiting
10:23:36 (2772): No heartbeat from core client for 30 sec - exiting
10:23:37 (2772): No heartbeat from core client for 30 sec - exiting
10:23:38 (2772): No heartbeat from core client for 30 sec - exiting
10:23:39 (2772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4720, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2756, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2756, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4412, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4412, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Jun 2013 13:15:28 1251442 15862618 hadcm3n_zkpr_1920_40_008335062_4 51,840 125,356 2.4181
26 Jun 2013 18:02:42 1251442 15862618 hadcm3n_zkpr_1920_40_008335062_4 25,920 62,427 2.4084


©2024 cpdn.org