Task 13104651


Name hadcm3n_ydj1_1900_40_007350391_1
Workunit 7547821
Created 6 Jul 2011, 14:05:43 UTC
Sent 17 Jul 2011, 0:47:33 UTC
Report deadline 16 Oct 2011, 8:14:44 UTC
Received 31 Jul 2011, 0:05:48 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1135618
Run time 10 days 5 hours 48 min 31 sec
CPU time 8 days 5 hours 6 min 7 sec
Validate state Invalid
Credit 1,866.24
Device peak FLOPS 0.98 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
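The run time and CPU time fields above are given as day/hour/minute/second strings, and the exit status is shown in both decimal and hexadecimal. A minimal sketch, assuming only the field formats visible on this page, converts the durations to seconds so the two can be compared, and confirms that the two exit-status notations agree. The names `parse_duration` and `_UNIT_SECONDS` are hypothetical helpers, not part of BOINC.

```python
import re

# Seconds per unit, keyed by the unit stems used on this page (assumed format).
_UNIT_SECONDS = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}


def parse_duration(text):
    """Convert a string such as '10 days 5 hours 48 min 31 sec' to seconds."""
    total = 0
    for amount, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(amount) * _UNIT_SECONDS[unit]
    return total


run_seconds = parse_duration("10 days 5 hours 48 min 31 sec")
cpu_seconds = parse_duration("8 days 5 hours 6 min 7 sec")

# Fraction of wall-clock run time that was spent on the CPU.
cpu_fraction = cpu_seconds / run_seconds

# The exit status is listed as 22 and as 0x00000016; both are the same value.
exit_status = int("0x00000016", 16)

if __name__ == "__main__":
    print(run_seconds, cpu_seconds, round(cpu_fraction, 3), exit_status)
```

For this task the CPU time works out to roughly 80% of the run time.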
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6996, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6996, iMonCtr=1
Model crash detected, will try to restart...
01:15:16 (5044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5164, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5480, iMonCtr=1
Model crash detected, will try to restart...
10:58:34 (5852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2184, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
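The stderr text above repeats a small set of messages: crash detections followed by restarts, missed heartbeats from the core client, signal-driven exits, and the final give-up line. A rough sketch of how such a log could be tallied is shown below. The pattern labels and the `summarize_stderr` helper are hypothetical, and `SAMPLE` is a short excerpt of lines copied from the log, not the full output.

```python
import re
from collections import Counter

# Hypothetical labels mapped to message fragments seen in the stderr above.
PATTERNS = {
    "model_crash": re.compile(r"Model crash detected"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "signal_exit": re.compile(r"Signal \d+ received"),
    "gave_up": re.compile(r"too many model crashes"),
}

# Excerpt of lines copied from the stderr text (not the complete log).
SAMPLE = """\
Model crash detected, will try to restart...
01:15:16 (5044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
"""


def summarize_stderr(text):
    """Count how many lines match each known message pattern."""
    counts = Counter()
    for line in text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
    return counts


if __name__ == "__main__":
    for label, count in summarize_stderr(SAMPLE).items():
        print(label, count)
```

Run against the complete stderr text of this task, the same tally gives 11 crash detections, 2 missed heartbeats, 6 signal exits, and 1 give-up message.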
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
28 Jul 2011 14:52:56  1135618  13104651   hadcm3n_ydj1_1900_40_007350391_1  155,520   681,039         4.3791
26 Jul 2011 15:30:38  1135618  13104651   hadcm3n_ydj1_1900_40_007350391_1  129,600   568,469         4.3863
25 Jul 2011 22:51:16  1135618  13104651   hadcm3n_ydj1_1900_40_007350391_1  103,680   475,225         4.5836
25 Jul 2011 20:56:56  1135618  13104651   hadcm3n_ydj1_1900_40_007350391_1  77,760    359,004         4.6168
25 Jul 2011 19:18:18  1135618  13104651   hadcm3n_ydj1_1900_40_007350391_1  51,840    239,548         4.6209
25 Jul 2011 17:54:39  1135618  13104651   hadcm3n_ydj1_1900_40_007350391_1  25,920    123,875         4.7791
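The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the six rows listed; the `TRICKLES` list and the `average_sec_per_timestep` helper are hypothetical names, and the interpretation of the column is an assumption inferred from the numbers rather than documented behaviour.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
TRICKLES = [
    (155520, 681039, 4.3791),
    (129600, 568469, 4.3863),
    (103680, 475225, 4.5836),
    (77760, 359004, 4.6168),
    (51840, 239548, 4.6209),
    (25920, 123875, 4.7791),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep (assumed column definition)."""
    return cpu_time_sec / timestep


def max_deviation(rows):
    """Largest absolute gap between the computed and reported averages."""
    return max(
        abs(average_sec_per_timestep(cpu, ts) - reported)
        for ts, cpu, reported in rows
    )


if __name__ == "__main__":
    for ts, cpu, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu, ts)
        print(f"{ts:>7} {computed:.4f} (reported {reported})")
```

Each computed value agrees with the reported one to within rounding at four decimal places, which is consistent with the assumed definition.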

