climateprediction.net home page
Task 15465305

Task 15465305

Name hadcm3n_o3e2_2100_40_008254536_0
Workunit 8409660
Created 28 Nov 2012, 13:38:01 UTC
Sent 28 Nov 2012, 13:42:33 UTC
Report deadline 27 Feb 2013, 21:09:44 UTC
Received 6 Dec 2012, 14:20:45 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1257657
Run time 6 days 20 hours 44 min 56 sec
CPU time 6 days 19 hours 47 min 26 sec
Validate state Invalid
Credit 12,441.60
Device peak FLOPS 2.70 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:22:13 (1064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Dec 2012 03:28:45 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 1,036,800 1,462,146 1.4102
16 Dec 2012 18:02:20 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 1,010,880 1,425,524 1.4102
16 Dec 2012 07:09:14 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 984,960 1,389,450 1.4107
15 Dec 2012 21:16:40 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 959,040 1,353,991 1.4118
15 Dec 2012 11:14:02 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 933,120 1,318,153 1.4126
15 Dec 2012 00:56:55 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 907,200 1,281,331 1.4124
14 Dec 2012 14:42:27 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 881,280 1,244,653 1.4123
14 Dec 2012 05:21:35 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 855,360 1,207,979 1.4122
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 829,440 1,171,090 1.4119
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 803,520 1,134,218 1.4116
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 777,600 1,097,360 1.4112
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 751,680 1,060,539 1.4109
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 725,760 1,023,849 1.4107
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 699,840 987,058 1.4104
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 673,920 950,283 1.4101
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 648,000 913,525 1.4098
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 622,080 876,742 1.4094
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 596,160 839,947 1.4089
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 570,240 803,166 1.4085
14 Dec 2012 03:09:48 1229566 15465305 hadcm3n_o3e2_2100_40_008254536_0 544,320 766,484 1.4081


©2024 cpdn.org