climateprediction.net home page
Task 10958739

Name hadsm3dhet2_jklv_006589701_3
Workunit 6793074
Created 15 Mar 2010, 11:52:27 UTC
Sent 23 Oct 2010, 4:24:27 UTC
Report deadline 5 Oct 2011, 9:44:27 UTC
Received 1 Nov 2010, 11:18:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 922180
Run time 3 days 14 hours 20 min 30 sec
CPU time 3 days 10 hours 34 min 53 sec
Validate state Invalid
Credit 1,984.87
Device peak FLOPS 2.25 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07 (windows_intelx86)
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=5556, selfPID=5556, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=3452, selfPID=3452, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=5596, selfPID=5596, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=5892, selfPID=5892, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=3340, selfPID=3340, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
forrtl: Access is denied.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6172, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6172, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6172, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6172, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6172, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6172, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
01 Nov 2010 04:58:33 922180 10958739 hadsm3dhet2_jklv_006589701_3 216,040 286,244 1.3250
27 Oct 2010 18:10:50 922180 10958739 hadsm3dhet2_jklv_006589701_3 205,238 271,870 1.3247
27 Oct 2010 13:02:48 922180 10958739 hadsm3dhet2_jklv_006589701_3 194,436 257,298 1.3233
27 Oct 2010 09:03:18 922180 10958739 hadsm3dhet2_jklv_006589701_3 183,634 242,843 1.3224
27 Oct 2010 05:04:36 922180 10958739 hadsm3dhet2_jklv_006589701_3 172,832 228,662 1.3230
27 Oct 2010 04:00:04 922180 10958739 hadsm3dhet2_jklv_006589701_3 162,030 214,168 1.3218
26 Oct 2010 21:02:52 922180 10958739 hadsm3dhet2_jklv_006589701_3 151,228 200,030 1.3227
26 Oct 2010 15:00:33 922180 10958739 hadsm3dhet2_jklv_006589701_3 140,426 185,619 1.3218
26 Oct 2010 03:26:20 922180 10958739 hadsm3dhet2_jklv_006589701_3 129,624 171,423 1.3225
25 Oct 2010 21:08:27 922180 10958739 hadsm3dhet2_jklv_006589701_3 118,822 157,219 1.3231
25 Oct 2010 17:09:11 922180 10958739 hadsm3dhet2_jklv_006589701_3 108,020 142,886 1.3228
25 Oct 2010 11:43:33 922180 10958739 hadsm3dhet2_jklv_006589701_3 97,218 128,610 1.3229
25 Oct 2010 04:34:16 922180 10958739 hadsm3dhet2_jklv_006589701_3 86,416 114,520 1.3252
25 Oct 2010 03:32:35 922180 10958739 hadsm3dhet2_jklv_006589701_3 75,614 100,251 1.3258
24 Oct 2010 20:31:11 922180 10958739 hadsm3dhet2_jklv_006589701_3 64,812 85,778 1.3235
24 Oct 2010 16:28:00 922180 10958739 hadsm3dhet2_jklv_006589701_3 54,010 71,455 1.3230
24 Oct 2010 12:29:43 922180 10958739 hadsm3dhet2_jklv_006589701_3 43,208 57,197 1.3238
24 Oct 2010 08:31:50 922180 10958739 hadsm3dhet2_jklv_006589701_3 32,406 42,985 1.3265
24 Oct 2010 04:43:38 922180 10958739 hadsm3dhet2_jklv_006589701_3 21,604 28,752 1.3309
24 Oct 2010 03:34:50 922180 10958739 hadsm3dhet2_jklv_006589701_3 10,802 14,359 1.3293
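
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The page does not state this; it is an assumption inferred from the figures. A minimal sketch that checks it against three rows copied from the table (the function name is hypothetical):

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
rows = [
    (216_040, 286_244, 1.3250),
    (108_020, 142_886, 1.3228),
    (10_802, 14_359, 1.3293),
]


def average_sec_per_timestep(cpu_seconds, timestep):
    """Assumed formula: cumulative CPU seconds per timestep, to 4 decimals."""
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in rows:
    computed = average_sec_per_timestep(cpu_seconds, timestep)
    # Print the recomputed value next to the value reported by the server.
    print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f}")
```

If the assumption holds, the computed and reported values agree for each row. The roughly constant 1.32 sec/TS suggests the host ran at a steady rate up to the last trickle on 01 Nov 2010.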