Task 11102254

Name hadsm3dhet2_jvoh_006604051_7
Workunit 6807424
Created 15 Mar 2010, 12:12:38 UTC
Sent 2 Jun 2010, 10:10:32 UTC
Report deadline 15 May 2011, 15:30:32 UTC
Received 8 Jul 2010, 6:07:33 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1051426
Run time 7 days 14 hours 25 min 55 sec
CPU time 6 days 4 hours 47 min 22 sec
Validate state Invalid
Credit 1,984.87
Device peak FLOPS 0.00 GFLOPS
Application version ---
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6120, iMonCtr=1
Model crash detected, will try to restart...
CCPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5220, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5220, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5220, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5220, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5220, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5220, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                   Timestep  CPU Time (sec)  Average (sec/TS)
06 Jul 2010 23:32:44  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   216,040         518,017            2.3978
06 Jul 2010 07:22:17  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   205,238         494,909            2.4114
05 Jul 2010 22:36:10  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   194,436         472,642            2.4308
05 Jul 2010 01:16:49  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   183,634         449,009            2.4451
04 Jul 2010 08:45:14  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   172,832         425,976            2.4647
03 Jul 2010 10:38:11  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   162,030         403,939            2.4930
30 Jun 2010 21:05:00  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   151,228         378,862            2.5052
29 Jun 2010 14:18:38  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   140,426         352,253            2.5085
28 Jun 2010 13:50:42  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   129,624         326,154            2.5162
27 Jun 2010 15:05:51  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   118,822         298,872            2.5153
26 Jun 2010 01:21:32  1051426  11102254   hadsm3dhet2_jvoh_006604051_7   108,020         272,492            2.5226
25 Jun 2010 05:45:38  1051426  11102254   hadsm3dhet2_jvoh_006604051_7    97,218         245,170            2.5219
23 Jun 2010 21:11:14  1051426  11102254   hadsm3dhet2_jvoh_006604051_7    86,416         218,946            2.5336
22 Jun 2010 21:03:51  1051426  11102254   hadsm3dhet2_jvoh_006604051_7    75,614         192,033            2.5396
19 Jun 2010 10:42:08  1051426  11102254   hadsm3dhet2_jvoh_006604051_7    64,812         164,351            2.5358
16 Jun 2010 23:18:15  1051426  11102254   hadsm3dhet2_jvoh_006604051_7    54,010         137,005            2.5367
13 Jun 2010 09:26:37  1051426  11102254   hadsm3dhet2_jvoh_006604051_7    43,208         108,998            2.5226
09 Jun 2010 22:00:57  1051426  11102254   hadsm3dhet2_jvoh_006604051_7    32,406          82,483            2.5453
08 Jun 2010 11:35:25  1051426  11102254   hadsm3dhet2_jvoh_006604051_7    21,604          56,334            2.6076
03 Jun 2010 09:49:14  1051426  11102254   hadsm3dhet2_jvoh_006604051_7    10,802          28,695            2.6565
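
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The page itself does not state this, so treat it as an assumption. The short Python sketch below checks it against three rows copied from the table. The helper name average_sec_per_ts is hypothetical and is not part of BOINC or the CPDN server code.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
rows = [
    (216_040, 518_017, 2.3978),
    (108_020, 272_492, 2.5226),
    (10_802, 28_695, 2.6565),
]


def average_sec_per_ts(cpu_time_sec, timestep):
    """Hypothetical helper: CPU seconds per model timestep, rounded to 4 places.

    Assumes the table's average is simply cpu_time / timestep.
    """
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_ts(cpu_time_sec, timestep)
    # Print computed and reported values side by side for comparison.
    print(timestep, cpu_time_sec, computed, reported)
```

For these three rows the computed value matches the reported one to four decimal places. Other rows could differ in the last digit if the server keeps fractional CPU seconds that the table does not show.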


©2024 cpdn.org