climateprediction.net home page
Task 11138355

Task 11138355

Name hadsm3dhet2_jygr_006607661_3
Workunit 6811034
Created 15 Mar 2010, 12:17:09 UTC
Sent 21 May 2010, 19:31:20 UTC
Report deadline 4 May 2011, 0:51:20 UTC
Received 24 Jun 2010, 7:02:30 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1049197
Run time 5 days 17 hours 8 min 42 sec
CPU time 5 days 0 hours 40 min 32 sec
Validate state Invalid
Credit 1,786.38
Device peak FLOPS 0.00 GFLOPS
Application version ---
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1212, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1652, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=156, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=156, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=156, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1488, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4908, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4160, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5608, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5548, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5252, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5252, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3444, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3444, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3444, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3444, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1952, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5472, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5472, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5472, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5472, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5472, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5472, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Jun 2010 16:53:33 1049197 11138355 hadsm3dhet2_jygr_006607661_3 194,436 425,660 2.1892
13 Jun 2010 15:19:08 1049197 11138355 hadsm3dhet2_jygr_006607661_3 183,634 401,978 2.1890
12 Jun 2010 17:15:15 1049197 11138355 hadsm3dhet2_jygr_006607661_3 172,832 379,158 2.1938
11 Jun 2010 01:45:34 1049197 11138355 hadsm3dhet2_jygr_006607661_3 162,030 355,979 2.1970
10 Jun 2010 18:35:49 1049197 11138355 hadsm3dhet2_jygr_006607661_3 151,228 332,392 2.1980
09 Jun 2010 19:56:22 1049197 11138355 hadsm3dhet2_jygr_006607661_3 140,426 308,492 2.1968
07 Jun 2010 11:39:01 1049197 11138355 hadsm3dhet2_jygr_006607661_3 129,624 284,778 2.1970
03 Jun 2010 12:33:10 1049197 11138355 hadsm3dhet2_jygr_006607661_3 118,822 261,185 2.1981
03 Jun 2010 10:20:02 1049197 11138355 hadsm3dhet2_jygr_006607661_3 108,020 237,662 2.2002
02 Jun 2010 07:38:24 1049197 11138355 hadsm3dhet2_jygr_006607661_3 97,218 214,198 2.2033
31 May 2010 21:14:48 1049197 11138355 hadsm3dhet2_jygr_006607661_3 86,416 190,251 2.2016
30 May 2010 14:54:31 1049197 11138355 hadsm3dhet2_jygr_006607661_3 75,614 166,567 2.2029
29 May 2010 20:31:51 1049197 11138355 hadsm3dhet2_jygr_006607661_3 64,812 142,768 2.2028
29 May 2010 12:15:19 1049197 11138355 hadsm3dhet2_jygr_006607661_3 54,010 116,568 2.1583
28 May 2010 20:32:52 1049197 11138355 hadsm3dhet2_jygr_006607661_3 43,208 92,984 2.1520
28 May 2010 13:21:10 1049197 11138355 hadsm3dhet2_jygr_006607661_3 32,406 69,405 2.1417
24 May 2010 18:04:29 1049197 11138355 hadsm3dhet2_jygr_006607661_3 21,604 45,247 2.0944
23 May 2010 12:06:22 1049197 11138355 hadsm3dhet2_jygr_006607661_3 10,802 22,317 2.0660


©2024 cpdn.org