climateprediction.net home page
Task 11148359

Task 11148359

Name hadsm3dhet2_jz8j_006608661_6
Workunit 6812034
Created 15 Mar 2010, 12:18:33 UTC
Sent 18 May 2010, 20:12:05 UTC
Report deadline 1 May 2011, 1:32:05 UTC
Received 30 Nov 2010, 10:30:42 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED
Computer ID 1023370
Run time 111 days 6 hours 2 min 13 sec
CPU time 111 days 5 hours 30 min 8 sec
Validate state Invalid
Credit 2,580.33
Device peak FLOPS 0.00 GFLOPS
Application version ---
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4664, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3820, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=42CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4992, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4376, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4708, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4692, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4852, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4348, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6124, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6764, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4904, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5472, iMonCtr=1
Model crash detected, will try to restart...
CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4968, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4704, iMonCtr=1
Model crash detected, will try to restart...
CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4024, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
MainError:	10:01:41 AM	No files match the supplied pattern.
CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6664, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4836, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Abort request from BOINC...
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 Nov 2010 21:07:16 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 21,604 9,265,699 32.9914
14 Nov 2010 17:30:14 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 10,802 8,593,698 31.8226
05 Nov 2010 10:06:38 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 259,248 7,942,367 30.6362
21 Oct 2010 08:23:44 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 248,446 7,319,374 29.4606
09 Oct 2010 23:41:04 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 237,644 6,646,989 27.9704
28 Sep 2010 03:19:42 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 226,842 5,971,744 26.3256
18 Sep 2010 20:38:11 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 216,040 5,292,837 24.4993
04 Sep 2010 00:17:57 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 205,238 4,665,023 22.7298
18 Aug 2010 03:06:15 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 194,436 3,986,777 20.5043
04 Aug 2010 03:55:12 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 183,634 3,332,307 18.1465
24 Jul 2010 16:29:38 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 172,832 2,674,978 15.4773
10 Jul 2010 05:23:53 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 162,030 2,013,406 12.4261
27 Jun 2010 08:49:28 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 151,228 1,364,221 9.0210
05 Jun 2010 08:52:50 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 140,426 702,319 5.0013
23 May 2010 00:21:22 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 129,624 131,653 1.0157
22 May 2010 20:48:36 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 118,822 120,828 1.0169
22 May 2010 17:26:15 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 108,020 109,928 1.0177
22 May 2010 14:02:36 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 97,218 99,011 1.0184
22 May 2010 10:34:59 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 86,416 87,842 1.0165
20 May 2010 19:19:50 1023370 11148359 hadsm3dhet2_jz8j_006608661_6 75,614 76,895 1.0169


©2024 cpdn.org