Task 11294424

Name hadsm3dhet2_kai9_006623267_3
Workunit 6826640
Created 15 Mar 2010, 12:37:25 UTC
Sent 8 Apr 2010, 13:11:02 UTC
Report deadline 21 Mar 2011, 18:31:02 UTC
Received 29 Dec 2010, 22:29:11 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED
Computer ID 715996
Run time 138 days 3 hours 2 min 8 sec
CPU time 136 days 18 hours 54 min
Validate state Invalid
Credit 3,374.28
Device peak FLOPS 0.00 GFLOPS
Application version ---
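The exit status above is printed twice: as a signed decimal (-177) and as the same value read as an unsigned 32-bit integer (0xFFFFFF4F). The following is a minimal sketch of that two's-complement conversion; the helper names are hypothetical and are not part of BOINC.

```python
def to_unsigned32_hex(code: int) -> str:
    """Format a signed exit code as its 32-bit two's-complement hex form."""
    return f"0x{code & 0xFFFFFFFF:08X}"


def to_signed32(value: int) -> int:
    """Interpret a 32-bit unsigned value as a signed integer."""
    value &= 0xFFFFFFFF
    # If the top bit is set, the value represents a negative number.
    return value - 0x1_0000_0000 if value & 0x8000_0000 else value


print(to_unsigned32_hex(-177))
print(to_signed32(0xFFFFFF4F))
```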
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14576, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6992, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7052, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7592, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3068, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5168, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6844, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2616, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1716, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5340, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6816, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8168, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6088, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5612, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1056, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4108, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=352, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5540, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2180, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2976, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5508, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2016, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5940, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4980, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6080, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5720, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3664, iMonCtr=1
Model crash detected, will try to restart...
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
MainError:	09:17:41 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7004, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7032, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3632, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5424, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2252, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7260, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7260, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7464, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4440, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=936, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6764, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6448, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4228, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5744, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5852, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7216, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5180, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6968, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3064, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7100, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=796, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5660, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1436, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2188, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6856, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5600, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4572, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Abort request from BOINC...
called boinc_finish

</stderr_txt>
]]>
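The stderr text above is made up of a handful of repeating message types, so a tally is easier to read than the raw log. The sketch below counts each type and collects the selfPID values. The function name, the labels, and the short sample are illustrative assumptions; the sample lines are copied from the log, with the tabs in the MainError line replaced by spaces. Matching is by substring, so minor leading debris on a line does not affect the tally.

```python
import re
from collections import Counter

# Substrings that identify each message type seen in the stderr text.
PATTERNS = {
    "heartbeat_lost": "No heartbeat from core client",
    "model_restart": "Model crash detected",
    "main_error": "MainError:",
    "abort_request": "Abort request from BOINC",
}
PID_RE = re.compile(r"selfPID=(\d+)")


def summarize_stderr(text):
    """Return (Counter of message types, list of selfPID values in order)."""
    counts = Counter()
    pids = []
    for line in text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
        match = PID_RE.search(line)
        if match:
            pids.append(int(match.group(1)))
    return counts, pids


sample = """No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14576, iMonCtr=1
Model crash detected, will try to restart...
MainError: 09:17:41 AM No files match the supplied pattern.
CPDN Monitor - Abort request from BOINC...
"""

counts, pids = summarize_stderr(sample)
print(dict(counts), pids)
```

Running `summarize_stderr` over the full stderr text would give the number of heartbeat losses and model restarts this task went through before the abort request.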
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                   Timestep  CPU Time (sec)  Average (sec/TS)
21 Dec 2010 21:00:34  715996   11294424   hadsm3dhet2_kai9_006623267_3   108,020      11,445,234           31.1632
08 Dec 2010 12:22:20  715996   11294424   hadsm3dhet2_kai9_006623267_3    97,218      10,785,019           30.2554
24 Nov 2010 13:46:44  715996   11294424   hadsm3dhet2_kai9_006623267_3    86,416      10,126,103           29.2946
10 Nov 2010 13:57:20  715996   11294424   hadsm3dhet2_kai9_006623267_3    75,614       9,463,993           28.2624
26 Oct 2010 21:28:42  715996   11294424   hadsm3dhet2_kai9_006623267_3    64,812       8,802,493           27.1632
11 Oct 2010 22:21:36  715996   11294424   hadsm3dhet2_kai9_006623267_3    54,010       8,141,690           25.9904
26 Sep 2010 15:20:29  715996   11294424   hadsm3dhet2_kai9_006623267_3    43,208       7,482,191           24.7381
12 Sep 2010 15:43:15  715996   11294424   hadsm3dhet2_kai9_006623267_3    32,406       6,820,480           23.3855
28 Aug 2010 21:29:35  715996   11294424   hadsm3dhet2_kai9_006623267_3    21,604       6,157,490           21.9243
15 Aug 2010 00:28:11  715996   11294424   hadsm3dhet2_kai9_006623267_3    10,802       5,496,498           20.3536
31 Jul 2010 09:22:09  715996   11294424   hadsm3dhet2_kai9_006623267_3   259,248       4,838,047           18.6618
15 Jul 2010 16:07:10  715996   11294424   hadsm3dhet2_kai9_006623267_3   248,446       4,176,642           16.8111
30 Jun 2010 13:47:50  715996   11294424   hadsm3dhet2_kai9_006623267_3   237,644       3,514,854           14.7904
14 Jun 2010 10:26:06  715996   11294424   hadsm3dhet2_kai9_006623267_3   226,842       2,853,825           12.5807
29 May 2010 15:52:16  715996   11294424   hadsm3dhet2_kai9_006623267_3   216,040       2,193,373           10.1526
15 May 2010 10:50:28  715996   11294424   hadsm3dhet2_kai9_006623267_3   205,238       1,528,493            7.4474
30 Apr 2010 10:21:56  715996   11294424   hadsm3dhet2_kai9_006623267_3   194,436         867,920            4.4638
13 Apr 2010 22:49:52  715996   11294424   hadsm3dhet2_kai9_006623267_3   183,634         243,083            1.3237
13 Apr 2010 18:51:23  715996   11294424   hadsm3dhet2_kai9_006623267_3   172,832         228,872            1.3242
13 Apr 2010 14:49:35  715996   11294424   hadsm3dhet2_kai9_006623267_3   162,030         214,541            1.3241
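The Average (sec/TS) column appears to be cumulative CPU time divided by a cumulative timestep count. The Timestep column itself drops from 259,248 (31 Jul 2010) to 10,802 (15 Aug 2010), and the later averages are reproduced only if the timesteps counted before that drop are added back. This is an inference from the figures in the table, not documented CPDN behaviour, and the function name below is hypothetical. A minimal sketch of the check:

```python
def cumulative_timesteps(timesteps):
    """Undo counter resets in an oldest-first list of timestep values.

    Assumption: a drop in the value means the counter restarted, and the
    last value seen before the drop is the amount to carry forward.
    """
    offset, previous, result = 0, 0, []
    for ts in timesteps:
        if ts < previous:
            offset += previous
        result.append(ts + offset)
        previous = ts
    return result


# (timestep, CPU seconds, reported average), oldest first, from the table.
rows = [
    (183_634, 243_083, 1.3237),
    (259_248, 4_838_047, 18.6618),
    (10_802, 5_496_498, 20.3536),
    (108_020, 11_445_234, 31.1632),
]

totals = cumulative_timesteps([ts for ts, _, _ in rows])
for total, (_, cpu_seconds, reported) in zip(totals, rows):
    computed = cpu_seconds / total
    print(f"{total:>8,}  computed={computed:.4f}  reported={reported}")
```

For the 15 Aug 2010 row, for example, 5,496,498 / (259,248 + 10,802) is about 20.3536, which matches the reported value, while dividing by 10,802 alone does not.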