climateprediction.net home page
Task 11899730

Task 11899730

Name hadsm3dhet2_u3eo_006725931_4
Workunit 6929274
Created 17 Sep 2010, 8:08:44 UTC
Sent 20 Sep 2010, 19:19:58 UTC
Report deadline 3 Sep 2011, 0:39:58 UTC
Received 23 Dec 2010, 13:23:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED
Computer ID 1305473
Run time 82 days 16 hours 48 min 23 sec
CPU time 71 days 7 hours 22 min 43 sec
Validate state Invalid
Credit 2,977.30
Device peak FLOPS 3.46 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7404, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6244, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8760, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8760, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8760, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8760, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6480, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1696, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
MainError:	02:05:37 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7376, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6512, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6260, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4804, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5784, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8840, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7652, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7340, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7792, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8020, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9056, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Abort request from BOINC...
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
22 Dec 2010 15:02:17 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 64,812 6,133,251 18.9263
14 Dec 2010 20:27:27 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 54,010 5,610,917 17.9115
07 Dec 2010 12:43:44 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 43,208 5,091,926 16.8353
30 Nov 2010 08:41:59 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 32,406 4,572,267 15.6770
20 Nov 2010 15:51:22 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 21,604 4,048,940 14.4166
11 Nov 2010 15:28:01 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 10,802 3,529,275 13.0690
04 Nov 2010 05:19:02 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 259,248 3,007,142 11.5995
27 Oct 2010 10:24:59 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 248,446 2,482,758 9.9931
18 Oct 2010 17:19:29 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 237,644 1,955,214 8.2275
11 Oct 2010 06:24:44 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 226,842 1,427,477 6.2928
03 Oct 2010 17:34:54 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 216,040 904,723 4.1878
26 Sep 2010 09:15:49 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 205,238 376,174 1.8329
23 Sep 2010 14:55:12 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 194,436 190,321 0.9788
23 Sep 2010 11:19:55 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 183,634 179,640 0.9783
23 Sep 2010 07:50:12 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 172,832 169,036 0.9780
23 Sep 2010 04:20:31 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 162,030 158,438 0.9778
23 Sep 2010 03:20:37 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 151,228 147,824 0.9775
22 Sep 2010 21:35:00 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 140,426 137,794 0.9813
22 Sep 2010 18:11:35 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 129,624 127,260 0.9818
22 Sep 2010 14:41:48 1076549 11899730 hadsm3dhet2_u3eo_006725931_4 118,822 116,707 0.9822


©2024 climateprediction.net