Name | hadsm3dhet2_jrd6_006598460_0 |
Workunit | 6801833 |
Created | 15 Mar 2010, 12:05:07 UTC |
Sent | 27 Jun 2010, 7:09:13 UTC |
Report deadline | 9 Jun 2011, 12:29:13 UTC |
Received | 21 Dec 2010, 23:48:48 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED |
Computer ID | 993045 |
Run time | 69 days 22 hours 22 min 19 sec |
CPU time | 72 days 11 hours 44 min 32 sec |
Validate state | Invalid |
Credit | 3,870.49 |
Device peak FLOPS | 4.21 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> Maximum elapsed time exceeded </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5996, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CreateFile error 32 when trying set file time CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2944, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6116, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7868, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4460, iMonCtr=1 Model crash detected, will try to restart... MainError: 07:35:12 PM Permission denied MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Permission denied MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Permission denied MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Permission denied MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Permission denied MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Permission denied MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Permission denied MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:12 PM Not a netCDF id MainError: 07:35:34 PM Permission denied MainError: 07:35:34 PM Not a netCDF id MainError: 07:35:34 PM Not a netCDF id MainError: 07:35:34 PM Not a netCDF id MainError: 07:35:34 PM Permission denied MainError: 07:35:34 PM Not a netCDF id MainError: 07:35:34 PM Not a netCDF id MainError: 07:35:34 PM Not a netCDF id MainError: 07:35:34 PM Permission denied MainError: 07:35:34 PM Not a netCDF id MainError: 07:35:34 PM Not a netCDF id MainError: 07:35:34 PM Not a netCDF id MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. MainError: 07:35:46 PM No files match the supplied pattern. CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5596, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CreateFile error 32 when trying set file time CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10536, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5792, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5160, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5780, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5864, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6124, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CreateFile error 32 when trying set file time CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14632, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6312, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12580, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7636, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5452, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Abort request from BOINC... called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Dec 2010 08:41:57 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 162,030 | 6,001,545 | 14.2460 |
07 Dec 2010 21:35:49 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 151,228 | 5,653,241 | 13.7724 |
29 Nov 2010 13:38:09 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 140,426 | 5,304,867 | 13.2730 |
20 Nov 2010 13:56:28 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 129,624 | 4,956,741 | 12.7465 |
11 Nov 2010 13:32:37 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 118,822 | 4,608,687 | 12.1900 |
03 Nov 2010 23:07:10 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 108,020 | 4,251,268 | 11.5754 |
25 Oct 2010 06:22:59 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 97,218 | 3,892,900 | 10.9208 |
17 Oct 2010 14:06:17 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 86,416 | 3,539,941 | 10.2410 |
10 Oct 2010 13:52:43 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 75,614 | 3,190,460 | 9.5277 |
01 Oct 2010 14:07:18 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 64,812 | 2,840,836 | 8.7664 |
22 Sep 2010 09:37:38 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 54,010 | 2,490,541 | 7.9504 |
13 Sep 2010 09:34:05 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 43,208 | 2,149,628 | 7.1072 |
04 Sep 2010 07:37:17 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 32,406 | 1,816,799 | 6.2293 |
23 Aug 2010 15:50:07 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 21,604 | 1,484,396 | 5.2853 |
07 Aug 2010 18:42:46 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 10,802 | 1,150,666 | 4.2609 |
28 Jul 2010 19:40:10 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 259,248 | 818,256 | 3.1563 |
14 Jul 2010 16:18:25 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 248,446 | 483,621 | 1.9466 |
02 Jul 2010 15:37:23 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 237,644 | 154,965 | 0.6521 |
01 Jul 2010 19:43:07 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 226,842 | 148,353 | 0.6540 |
01 Jul 2010 17:47:48 | 993045 | 11046328 | hadsm3dhet2_jrd6_006598460_0 | 216,040 | 141,613 | 0.6555 |
©2024 cpdn.org