Task 11903566

Name hadsm3dhet2_u9fs_006726315_0
Workunit 6929658
Created 17 Sep 2010, 8:09:17 UTC
Sent 19 Sep 2010, 10:56:48 UTC
Report deadline 1 Sep 2011, 16:16:48 UTC
Received 29 Nov 2010, 14:55:57 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1101986
Run time 3 days 22 hours 16 min 7 sec
CPU time 3 days 21 hours 53 min 56 sec
Validate state Invalid
Credit 2,977.30
Device peak FLOPS 2.52 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07 windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1156, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:37 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
MainError:	07:35:38 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                   Timestep  CPU Time (sec)  Average (sec/TS)
29 Nov 2010 13:32:50  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  64,812    337,297         1.0408
27 Nov 2010 18:36:00  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  54,010    325,948         1.0405
25 Nov 2010 21:35:17  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  43,208    314,613         1.0402
25 Nov 2010 16:53:18  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  32,406    303,004         1.0389
24 Nov 2010 21:12:40  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  21,604    291,533         1.0380
24 Nov 2010 16:44:18  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  10,802    280,268         1.0378
23 Nov 2010 19:39:29  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  259,248   268,784         1.0368
23 Nov 2010 15:01:12  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  248,446   257,658         1.0371
21 Nov 2010 18:05:14  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  237,644   246,680         1.0380
21 Nov 2010 10:05:46  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  226,842   235,589         1.0386
15 Nov 2010 15:18:42  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  216,040   224,533         1.0393
14 Nov 2010 11:42:15  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  205,238   213,398         1.0398
14 Nov 2010 07:18:14  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  194,436   202,389         1.0409
11 Nov 2010 19:35:09  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  183,634   191,243         1.0414
05 Nov 2010 09:56:19  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  172,832   179,753         1.0400
31 Oct 2010 16:01:44  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  162,030   168,444         1.0396
21 Oct 2010 15:38:35  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  151,228   157,199         1.0395
09 Oct 2010 23:48:34  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  140,426   145,908         1.0390
05 Oct 2010 21:26:27  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  129,624   134,466         1.0374
05 Oct 2010 16:08:41  1101986  11903566   hadsm3dhet2_u9fs_006726315_0  118,822   123,087         1.0359
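
The Average (sec/TS) column can be cross-checked against the other two numeric columns. The timestep counter drops from 259,248 to 10,802 between the two 23 and 24 Nov trickles, so dividing CPU time by the raw timestep no longer matches after that point. The sketch below is not part of the page. It assumes that a drop means the counter restarted and that the average is total CPU time divided by cumulative timesteps. The names `TRICKLES` and `unwrap_timesteps` are made up for illustration, and only four rows from the table are used.

```python
# Sample rows copied from the trickle table, oldest first:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
TRICKLES = [
    (248_446, 257_658, 1.0371),
    (259_248, 268_784, 1.0368),
    (10_802, 280_268, 1.0378),
    (64_812, 337_297, 1.0408),
]


def unwrap_timesteps(rows):
    """Return rows with the timestep counter made cumulative.

    Assumption: a drop in the timestep value means the counter restarted,
    so the last value seen before the drop is carried forward as an offset.
    """
    offset = 0
    previous = 0
    result = []
    for timestep, cpu_sec, reported in rows:
        if timestep < previous:
            offset += previous
        previous = timestep
        result.append((offset + timestep, cpu_sec, reported))
    return result


if __name__ == "__main__":
    for total, cpu_sec, reported in unwrap_timesteps(TRICKLES):
        computed = cpu_sec / total
        print(f"{total:>7} timesteps  computed {computed:.4f}  reported {reported:.4f}")
```

For these four rows the computed and reported values agree to four decimal places, which is consistent with the assumption but does not confirm how the server actually derives the column.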