Name | hadsm3dhet2_jkyf_006590153_0 |
Workunit | 6793526 |
Created | 15 Mar 2010, 11:53:11 UTC |
Sent | 21 Oct 2010, 22:01:07 UTC |
Report deadline | 4 Oct 2011, 3:21:07 UTC |
Received | 7 Dec 2010, 14:10:47 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1099533 |
Run time | 7 days 23 hours 22 min 24 sec |
CPU time | 5 days 13 hours 9 min 41 sec |
Validate state | Invalid |
Credit | 2,183.35 |
Device peak FLOPS | 2.33 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5524, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3904, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4936, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4116, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1580, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=944, iMonCtr=1 Model crash detected, will try to restart... CCPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6864, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4720, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5592, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]> |
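The stderr output above repeats the same few messages many times, so a small tally can make it easier to read. The following is a minimal sketch in Python, not part of CPDN or BOINC: the function name `summarize_stderr` and the dictionary keys are hypothetical, and it only matches literal strings that appear in the log above. It is run here on a short excerpt copied from the end of that log.

```python
import re

# Literal marker text copied from the stderr output above.
CRASH_MARKER = "Model crash detected, will try to restart..."
# The log prints "selfPID=<number>"; what that PID refers to is not
# stated in the log, so it is only collected here, not interpreted.
PID_PATTERN = re.compile(r"selfPID=(\d+)")


def summarize_stderr(text: str) -> dict:
    """Tally restart messages and selfPID values in a stderr string.

    Hypothetical helper for illustration only.
    """
    return {
        "crash_restarts": text.count(CRASH_MARKER),
        "self_pids": sorted({int(p) for p in PID_PATTERN.findall(text)}),
        "gave_up": "too many model crashes" in text,
    }


# Short excerpt copied from the end of the log above.
excerpt = (
    "CPDN process is not running, exiting, bRetVal = 1, checkPID=0, "
    "selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... "
    "CPDN process is not running, exiting, bRetVal = 1, checkPID=0, "
    "selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... "
    "Sorry, too many model crashes! :-( called boinc_finish"
)

print(summarize_stderr(excerpt))
```

Passing the full `<stderr_txt>` contents instead of the excerpt would tally the whole run in the same way.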
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
07 Dec 2010 14:11:57 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 237,644 | 477,786 | 2.0105 |
05 Dec 2010 11:41:45 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 226,842 | 456,213 | 2.0111 |
03 Dec 2010 19:37:44 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 216,040 | 434,311 | 2.0103 |
03 Dec 2010 00:58:51 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 205,238 | 412,065 | 2.0077 |
01 Dec 2010 23:18:10 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 194,436 | 390,072 | 2.0062 |
30 Nov 2010 18:52:42 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 183,634 | 367,983 | 2.0039 |
26 Nov 2010 06:10:30 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 172,832 | 346,634 | 2.0056 |
24 Nov 2010 09:15:34 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 162,030 | 325,958 | 2.0117 |
23 Nov 2010 09:46:15 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 151,228 | 303,582 | 2.0074 |
23 Nov 2010 00:58:32 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 140,426 | 281,183 | 2.0024 |
22 Nov 2010 16:14:27 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 129,624 | 259,568 | 2.0025 |
17 Nov 2010 15:34:30 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 118,822 | 238,712 | 2.0090 |
15 Nov 2010 19:17:08 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 108,020 | 218,775 | 2.0253 |
15 Nov 2010 09:56:08 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 97,218 | 196,509 | 2.0213 |
14 Nov 2010 21:17:42 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 86,416 | 172,781 | 1.9994 |
14 Nov 2010 10:44:12 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 75,614 | 150,048 | 1.9844 |
14 Nov 2010 07:23:34 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 64,812 | 127,478 | 1.9669 |
13 Nov 2010 17:27:18 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 54,010 | 105,533 | 1.9540 |
03 Nov 2010 05:30:36 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 43,208 | 83,871 | 1.9411 |
30 Oct 2010 18:35:11 | 1099533 | 10963257 | hadsm3dhet2_jkyf_006590153_0 | 32,406 | 62,680 | 1.9342 |
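The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against three rows of the trickle table. The function name `seconds_per_timestep` is hypothetical, and rounding to four decimal places is an assumption based on how the column is displayed.

```python
def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimal places.

    The 4-place rounding is an assumption matching the table's display.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_seconds / timestep, 4)


# (timestep, cpu_seconds, listed_average) copied from three table rows.
rows = [
    (237_644, 477_786, 2.0105),
    (226_842, 456_213, 2.0111),
    (32_406, 62_680, 1.9342),
]

for timestep, cpu_seconds, listed in rows:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    print(f"{timestep:>7}  computed={computed:.4f}  listed={listed:.4f}")
```

For these three rows the computed values match the listed averages; the other rows can be checked the same way.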