Name | hadsm3dhet2_jo1y_006594168_3 |
Workunit | 6797541 |
Created | 15 Mar 2010, 11:59:44 UTC |
Sent | 10 Oct 2010, 18:54:58 UTC |
Report deadline | 23 Sep 2011, 0:14:58 UTC |
Received | 15 Nov 2010, 16:49:29 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1040325 |
Run time | 6 days 21 hours 46 min 29 sec |
CPU time | 5 days 3 hours 18 min 3 sec |
Validate state | Invalid |
Credit | 2,778.81 |
Device peak FLOPS | 2.05 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2956, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. MainError: 08:16:26 PM No files match the supplied pattern. CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... Model crashed: 7R CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... Model crashed: 7R CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Nov 2010 19:19:28 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 43,208 | 441,272 | 1.4590 |
11 Nov 2010 19:19:28 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 32,406 | 430,017 | 1.4744 |
11 Nov 2010 19:19:28 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 21,604 | 418,621 | 1.4905 |
11 Nov 2010 19:19:28 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 10,802 | 407,229 | 1.5080 |
02 Nov 2010 22:04:59 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 259,248 | 393,720 | 1.5187 |
27 Oct 2010 23:35:54 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 248,446 | 376,911 | 1.5171 |
27 Oct 2010 17:29:47 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 237,644 | 359,614 | 1.5132 |
27 Oct 2010 16:58:57 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 226,842 | 342,382 | 1.5093 |
27 Oct 2010 16:58:57 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 216,040 | 325,090 | 1.5048 |
26 Oct 2010 20:37:12 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 205,238 | 308,247 | 1.5019 |
23 Oct 2010 22:49:42 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 194,436 | 291,603 | 1.4997 |
23 Oct 2010 16:24:16 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 183,634 | 275,203 | 1.4986 |
23 Oct 2010 10:00:35 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 172,832 | 258,667 | 1.4966 |
22 Oct 2010 21:08:06 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 162,030 | 242,124 | 1.4943 |
22 Oct 2010 04:21:37 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 151,228 | 225,633 | 1.4920 |
21 Oct 2010 16:49:47 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 140,426 | 209,204 | 1.4898 |
21 Oct 2010 16:49:47 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 129,624 | 192,855 | 1.4878 |
21 Oct 2010 16:49:47 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 118,822 | 177,720 | 1.4957 |
21 Oct 2010 16:49:47 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 108,020 | 162,474 | 1.5041 |
20 Oct 2010 18:08:42 | 1040325 | 11003410 | hadsm3dhet2_jo1y_006594168_3 | 97,218 | 146,969 | 1.5117 |
©2024 cpdn.org