Name | hadsm3dhet2_jtgw_006601186_9 |
Workunit | 6804559 |
Created | 15 Mar 2010, 12:08:57 UTC |
Sent | 14 Jun 2010, 20:16:27 UTC |
Report deadline | 28 May 2011, 1:36:27 UTC |
Received | 10 Jul 2010, 2:29:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 1 (0x00000001) Unknown error code |
Computer ID | 1022351 |
Run time | 5 days 6 hours 57 min 15 sec |
CPU time | 4 days 14 hours 14 min 49 sec |
Validate state | Invalid |
Credit | 2,481.08 |
Device peak FLOPS | 2.04 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1) </message> <stderr_txt> No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Can't set up shared mem: -1 Will run in standalone mode. Can't set up shared mem: -1 Will run in standalone mode. 
Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 
No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. 
MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. MainError: 05:55:03 AM No files match the supplied pattern. CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5396, iMonCtr=1 Model crash detected, will try to restart... 
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5396, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5396, iMonCtr=1 Model crash detected, will try to restart... </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
10 Jul 2010 01:33:19 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 10,802 | 396,331 | 1.4676 |
09 Jul 2010 05:57:58 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 259,248 | 380,482 | 1.4676 |
08 Jul 2010 09:17:34 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 248,446 | 364,656 | 1.4677 |
06 Jul 2010 22:06:22 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 237,644 | 348,525 | 1.4666 |
06 Jul 2010 05:11:43 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 226,842 | 332,949 | 1.4678 |
05 Jul 2010 11:19:28 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 216,040 | 316,830 | 1.4665 |
04 Jul 2010 19:05:05 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 205,238 | 301,239 | 1.4678 |
03 Jul 2010 13:17:25 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 194,436 | 285,776 | 1.4698 |
02 Jul 2010 18:21:18 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 183,634 | 270,132 | 1.4710 |
01 Jul 2010 05:04:40 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 172,832 | 253,868 | 1.4689 |
29 Jun 2010 20:46:49 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 162,030 | 237,521 | 1.4659 |
29 Jun 2010 02:15:17 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 151,228 | 220,653 | 1.4591 |
28 Jun 2010 06:19:21 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 140,426 | 204,121 | 1.4536 |
27 Jun 2010 02:54:04 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 129,624 | 187,834 | 1.4491 |
25 Jun 2010 16:14:05 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 118,822 | 171,951 | 1.4471 |
24 Jun 2010 18:26:03 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 108,020 | 156,412 | 1.4480 |
24 Jun 2010 02:32:53 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 97,218 | 141,099 | 1.4514 |
22 Jun 2010 16:51:47 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 86,416 | 125,406 | 1.4512 |
21 Jun 2010 19:58:04 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 75,614 | 110,542 | 1.4619 |
18 Jun 2010 22:58:45 | 1022351 | 11073597 | hadsm3dhet2_jtgw_006601186_9 | 64,812 | 95,080 | 1.4670 |