Name | hadcm3n_odk5_1900_40_008472696_1 |
Workunit | 8623535 |
Created | 30 Sep 2013, 16:25:35 UTC |
Sent | 30 Sep 2013, 16:52:20 UTC |
Report deadline | 31 Dec 2013, 0:19:31 UTC |
Received | 10 Nov 2013, 13:37:38 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 459222 |
Run time | 9 days 20 hours 9 min 6 sec |
CPU time | 9 days 17 hours 45 min 5 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 3.37 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> 16:45:50 (11156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5800, iMonCtr=1 Model crash detected, will try to restart... 13:05:31 (6104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:05:32 (6104): No heartbeat from core client for 30 sec - exiting 13:05:33 (6104): No heartbeat from core client for 30 sec - exiting 13:05:34 (6104): No heartbeat from core client for 30 sec - exiting 13:05:35 (6104): No heartbeat from core client for 30 sec - exiting 13:05:36 (6104): No heartbeat from core client for 30 sec - exiting 13:05:37 (6104): No heartbeat from core client for 30 sec - exiting 13:05:38 (6104): No heartbeat from core client for 30 sec - exiting 13:05:39 (6104): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3776, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 16:44:36 (11144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:57:55 (8748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:57:06 (10112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8676, iMonCtr=1 Model crash detected, will try to restart... 17:44:39 (10472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9916, iMonCtr=1 Model crash detected, will try to restart... 19:32:15 (10876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:32:16 (10876): No heartbeat from core client for 30 sec - exiting 19:32:17 (10876): No heartbeat from core client for 30 sec - exiting 19:32:18 (10876): No heartbeat from core client for 30 sec - exiting 19:32:19 (10876): No heartbeat from core client for 30 sec - exiting 19:32:20 (10876): No heartbeat from core client for 30 sec - exiting 19:32:21 (10876): No heartbeat from core client for 30 sec - exiting 19:32:22 (10876): No heartbeat from core client for 30 sec - exiting 19:32:23 (10876): No heartbeat from core client for 30 sec - exiting 19:32:24 (10876): No heartbeat from core client for 30 sec - exiting 19:32:25 (10876): No heartbeat from core client for 30 sec - exiting 09:48:35 (10824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:18:59 (5372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:19:00 (5372): No heartbeat from core client for 30 sec - exiting 19:19:01 (5372): No heartbeat from core client for 30 sec - exiting 19:19:02 (5372): No heartbeat from core client for 30 sec - exiting 19:19:03 (5372): No heartbeat from core client for 30 sec - exiting 19:19:04 (5372): No heartbeat from core client for 30 sec - exiting 19:19:05 (5372): No heartbeat from core client for 30 sec - exiting 19:19:06 (5372): No heartbeat from core client for 30 sec - exiting 19:19:07 (5372): No heartbeat from core client for 30 sec - exiting 19:19:08 (5372): No heartbeat from core client for 30 sec - exiting 19:19:09 (5372): No heartbeat from core client for 30 sec - exiting 20:09:36 (6352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:36:47 (8096): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9316, iMonCtr=1 Model crash detected, will try to restart... 19:07:22 (9004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:07:23 (9004): No heartbeat from core client for 30 sec - exiting 19:07:24 (9004): No heartbeat from core client for 30 sec - exiting 19:07:25 (9004): No heartbeat from core client for 30 sec - exiting 19:07:26 (9004): No heartbeat from core client for 30 sec - exiting 19:07:27 (9004): No heartbeat from core client for 30 sec - exiting 19:07:28 (9004): No heartbeat from core client for 30 sec - exiting 19:07:29 (9004): No heartbeat from core client for 30 sec - exiting 19:07:30 (9004): No heartbeat from core client for 30 sec - exiting 19:07:31 (9004): No heartbeat from core client for 30 sec - exiting 19:07:32 (9004): No heartbeat from core client for 30 sec - exiting 10:02:02 (1176): No heartbeat from core client for 30 sec - exiting 10:02:03 (1176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:19:34 (11204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
10 Nov 2013 11:13:31 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 1,036,800 | 841,494 | 0.8116 |
09 Nov 2013 18:04:26 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 1,010,880 | 820,765 | 0.8119 |
09 Nov 2013 12:27:05 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 984,960 | 800,530 | 0.8128 |
08 Nov 2013 17:51:32 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 959,040 | 779,286 | 0.8126 |
07 Nov 2013 17:47:37 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 933,120 | 757,322 | 0.8116 |
04 Nov 2013 18:52:47 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 907,200 | 734,920 | 0.8101 |
03 Nov 2013 17:02:08 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 881,280 | 714,498 | 0.8108 |
03 Nov 2013 11:28:39 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 855,360 | 694,552 | 0.8120 |
02 Nov 2013 17:06:02 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 829,440 | 673,661 | 0.8122 |
02 Nov 2013 11:24:09 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 803,520 | 653,122 | 0.8128 |
01 Nov 2013 18:37:41 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 777,600 | 632,020 | 0.8128 |
01 Nov 2013 13:03:50 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 751,680 | 611,958 | 0.8141 |
31 Oct 2013 21:43:28 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 725,760 | 591,521 | 0.8150 |
30 Oct 2013 20:14:04 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 699,840 | 571,695 | 0.8169 |
29 Oct 2013 18:57:41 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 673,920 | 551,283 | 0.8180 |
27 Oct 2013 18:57:22 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 648,000 | 530,876 | 0.8193 |
25 Oct 2013 18:55:05 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 622,080 | 510,872 | 0.8212 |
21 Oct 2013 18:27:02 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 596,160 | 490,155 | 0.8222 |
20 Oct 2013 16:52:00 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 570,240 | 469,050 | 0.8225 |
20 Oct 2013 10:35:15 | 459222 | 16050990 | hadcm3n_odk5_1900_40_008472696_1 | 544,320 | 446,709 | 0.8207 |
©2024 climateprediction.net