Name | hadcm3n_zc9p_1920_40_008244778_2 |
Workunit | 8399902 |
Created | 30 Nov 2012, 0:48:05 UTC |
Sent | 30 Nov 2012, 0:48:37 UTC |
Report deadline | 1 Mar 2013, 8:15:48 UTC |
Received | 16 Mar 2013, 14:36:14 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1022927 |
Run time | 31 days 19 hours 46 min 39 sec |
CPU time | 23 days 3 hours 6 min 55 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.23 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> 19:49:55 (5108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:36:00 (2328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:51:49 (4932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:47:24 (4184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7032, iMonCtr=1 Model crash detected, will try to restart... 19:04:38 (5192): No heartbeat from core client for 30 sec - exiting 19:04:39 (5192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4532, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4532, iMonCtr=1 Model crash detected, will try to restart... 19:09:56 (5600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:13:31 (3640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:05:56 (4388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5364, iMonCtr=1 Model crash detected, will try to restart... 19:02:43 (4704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 08:15:17 (4320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 16:34:01 (4356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:17:45 (1272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:08:39 (4308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:25:26 (5412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:20:55 (4912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:10:28 (4780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:12:08 (3880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:07:34 (3400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:10:37 (4388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:18:36 (5716): No heartbeat from core client for 30 sec - exiting 19:18:37 (5716): No heartbeat from core client for 30 sec - exiting 19:18:38 (5716): No heartbeat from core client for 30 sec - exiting 19:18:39 (5716): No heartbeat from core client for 30 sec - exiting 19:18:40 (5716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:18:49 (2420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:53:23 (4432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:08:13 (5580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 02:27:04 (5872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:24:35 (5960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:24:36 (5960): No heartbeat from core client for 30 sec - exiting 21:24:37 (5960): No heartbeat from core client for 30 sec - exiting 21:24:38 (5960): No heartbeat from core client for 30 sec - exiting 21:24:39 (5960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5652, iMonCtr=1 Model crash detected, will try to restart... 11:13:07 (5500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:15:35 (5188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4400, iMonCtr=1 Model crash detected, will try to restart... 19:12:49 (3096): No heartbeat from core client for 30 sec - exiting 19:12:50 (3096): No heartbeat from core client for 30 sec - exiting 19:12:51 (3096): No heartbeat from core client for 30 sec - exiting 19:12:52 (3096): No heartbeat from core client for 30 sec - exiting 19:12:53 (3096): No heartbeat from core client for 30 sec - exiting 19:12:54 (3096): No heartbeat from core client for 30 sec - exiting 19:12:55 (3096): No heartbeat from core client for 30 sec - exiting 19:12:56 (3096): No heartbeat from core client for 30 sec - exiting 19:12:57 (3096): No heartbeat from core client for 30 sec - exiting 19:12:58 (3096): No heartbeat from core client for 30 sec - exiting 19:12:59 (3096): No heartbeat from core client for 30 sec - exiting 19:13:00 (3096): No heartbeat from core client for 30 sec - exiting 19:13:01 (3096): No heartbeat from core client for 30 sec - exiting 19:13:02 (3096): No heartbeat from core client for 30 sec - exiting 19:13:03 (3096): No heartbeat from core client for 30 sec - exiting 19:13:04 (3096): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:10:31 (2500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 11:58:29 (6116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:33:49 (5308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5048, iMonCtr=1 Model crash detected, will try to restart... 00:03:09 (6128): No heartbeat from core client for 30 sec - exiting 00:03:11 (6128): No heartbeat from core client for 30 sec - exiting 00:03:12 (6128): No heartbeat from core client for 30 sec - exiting 00:03:13 (6128): No heartbeat from core client for 30 sec - exiting 00:03:14 (6128): No heartbeat from core client for 30 sec - exiting 00:03:15 (6128): No heartbeat from core client for 30 sec - exiting 00:03:16 (6128): No heartbeat from core client for 30 sec - exiting 00:03:17 (6128): No heartbeat from core client for 30 sec - exiting 00:03:18 (6128): No heartbeat from core client for 30 sec - exiting 00:03:19 (6128): No heartbeat from core client for 30 sec - exiting 00:03:20 (6128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5204, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1740, iMonCtr=1 Model crash detected, will try to restart... 19:09:14 (1648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 18:56:36 (4700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5720, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5720, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5720, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 19:16:21 (2440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:16:22 (2440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 08:29:39 (3384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:10:49 (5632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
16 Mar 2013 14:40:30 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 1,036,800 | 1,998,400 | 1.9275 |
15 Mar 2013 11:39:16 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 1,010,880 | 1,949,110 | 1.9281 |
13 Mar 2013 02:43:24 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 984,960 | 1,900,483 | 1.9295 |
28 Feb 2013 05:46:59 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 959,040 | 1,845,881 | 1.9247 |
24 Feb 2013 04:28:41 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 933,120 | 1,792,590 | 1.9211 |
23 Feb 2013 09:52:21 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 907,200 | 1,740,618 | 1.9187 |
22 Feb 2013 05:16:17 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 881,280 | 1,690,615 | 1.9184 |
17 Feb 2013 17:26:20 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 855,360 | 1,635,315 | 1.9118 |
14 Feb 2013 10:51:08 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 829,440 | 1,581,670 | 1.9069 |
10 Feb 2013 18:23:56 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 803,520 | 1,526,429 | 1.8997 |
29 Jan 2013 00:23:07 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 777,600 | 1,472,129 | 1.8932 |
15 Jan 2013 10:54:00 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 751,680 | 1,422,703 | 1.8927 |
11 Jan 2013 03:43:27 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 725,760 | 1,365,856 | 1.8820 |
04 Jan 2013 10:15:38 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 699,840 | 1,316,080 | 1.8805 |
03 Jan 2013 08:25:53 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 673,920 | 1,266,436 | 1.8792 |
31 Dec 2012 08:48:27 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 648,000 | 1,216,982 | 1.8781 |
30 Dec 2012 14:05:35 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 622,080 | 1,164,915 | 1.8726 |
29 Dec 2012 21:00:35 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 596,160 | 1,113,735 | 1.8682 |
29 Dec 2012 04:19:22 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 570,240 | 1,062,408 | 1.8631 |
28 Dec 2012 02:13:03 | 1022927 | 15466900 | hadcm3n_zc9p_1920_40_008244778_2 | 544,320 | 1,014,141 | 1.8631 |
©2024 cpdn.org