Name | hadam3p_pnw_2x6q_1999_1_007176506_1 |
Workunit | 7374788 |
Created | 10 Mar 2011, 0:46:51 UTC |
Sent | 10 Mar 2011, 1:09:52 UTC |
Report deadline | 20 Feb 2012, 6:29:52 UTC |
Received | 4 Jul 2011, 7:57:39 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 989394 |
Run time | 10 days 5 hours 57 min 16 sec |
CPU time | 7 days 23 hours 45 min 34 sec |
Validate state | Workunit error - check skipped |
Credit | 3,003.79 |
Device peak FLOPS | 1.55 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4528, selfPID=1772, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3056, selfPID=648, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5072, selfPID=2148, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6024, selfPID=1352, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5316, selfPID=2352, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1352, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2720, iMonCtr=2 Model crash detected, will try to restart... 20:16:42 (3656): No heartbeat from core client for 30 sec - exiting 20:16:43 (3656): No heartbeat from core client for 30 sec - exiting 20:16:44 (3656): No heartbeat from core client for 30 sec - exiting 20:16:45 (3656): No heartbeat from core client for 30 sec - exiting 20:16:46 (3656): No heartbeat from core client for 30 sec - exiting 20:16:47 (3656): No heartbeat from core client for 30 sec - exiting 20:16:48 (3656): No heartbeat from core client for 30 sec - exiting 20:16:49 (3656): No heartbeat from core client for 30 sec - exiting 20:16:50 (3656): No heartbeat from core client for 30 sec - exiting 20:16:51 (3656): No heartbeat from core client for 30 sec - exiting 20:16:52 (3656): No heartbeat from core client for 30 sec - exiting 20:16:53 (3656): No heartbeat from core client for 30 sec - exiting 20:16:54 (3656): No heartbeat from core client for 30 sec - exiting 20:16:55 (3656): No heartbeat from core client for 30 sec - exiting 20:16:56 (3656): No heartbeat from core client for 30 sec - exiting 20:16:57 (3656): No heartbeat from core client for 30 sec - exiting 20:16:58 (3656): No heartbeat from core client for 30 sec - exiting 20:16:59 (3656): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6140, selfPID=5100, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5948, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2608, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3324, iMonCtr=2 Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3016, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3448, selfPID=6016, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5904, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2892, selfPID=4792, iMonCtr=1 Model crash detected, will try to restart... 18:46:23 (1424): No heartbeat from core client for 30 sec - exiting 18:46:24 (1424): No heartbeat from core client for 30 sec - exiting 18:46:25 (1424): No heartbeat from core client for 30 sec - exiting 18:46:26 (1424): No heartbeat from core client for 30 sec - exiting 18:46:27 (1424): No heartbeat from core client for 30 sec - exiting 18:46:28 (1424): No heartbeat from core client for 30 sec - exiting 18:46:29 (1424): No heartbeat from core client for 30 sec - exiting 18:46:30 (1424): No heartbeat from core client for 30 sec - exiting 18:46:31 (1424): No heartbeat from core client for 30 sec - exiting 18:46:32 (1424): No heartbeat from core client for 30 sec - exiting 18:46:33 (1424): No heartbeat from core client for 30 sec - exiting 18:46:34 (1424): No heartbeat from core client for 30 sec - exiting 18:46:35 (1424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:55:55 (5432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:24:28 (4872): No heartbeat from core client for 30 sec - exiting 01:24:29 (4872): No heartbeat from core client for 30 sec - exiting 01:24:30 (4872): No heartbeat from core client for 30 sec - exiting 01:24:31 (4872): No heartbeat from core client for 30 sec - exiting 01:24:32 (4872): No heartbeat from core client for 30 sec - exiting 01:24:33 (4872): No heartbeat from core client for 30 sec - exiting 01:24:34 (4872): No heartbeat from core client for 30 sec - exiting 01:24:35 (4872): No heartbeat from core client for 30 sec - exiting 01:24:36 (4872): No heartbeat from core client for 30 sec - exiting 01:24:37 (4872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:29:46 (292): No heartbeat from core client for 30 sec - exiting 01:29:47 (292): No heartbeat from core client for 30 sec - exiting 01:29:48 (292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:14:20 (296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:50:37 (3952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5588, iMonCtr=2 Model crash detected, will try to restart... 00:43:39 (5644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:00:15 (5116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2832, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4876, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=424, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3396, selfPID=5484, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1424, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4248, selfPID=5764, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=708, selfPID=5352, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4256, selfPID=4784, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=352, selfPID=3260, iMonCtr=1 Model crash detected, will try to restart... 16:19:14 (5620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:19:17 (5620): No heartbeat from core client for 30 sec - exiting 16:19:18 (5620): No heartbeat from core client for 30 sec - exiting 16:19:19 (5620): No heartbeat from core client for 30 sec - exiting 16:19:20 (5620): No heartbeat from core client for 30 sec - exiting 16:19:21 (5620): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5480, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1516, selfPID=5496, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5188, selfPID=4232, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1264, selfPID=4628, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5084, iMonCtr=2 Model crash detected, will try to restart... 13:13:39 (5516): No heartbeat from core client for 30 sec - exiting 13:13:40 (5516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5920, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 5 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2752, selfPID=3996, iMonCtr=1 Model crash detected, will try to restart... 13:28:02 (5956): No heartbeat from core client for 30 sec - exiting 13:28:03 (5956): No heartbeat from core client for 30 sec - exiting 13:28:04 (5956): No heartbeat from core client for 30 sec - exiting 13:28:05 (5956): No heartbeat from core client for 30 sec - exiting 13:28:06 (5956): No heartbeat from core client for 30 sec - exiting 13:28:07 (5956): No heartbeat from core client for 30 sec - exiting 13:28:08 (5956): No heartbeat from core client for 30 sec - exiting 13:28:09 (5956): No heartbeat from core client for 30 sec - exiting 13:28:10 (5956): No heartbeat from core client for 30 sec - exiting 13:28:11 (5956): No heartbeat from core client for 30 sec - exiting 13:28:12 (5956): No heartbeat from core client for 30 sec - exiting 13:28:13 (5956): No heartbeat from core client for 30 sec - exiting 13:28:14 (5956): No heartbeat from core client for 30 sec - exiting 13:28:15 (5956): No heartbeat from core client for 30 sec - exiting 13:28:16 (5956): No heartbeat from core client for 30 sec - exiting 13:28:17 (5956): No heartbeat from core client for 30 sec - exiting 13:28:19 (5956): No heartbeat from core client for 30 sec - exiting 13:28:20 (5956): No heartbeat from core client for 30 sec - exiting 13:28:21 (5956): No heartbeat from core client for 30 sec - exiting 13:28:22 (5956): No heartbeat from core client for 30 sec - exiting 13:28:23 (5956): No heartbeat from core client for 30 sec - exiting 13:28:24 (5956): No heartbeat from core client for 30 sec - exiting 13:28:25 (5956): No heartbeat from core client for 30 sec - exiting 13:28:26 (5956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6100, selfPID=6072, iMonCtr=1 Model crash detected, will try to restart... C23:40:25 (5156): No heartbeat from core client for 30 sec - exiting 23:40:26 (5156): No heartbeat from core client for 30 sec - exiting 23:40:27 (5156): No heartbeat from core client for 30 sec - exiting 23:40:28 (5156): No heartbeat from core client for 30 sec - exiting 23:40:29 (5156): No heartbeat from core client for 30 sec - exiting 23:40:30 (5156): No heartbeat from core client for 30 sec - exiting 23:40:31 (5156): No heartbeat from core client for 30 sec - exiting 23:40:32 (5156): No heartbeat from core client for 30 sec - exiting 23:40:33 (5156): No heartbeat from core client for 30 sec - exiting 23:40:34 (5156): No heartbeat from core client for 30 sec - exiting 23:40:35 (5156): No heartbeat from core client for 30 sec - exiting 23:40:36 (5156): No heartbeat from core client for 30 sec - exiting 23:40:37 (5156): No heartbeat from core client for 30 sec - exiting 23:40:38 (5156): No heartbeat from core client for 30 sec - exiting 23:40:39 (5156): No heartbeat from core client for 30 sec - exiting 23:40:40 (5156): No heartbeat from core client for 30 sec - exiting 23:40:41 (5156): No heartbeat from core client for 30 sec - exiting 23:40:42 (5156): No heartbeat from core client for 30 sec - exiting 23:40:43 (5156): No heartbeat from core client for 30 sec - exiting 23:40:44 (5156): No heartbeat from core client for 30 sec - exiting 23:40:45 (5156): No heartbeat from core client for 30 sec - exiting 23:40:46 (5156): No heartbeat from core client for 30 sec - exiting 23:40:47 (5156): No heartbeat from core client for 30 sec - exiting 23:40:48 (5156): No heartbeat from core client for 30 sec - exiting 23:40:49 (5156): No heartbeat from core client for 30 sec - exiting 23:40:50 (5156): No heartbeat from core client for 30 sec - exiting 23:40:51 (5156): No heartbeat from core client for 30 sec - exiting 23:40:52 (5156): No heartbeat from core client for 30 sec - exiting 23:40:53 (5156): No heartbeat from core client for 30 sec - exiting 23:40:54 (5156): No heartbeat from core client for 30 sec - exiting 23:40:55 (5156): No heartbeat from core client for 30 sec - exiting 23:40:56 (5156): No heartbeat from core client for 30 sec - exiting 23:40:57 (5156): No heartbeat from core client for 30 sec - exiting 23:40:58 (5156): No heartbeat from core client for 30 sec - exiting 23:40:59 (5156): No heartbeat from core client for 30 sec - exiting 23:41:00 (5156): No heartbeat from core client for 30 sec - exiting 23:41:01 (5156): No heartbeat from core client for 30 sec - exiting 23:41:02 (5156): No heartbeat from core client for 30 sec - exiting 23:41:03 (5156): No heartbeat from core client for 30 sec - exiting 23:41:04 (5156): No heartbeat from core client for 30 sec - exiting 23:41:05 (5156): No heartbeat from core client for 30 sec - exiting 23:41:06 (5156): No heartbeat from core client for 30 sec - exiting 23:41:07 (5156): No heartbeat from core client for 30 sec - exiting 23:41:08 (5156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4876, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 6 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=160, selfPID=5616, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3604, selfPID=5148, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5308, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5456, iMonCtr=2 Model crash detected, will try to restart... 01:23:13 (5612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:23:14 (5612): No heartbeat from core client for 30 sec - exiting 01:23:15 (5612): No heartbeat from core client for 30 sec - exiting 01:23:16 (5612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5088, selfPID=7764, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5516, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5028, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 00:39:42 (5016): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
04 Jul 2011 05:56:36 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 138,240 | 688,526 | 4.9807 |
02 Jul 2011 05:43:54 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 126,720 | 629,334 | 4.9663 |
30 Jun 2011 21:18:29 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 115,200 | 570,604 | 4.9532 |
29 Jun 2011 22:05:25 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 103,680 | 512,569 | 4.9438 |
28 Jun 2011 16:03:37 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 92,160 | 455,441 | 4.9419 |
26 Jun 2011 06:43:49 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 80,640 | 397,597 | 4.9305 |
06 Jun 2011 01:21:41 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 69,120 | 339,151 | 4.9067 |
15 May 2011 22:36:58 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 57,600 | 282,207 | 4.8994 |
21 Apr 2011 03:28:04 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 46,089 | 224,148 | 4.8634 |
21 Apr 2011 03:28:04 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 46,080 | 223,437 | 4.8489 |
21 Apr 2011 03:28:04 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 34,633 | 167,536 | 4.8375 |
21 Apr 2011 03:28:04 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 34,560 | 166,280 | 4.8113 |
08 Apr 2011 20:55:53 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 23,136 | 111,861 | 4.8349 |
30 Mar 2011 00:28:03 | 989394 | 12655936 | hadam3p_pnw_2x6q_1999_1_007176506_1 | 11,616 | 56,932 | 4.9012 |
©2024 cpdn.org