Name | hadam3p_pnw_2zls_1965_1_008141381_0 |
Workunit | 8296495 |
Created | 14 Aug 2012, 0:36:19 UTC |
Sent | 26 Aug 2012, 16:13:15 UTC |
Report deadline | 8 Aug 2013, 21:33:15 UTC |
Received | 5 Sep 2012, 18:02:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1215861 |
Run time | 2 days 7 hours 36 min |
CPU time | 2 days 3 hours 49 min 55 sec |
Validate state | Invalid |
Credit | 2,755.56 |
Device peak FLOPS | 3.39 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4940, selfPID=4940, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7084, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5320, selfPID=5320, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3328, selfPID=3328, iMonCtr=2 18:26:26 (2664): No heartbeat from core client for 30 sec - exiting 18:26:27 (2664): No heartbeat from core client for 30 sec - exiting 18:26:28 (2664): No heartbeat from core client for 30 sec - exiting 18:26:29 (2664): No heartbeat from core client for 30 sec - exiting 18:26:30 (2664): No heartbeat from core client for 30 sec - exiting 18:26:31 (2664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:25:35 (3352): Can't acquire lockfile (32) - waiting 35s 20:25:57 (6136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:40:07 (5872): No heartbeat from core client for 30 sec - exiting 09:40:08 (5872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:40:09 (5872): No heartbeat from core client for 30 sec - exiting 19:29:18 (5104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5128, selfPID=4520, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=2 19:18:18 (4044): No heartbeat from core client for 30 sec - exiting 19:18:19 (4044): No heartbeat from core client for 30 sec - exiting 19:18:20 (4044): No heartbeat from core client for 30 sec - exiting 19:18:21 (4044): No heartbeat from core client for 30 sec - exiting 19:18:22 (4044): No heartbeat from core client for 30 sec - exiting 19:18:23 (4044): No heartbeat from core client for 30 sec - exiting 19:18:24 (4044): No heartbeat from core client for 30 sec - exiting 19:18:25 (4044): No heartbeat from core client for 30 sec - exiting 19:18:27 (4044): No heartbeat from core client for 30 sec - exiting 19:18:28 (4044): No heartbeat from core client for 30 sec - exiting 19:18:29 (4044): No heartbeat from core client for 30 sec - exiting 19:18:30 (4044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:40:42 (3136): No heartbeat from core client for 30 sec - exiting 09:40:43 (3136): No heartbeat from core client for 30 sec - exiting 09:40:44 (3136): No heartbeat from core client for 30 sec - exiting 09:40:45 (3136): No heartbeat from core client for 30 sec - exiting 09:40:46 (3136): No heartbeat from core client for 30 sec - exiting 09:40:47 (3136): No heartbeat from core client for 30 sec - exiting 09:40:48 (3136): No heartbeat from core client for 30 sec - exiting 09:40:50 (3136): No heartbeat from core client for 30 sec - exiting 09:40:51 (3136): No heartbeat from core client for 30 sec - exiting 09:40:52 (3136): No heartbeat from core client for 30 sec - exiting 09:40:53 (3136): No heartbeat from core client for 30 sec - exiting 09:40:54 (3136): No heartbeat from core client for 30 sec - exiting 09:40:55 (3136): No heartbeat from core client for 30 sec - exiting 09:40:56 (3136): No heartbeat from core client for 30 sec - exiting 09:40:57 (3136): No heartbeat from core client for 30 sec - exiting 09:40:58 (3136): No heartbeat from core client for 30 sec - exiting 09:40:59 (3136): No heartbeat from core client for 30 sec - exiting 09:41:00 (3136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4544, selfPID=4544, iMonCtr=2 11:35:00 (5180): No heartbeat from core client for 30 sec - exiting 11:35:01 (5180): No heartbeat from core client for 30 sec - exiting 11:35:02 (5180): No heartbeat from core client for 30 sec - exiting 11:35:03 (5180): No heartbeat from core client for 30 sec - exiting 11:35:04 (5180): No heartbeat from core client for 30 sec - exiting 11:35:06 (5180): No heartbeat from core client for 30 sec - exiting 11:35:07 (5180): No heartbeat from core client for 30 sec - exiting 11:35:08 (5180): No heartbeat from core client for 30 sec - exiting 11:35:09 (5180): No heartbeat from core client for 30 sec - exiting 11:35:10 (5180): No heartbeat from core client for 30 sec - exiting 11:35:11 (5180): No heartbeat from core client for 30 sec - exiting 11:35:12 (5180): No heartbeat from core client for 30 sec - exiting 11:35:13 (5180): No heartbeat from core client for 30 sec - exiting 11:35:14 (5180): No heartbeat from core client for 30 sec - exiting 11:35:15 (5180): No heartbeat from core client for 30 sec - exiting 11:35:16 (5180): No heartbeat from core client for 30 sec - exiting 11:35:18 (5180): No heartbeat from core client for 30 sec - exiting 11:35:19 (5180): No heartbeat from core client for 30 sec - exiting 11:35:20 (5180): No heartbeat from core client for 30 sec - exiting 11:35:21 (5180): No heartbeat from core client for 30 sec - exiting 11:35:22 (5180): No heartbeat from core client for 30 sec - exiting 11:35:23 (5180): No heartbeat from core client for 30 sec - exiting 11:35:24 (5180): No heartbeat from core client for 30 sec - exiting 11:35:25 (5180): No heartbeat from core client for 30 sec - exiting 11:35:26 (5180): No heartbeat from core client for 30 sec - exiting 11:35:27 (5180): No heartbeat from core client for 30 sec - exiting 11:35:28 (5180): No heartbeat from core client for 30 sec - exiting 11:35:30 (5180): No heartbeat from core client for 30 sec - exiting 11:35:31 (5180): No heartbeat from core client for 30 sec - exiting 11:35:32 (5180): No heartbeat from core client for 30 sec - exiting 11:35:33 (5180): No heartbeat from core client for 30 sec - exiting 11:35:34 (5180): No heartbeat from core client for 30 sec - exiting 11:35:35 (5180): No heartbeat from core client for 30 sec - exiting 11:35:36 (5180): No heartbeat from core client for 30 sec - exiting 11:35:37 (5180): No heartbeat from core client for 30 sec - exiting 11:35:38 (5180): No heartbeat from core client for 30 sec - exiting 11:35:39 (5180): No heartbeat from core client for 30 sec - exiting 11:35:40 (5180): No heartbeat from core client for 30 sec - exiting 11:35:42 (5180): No heartbeat from core client for 30 sec - exiting 11:35:43 (5180): No heartbeat from core client for 30 sec - exiting 11:35:44 (5180): No heartbeat from core client for 30 sec - exiting 11:35:45 (5180): No heartbeat from core client for 30 sec - exiting 11:35:46 (5180): No heartbeat from core client for 30 sec - exiting 11:35:47 (5180): No heartbeat from core client for 30 sec - exiting 11:35:48 (5180): No heartbeat from core client for 30 sec - exiting 11:35:49 (5180): No heartbeat from core client for 30 sec - exiting 11:35:50 (5180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3328, selfPID=3328, iMonCtr=2 09:14:05 (5768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:41:32 (4804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5308, selfPID=5308, iMonCtr=2 09:13:44 (6952): No heartbeat from core client for 30 sec - exiting 09:13:45 (6952): No heartbeat from core client for 30 sec - exiting 09:13:46 (6952): No heartbeat from core client for 30 sec - exiting 09:13:47 (6952): No heartbeat from core client for 30 sec - exiting 09:13:49 (6952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:11:33 (7404): No heartbeat from core client for 30 sec - exiting 16:11:34 (7404): No heartbeat from core client for 30 sec - exiting 16:11:35 (7404): No heartbeat from core client for 30 sec - exiting 16:11:36 (7404): No heartbeat from core client for 30 sec - exiting 16:11:37 (7404): No heartbeat from core client for 30 sec - exiting 16:11:38 (7404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:38:27 (4812): No heartbeat from core client for 30 sec - exiting 10:38:28 (4812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:52:04 (5008): No heartbeat from core client for 30 sec - exiting 08:52:05 (5008): No heartbeat from core client for 30 sec - exiting 08:52:06 (5008): No heartbeat from core client for 30 sec - exiting 08:52:08 (5008): No heartbeat from core client for 30 sec - exiting 08:52:09 (5008): No heartbeat from core client for 30 sec - exiting 08:52:10 (5008): No heartbeat from core client for 30 sec - exiting 08:52:11 (5008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:52:38 (5880): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5116, selfPID=5116, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1576, selfPID=1576, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1576, selfPID=3340, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 11 Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_2zls_1965_1_008141381_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
05 Sep 2012 09:10:35 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 126,816 | 175,527 | 1.3841 |
03 Sep 2012 19:40:51 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 115,296 | 159,806 | 1.3860 |
03 Sep 2012 15:22:40 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 103,776 | 143,870 | 1.3864 |
02 Sep 2012 17:11:15 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 92,256 | 128,168 | 1.3893 |
02 Sep 2012 12:36:30 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 80,736 | 112,451 | 1.3928 |
01 Sep 2012 12:06:06 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 69,216 | 96,646 | 1.3963 |
01 Sep 2012 07:45:14 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 57,696 | 80,939 | 1.4029 |
30 Aug 2012 19:25:49 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 46,176 | 64,553 | 1.3980 |
29 Aug 2012 18:29:50 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 34,656 | 48,365 | 1.3956 |
28 Aug 2012 16:34:09 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 23,136 | 32,407 | 1.4007 |
27 Aug 2012 17:29:08 | 1215861 | 15113485 | hadam3p_pnw_2zls_1965_1_008141381_0 | 11,616 | 16,426 | 1.4141 |
©2024 cpdn.org