Name | hadam3p_anz_m8g8_2012_1_009308590_0 |
Workunit | 9392778 |
Created | 17 Dec 2014, 20:03:52 UTC |
Sent | 21 Dec 2014, 0:34:18 UTC |
Report deadline | 3 Dec 2015, 5:54:18 UTC |
Received | 18 Jan 2015, 19:28:41 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1218461 |
Run time | 4 days 3 hours 8 min 30 sec |
CPU time | 3 days 7 hours 48 min 1 sec |
Validate state | Invalid |
Credit | 1,503.36 |
Device peak FLOPS | 2.23 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.27</core_client_version> <![CDATA[ <stderr_txt> 18:01:46 (8260): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:01:58 (8260): No heartbeat from core client for 30 sec - exiting 18:01:59 (8260): No heartbeat from core client for 30 sec - exiting 18:02:00 (8260): No heartbeat from core client for 30 sec - exiting 18:02:01 (8260): No heartbeat from core client for 30 sec - exiting 18:02:02 (8260): No heartbeat from core client for 30 sec - exiting 18:02:03 (8260): No heartbeat from core client for 30 sec - exiting 18:02:04 (8260): No heartbeat from core client for 30 sec - exiting 18:02:05 (8260): No heartbeat from core client for 30 sec - exiting 18:02:06 (8260): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14008, selfPID=14008, iMonCtr=2 02:27:24 (13448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:27:28 (13448): No heartbeat from core client for 30 sec - exiting 02:27:30 (13448): No heartbeat from core client for 30 sec - exiting 02:27:31 (13448): No heartbeat from core client for 30 sec - exiting 02:27:32 (13448): No heartbeat from core client for 30 sec - exiting 02:27:33 (13448): No heartbeat from core client for 30 sec - exiting 02:27:34 (13448): No heartbeat from core client for 30 sec - exiting 02:27:35 (13448): No heartbeat from core client for 30 sec - exiting 02:27:36 (13448): No heartbeat from core client for 30 sec - exiting 02:27:37 (13448): No heartbeat from core client for 30 sec - exiting 02:27:38 (13448): No heartbeat from core client for 30 sec - exiting 02:27:40 (13448): No heartbeat from core client for 30 sec - exiting 02:27:41 (13448): No heartbeat from core client for 30 sec - exiting 02:27:42 (13448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11484, selfPID=11484, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 11:16:27 (13512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11576, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15836, iMonCtr=2 13:15:56 (9816): No heartbeat from core client for 30 sec - exiting 13:15:57 (9816): No heartbeat from core client for 30 sec - exiting 13:15:58 (9816): No heartbeat from core client for 30 sec - exiting 13:15:59 (9816): No heartbeat from core client for 30 sec - exiting 13:16:00 (9816): No heartbeat from core client for 30 sec - exiting 13:16:01 (9816): No heartbeat from core client for 30 sec - exiting 13:16:02 (9816): No heartbeat from core client for 30 sec - exiting 13:16:03 (9816): No heartbeat from core client for 30 sec - exiting 13:16:05 (9816): No heartbeat from core client for 30 sec - exiting 13:16:06 (9816): No heartbeat from core client for 30 sec - exiting 13:16:07 (9816): No heartbeat from core client for 30 sec - exiting 13:16:08 (9816): No heartbeat from core client for 30 sec - exiting 13:16:09 (9816): No heartbeat from core client for 30 sec - exiting 13:16:10 (9816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 06:49:56 (10676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:49:58 (10676): No heartbeat from core client for 30 sec - exiting 06:49:59 (10676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:16:47 (608): No heartbeat from core client for 30 sec - exiting 18:16:48 (608): No heartbeat from core client for 30 sec - exiting 18:16:49 (608): No heartbeat from core client for 30 sec - exiting 18:16:50 (608): No heartbeat from core client for 30 sec - exiting 18:16:51 (608): No heartbeat from core client for 30 sec - exiting 18:16:52 (608): No heartbeat from core client for 30 sec - exiting 18:16:53 (608): No heartbeat from core client for 30 sec - exiting 18:16:54 (608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6980, selfPID=6980, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 22:24:29 (4780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:24:46 (4780): No heartbeat from core client for 30 sec - exiting 22:24:49 (4780): No heartbeat from core client for 30 sec - exiting 22:24:50 (4780): No heartbeat from core client for 30 sec - exiting 22:24:51 (4780): No heartbeat from core client for 30 sec - exiting 22:24:52 (4780): No heartbeat from core client for 30 sec - exiting 22:24:53 (4780): No heartbeat from core client for 30 sec - exiting 22:24:55 (4780): No heartbeat from core client for 30 sec - exiting 22:24:56 (4780): No heartbeat from core client for 30 sec - exiting 22:24:57 (4780): No heartbeat from core client for 30 sec - exiting 22:24:58 (4780): No heartbeat from core client for 30 sec - exiting 22:25:00 (4780): No heartbeat from core client for 30 sec - exiting 22:25:01 (4780): No heartbeat from core client for 30 sec - exiting 22:25:02 (4780): No heartbeat from core client for 30 sec - exiting 22:25:03 (4780): No heartbeat from core client for 30 sec - exiting 22:25:04 (4780): No heartbeat from core client for 30 sec - exiting 22:25:05 (4780): No heartbeat from core client for 30 sec - exiting 22:25:06 (4780): No heartbeat from core client for 30 sec - exiting 22:25:07 (4780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:53:03 (14656): No heartbeat from core client for 30 sec - exiting 10:53:04 (14656): No heartbeat from core client for 30 sec - exiting 10:53:05 (14656): No heartbeat from core client for 30 sec - exiting 10:53:06 (14656): No heartbeat from core client for 30 sec - exiting 10:53:07 (14656): No heartbeat from core client for 30 sec - exiting 10:53:08 (14656): No heartbeat from core client for 30 sec - exiting 10:53:09 (14656): No heartbeat from core client for 30 sec - exiting 10:53:10 (14656): No heartbeat from core client for 30 sec - exiting 10:53:11 (14656): No heartbeat from core client for 30 sec - exiting 10:53:12 (14656): No heartbeat from core client for 30 sec - exiting 10:53:13 (14656): No heartbeat from core client for 30 sec - exiting 10:53:15 (14656): No heartbeat from core client for 30 sec - exiting 10:53:16 (14656): No heartbeat from core client for 30 sec - exiting 10:53:17 (14656): No heartbeat from core client for 30 sec - exiting 10:53:18 (14656): No heartbeat from core client for 30 sec - exiting 10:53:19 (14656): No heartbeat from core client for 30 sec - exiting 10:53:20 (14656): No heartbeat from core client for 30 sec - exiting 10:53:21 (14656): No heartbeat from core client for 30 sec - exiting 10:53:22 (14656): No heartbeat from core client for 30 sec - exiting 10:53:23 (14656): No heartbeat from core client for 30 sec - exiting 10:53:24 (14656): No heartbeat from core client for 30 sec - exiting 10:53:25 (14656): No heartbeat from core client for 30 sec - exiting 10:53:26 (14656): No heartbeat from core client for 30 sec - exiting 10:53:27 (14656): No heartbeat from core client for 30 sec - exiting 10:53:28 (14656): No heartbeat from core client for 30 sec - exiting 10:53:29 (14656): No heartbeat from core client for 30 sec - exiting 10:53:30 (14656): No heartbeat from core client for 30 sec - exiting 10:53:31 (14656): No heartbeat from core client for 30 sec - exiting 10:53:32 (14656): No heartbeat from core client for 30 sec - exiting 10:53:33 (14656): No heartbeat from core client for 30 sec - exiting 10:53:34 (14656): No heartbeat from core client for 30 sec - exiting 10:53:35 (14656): No heartbeat from core client for 30 sec - exiting 10:53:36 (14656): No heartbeat from core client for 30 sec - exiting 10:53:37 (14656): No heartbeat from core client for 30 sec - exiting 10:53:38 (14656): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:34:16 (18416): No heartbeat from core client for 30 sec - exiting 21:34:18 (18416): No heartbeat from core client for 30 sec - exiting 21:34:52 (18416): No heartbeat from core client for 30 sec - exiting 21:34:53 (18416): No heartbeat from core client for 30 sec - exiting 21:34:54 (18416): No heartbeat from core client for 30 sec - exiting 21:34:55 (18416): No heartbeat from core client for 30 sec - exiting 21:34:56 (18416): No heartbeat from core client for 30 sec - exiting 21:34:57 (18416): No heartbeat from core client for 30 sec - exiting 21:34:58 (18416): No heartbeat from core client for 30 sec - exiting 21:34:59 (18416): No heartbeat from core client for 30 sec - exiting 21:35:00 (18416): No heartbeat from core client for 30 sec - exiting 21:35:01 (18416): No heartbeat from core client for 30 sec - exiting 21:35:03 (18416): No heartbeat from core client for 30 sec - exiting 21:35:04 (18416): No heartbeat from core client for 30 sec - exiting 21:35:05 (18416): No heartbeat from core client for 30 sec - exiting 21:35:06 (18416): No heartbeat from core client for 30 sec - exiting 21:35:07 (18416): No heartbeat from core client for 30 sec - exiting 21:35:08 (18416): No heartbeat from core client for 30 sec - exiting 21:35:09 (18416): No heartbeat from core client for 30 sec - exiting 21:35:10 (18416): No heartbeat from core client for 30 sec - exiting 21:35:11 (18416): No heartbeat from core client for 30 sec - exiting 21:35:13 (18416): No heartbeat from core client for 30 sec - exiting 21:35:14 (18416): No heartbeat from core client for 30 sec - exiting 21:35:15 (18416): No heartbeat from core client for 30 sec - exiting 21:35:16 (18416): No heartbeat from core client for 30 sec - exiting 21:35:17 (18416): No heartbeat from core client for 30 sec - exiting 21:35:18 (18416): No heartbeat from core client for 30 sec - exiting 21:35:19 (18416): No heartbeat from core client for 30 sec - exiting 21:35:20 (18416): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:35:21 (18416): No heartbeat from core client for 30 sec - exiting 21:35:22 (18416): No heartbeat from core client for 30 sec - exiting 21:35:23 (18416): No heartbeat from core client for 30 sec - exiting 21:35:25 (18416): No heartbeat from core client for 30 sec - exiting 21:35:26 (18416): No heartbeat from core client for 30 sec - exiting 21:35:27 (18416): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 00:50:10 (6932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:39:32 (19488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:39:34 (19488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:04:50 (17420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:05:00 (17420): No heartbeat from core client for 30 sec - exiting 01:05:01 (17420): No heartbeat from core client for 30 sec - exiting 01:05:02 (17420): No heartbeat from core client for 30 sec - exiting 01:05:03 (17420): No heartbeat from core client for 30 sec - exiting 01:05:04 (17420): No heartbeat from core client for 30 sec - exiting 01:05:05 (17420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:39:53 (18800): No heartbeat from core client for 30 sec - exiting 10:39:55 (18800): No heartbeat from core client for 30 sec - exiting 10:39:56 (18800): No heartbeat from core client for 30 sec - exiting 10:39:57 (18800): No heartbeat from core client for 30 sec - exiting 10:39:58 (18800): No heartbeat from core client for 30 sec - exiting 10:39:59 (18800): No heartbeat from core client for 30 sec - exiting 10:40:00 (18800): No heartbeat from core client for 30 sec - exiting 10:40:01 (18800): No heartbeat from core client for 30 sec - exiting 10:40:02 (18800): No heartbeat from core client for 30 sec - exiting 10:40:03 (18800): No heartbeat from core client for 30 sec - exiting 10:40:04 (18800): No heartbeat from core client for 30 sec - exiting 10:40:05 (18800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=21704, selfPID=12064, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5380, selfPID=6052, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:04:50 (8960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:40:09 (10500): No heartbeat from core client for 30 sec - exiting 23:40:11 (10500): No heartbeat from core client for 30 sec - exiting 23:40:12 (10500): No heartbeat from core client for 30 sec - exiting 23:40:13 (10500): No heartbeat from core client for 30 sec - exiting 23:40:14 (10500): No heartbeat from core client for 30 sec - exiting 23:40:15 (10500): No heartbeat from core client for 30 sec - exiting 23:40:16 (10500): No heartbeat from core client for 30 sec - exiting 23:40:17 (10500): No heartbeat from core client for 30 sec - exiting 23:40:18 (10500): No heartbeat from core client for 30 sec - exiting 23:40:19 (10500): No heartbeat from core client for 30 sec - exiting 23:40:20 (10500): No heartbeat from core client for 30 sec - exiting 23:40:22 (10500): No heartbeat from core client for 30 sec - exiting 23:40:23 (10500): No heartbeat from core client for 30 sec - exiting 23:40:24 (10500): No heartbeat from core client for 30 sec - exiting 23:40:25 (10500): No heartbeat from core client for 30 sec - exiting 23:40:26 (10500): No heartbeat from core client for 30 sec - exiting 23:40:27 (10500): No heartbeat from core client for 30 sec - exiting 23:40:28 (10500): No heartbeat from core client for 30 sec - exiting 23:40:29 (10500): No heartbeat from core client for 30 sec - exiting 23:40:30 (10500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3076, selfPID=3076, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8552, selfPID=8552, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=10636, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1304, selfPID=1304, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1304, selfPID=9684, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_m8g8_2012_1_009308590_0_4.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m8g8_2012_1_009308590_0_5.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m8g8_2012_1_009308590_0_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m8g8_2012_1_009308590_0_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m8g8_2012_1_009308590_0_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m8g8_2012_1_009308590_0_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m8g8_2012_1_009308590_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m8g8_2012_1_009308590_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m8g8_2012_1_009308590_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
18 Jan 2015 16:59:24 | 1218461 | 17594860 | hadam3p_anz_m8g8_2012_1_009308590_0 | 34,859 | 287,260 | 8.2406 |
08 Jan 2015 16:21:53 | 1218461 | 17594860 | hadam3p_anz_m8g8_2012_1_009308590_0 | 23,339 | 193,414 | 8.2872 |
31 Dec 2014 20:28:29 | 1218461 | 17594860 | hadam3p_anz_m8g8_2012_1_009308590_0 | 11,819 | 98,667 | 8.3482 |
©2024 cpdn.org