Name | hadam3p_anz_rbaf_2012_1_008744245_0 |
Workunit | 8890223 |
Created | 8 May 2014, 20:02:37 UTC |
Sent | 9 May 2014, 7:27:59 UTC |
Report deadline | 21 Apr 2015, 12:47:59 UTC |
Received | 12 May 2014, 17:10:34 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1319781 |
Run time | 2 days 15 hours 41 min 12 sec |
CPU time | 2 days 14 hours 35 min |
Validate state | Invalid |
Credit | 1,503.36 |
Device peak FLOPS | 2.67 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:58:04 (988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:00:25 (724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:37:04 (5796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:55:27 (6324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:56:11 (8172): No heartbeat from core client for 30 sec - exiting 04:56:12 (8172): No heartbeat from core client for 30 sec - exiting 04:56:13 (8172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:15:09 (3388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:52:27 (2412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
09:53:17 (4084): No heartbeat from core client for 30 sec - exiting 09:53:18 (4084): No heartbeat from core client for 30 sec - exiting 09:53:19 (4084): No heartbeat from core client for 30 sec - exiting 09:53:20 (4084): No heartbeat from core client for 30 sec - exiting 09:53:21 (4084): No heartbeat from core client for 30 sec - exiting 09:53:22 (4084): No heartbeat from core client for 30 sec - exiting 09:53:23 (4084): No heartbeat from core client for 30 sec - exiting 09:53:24 (4084): No heartbeat from core client for 30 sec - exiting 09:53:25 (4084): No heartbeat from core client for 30 sec - exiting 09:53:26 (4084): No heartbeat from core client for 30 sec - exiting 09:53:27 (4084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:07:33 (4024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:09:06 (5144): No heartbeat from core client for 30 sec - exiting 11:09:07 (5144): No heartbeat from core client for 30 sec - exiting 11:09:08 (5144): No heartbeat from core client for 30 sec - exiting 11:09:09 (5144): No heartbeat from core client for 30 sec - exiting 11:09:10 (5144): No heartbeat from core client for 30 sec - exiting 11:09:11 (5144): No heartbeat from core client for 30 sec - exiting 11:09:12 (5144): No heartbeat from core client for 30 sec - exiting 11:09:13 (5144): No heartbeat from core client for 30 sec - exiting 11:09:14 (5144): No heartbeat from core client for 30 sec - exiting 11:09:15 (5144): No heartbeat from core client for 30 sec - exiting 11:09:16 (5144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:42:54 (3600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
11:43:41 (7160): No heartbeat from core client for 30 sec - exiting 11:43:42 (7160): No heartbeat from core client for 30 sec - exiting 11:43:43 (7160): No heartbeat from core client for 30 sec - exiting 11:43:44 (7160): No heartbeat from core client for 30 sec - exiting 11:43:45 (7160): No heartbeat from core client for 30 sec - exiting 11:43:46 (7160): No heartbeat from core client for 30 sec - exiting 11:43:47 (7160): No heartbeat from core client for 30 sec - exiting 11:43:48 (7160): No heartbeat from core client for 30 sec - exiting 11:43:49 (7160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:19:32 (3308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:20:31 (7156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:19:11 (5704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:19:12 (5704): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 06:33:32 (9040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:34:37 (124): No heartbeat from core client for 30 sec - exiting 06:34:38 (124): No heartbeat from core client for 30 sec - exiting 06:34:39 (124): No heartbeat from core client for 30 sec - exiting 06:34:40 (124): No heartbeat from core client for 30 sec - exiting 06:34:41 (124): No heartbeat from core client for 30 sec - exiting 06:34:42 (124): No heartbeat from core client for 30 sec - exiting 06:34:43 (124): No heartbeat from core client for 30 sec - exiting 06:34:44 (124): No heartbeat from core client for 30 sec - exiting 06:34:45 (124): No heartbeat from core client for 30 sec - exiting 06:34:46 (124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:13:05 (6584): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:40:44 (8544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:40:47 (8544): No heartbeat from core client for 30 sec - exiting 14:46:16 (6992): No heartbeat from core client for 30 sec - exiting 14:46:17 (6992): No heartbeat from core client for 30 sec - exiting 14:46:18 (6992): No heartbeat from core client for 30 sec - exiting 14:46:19 (6992): No heartbeat from core client for 30 sec - exiting 14:46:20 (6992): No heartbeat from core client for 30 sec - exiting 14:46:21 (6992): No heartbeat from core client for 30 sec - exiting 14:46:22 (6992): No heartbeat from core client for 30 sec - exiting 14:46:23 (6992): No heartbeat from core client for 30 sec - exiting 14:46:24 (6992): No heartbeat from core client for 30 sec - exiting 14:46:25 (6992): No heartbeat from core client for 30 sec - exiting 14:46:26 (6992): No heartbeat from core client for 30 sec - exiting 14:46:27 (6992): No heartbeat from core client for 30 sec - exiting 14:46:28 (6992): No heartbeat from core client for 30 sec - exiting 14:46:29 (6992): No heartbeat from core client for 30 sec - exiting 14:46:30 (6992): No heartbeat from core client for 30 sec - exiting 14:46:31 (6992): No heartbeat from core client for 30 sec - exiting 14:46:32 (6992): No heartbeat from core client for 30 sec - exiting 14:46:33 (6992): No heartbeat from core client for 30 sec - exiting 14:46:34 (6992): No heartbeat from core client for 30 sec - exiting 14:46:35 (6992): No heartbeat 
from core client for 30 sec - exiting 14:46:36 (6992): No heartbeat from core client for 30 sec - exiting 14:46:37 (6992): No heartbeat from core client for 30 sec - exiting 14:46:38 (6992): No heartbeat from core client for 30 sec - exiting 14:46:39 (6992): No heartbeat from core client for 30 sec - exiting 14:46:40 (6992): No heartbeat from core client for 30 sec - exiting 14:46:41 (6992): No heartbeat from core client for 30 sec - exiting 14:46:42 (6992): No heartbeat from core client for 30 sec - exiting 14:46:44 (6992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:51:31 (7660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:51:56 (5576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:51:58 (5576): No heartbeat from core client for 30 sec - exiting 02:55:40 (7896): No heartbeat from core client for 30 sec - exiting 02:55:41 (7896): No heartbeat from core client for 30 sec - exiting 02:55:42 (7896): No heartbeat from core client for 30 sec - exiting 02:55:43 (7896): No heartbeat from core client for 30 sec - exiting 02:55:44 (7896): No heartbeat from core client for 30 sec - exiting 02:55:45 (7896): No heartbeat from core client for 30 sec - exiting 02:55:46 (7896): No heartbeat from core client for 30 sec - exiting 02:55:47 (7896): No heartbeat from core client for 30 sec - exiting 02:55:48 (7896): No heartbeat from core client for 30 sec - exiting 02:55:49 (7896): No heartbeat from core client for 30 sec - exiting 02:55:50 (7896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=7604, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9072, selfPID=6396, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_rbaf_2012_1_008744245_0_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rbaf_2012_1_008744245_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rbaf_2012_1_008744245_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rbaf_2012_1_008744245_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rbaf_2012_1_008744245_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rbaf_2012_1_008744245_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rbaf_2012_1_008744245_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rbaf_2012_1_008744245_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_rbaf_2012_1_008744245_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
12 May 2014 07:47:36 | 1319781 | 16627816 | hadam3p_anz_rbaf_2012_1_008744245_0 | 34,859 | 225,292 | 6.4630 |
11 May 2014 07:06:16 | 1319781 | 16627816 | hadam3p_anz_rbaf_2012_1_008744245_0 | 23,339 | 150,058 | 6.4295 |
10 May 2014 09:35:13 | 1319781 | 16627816 | hadam3p_anz_rbaf_2012_1_008744245_0 | 11,819 | 75,754 | 6.4095 |
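
The "Average (sec/TS)" column in the trickle rows above appears to be the cumulative CPU time divided by the timestep count. The page itself does not state this, so treat it as an assumption. The sketch below is a minimal check of that reading against the three rows. The function name `average_sec_per_timestep` and the `TRICKLES` list are illustrative and not part of any BOINC or CPDN tooling.

```python
# Assumption: Average (sec/TS) = CPU Time (sec) / Timestep, rounded to 4 decimals.
# The tuples below are copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (34859, 225292, 6.4630),
    (23339, 150058, 6.4295),
    (11819, 75754, 6.4095),
]


def average_sec_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Return CPU seconds spent per model timestep, rounded to 4 decimal places."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


def matches_reported(timestep: int, cpu_time_sec: float, reported: float,
                     tolerance: float = 1e-4) -> bool:
    """Check whether the recomputed average agrees with the reported value."""
    return abs(average_sec_per_timestep(cpu_time_sec, timestep) - reported) < tolerance


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_time_sec, timestep)
        status = "ok" if matches_reported(timestep, cpu_time_sec, reported) else "mismatch"
        print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f} {status}")
```

If the assumption holds, all three rows agree with the reported averages to four decimal places, which also shows the per-timestep cost drifting up slightly (from about 6.41 to about 6.46 seconds) over the run.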
©2024 climateprediction.net