Name | hadam3p_eu_wy7x_2004_1_006906901_0 |
Workunit | 7110217 |
Created | 20 Nov 2010, 20:46:34 UTC |
Sent | 14 Feb 2011, 21:42:25 UTC |
Report deadline | 28 Jan 2012, 3:02:25 UTC |
Received | 24 Feb 2011, 12:59:12 UTC |
Server state | Over |
Outcome | Didn't need |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1118380 |
Run time | 4 days 0 hours 15 min 50 sec |
CPU time | 3 days 4 hours 53 min 27 sec |
Validate state | Invalid |
Credit | 1,194.02 |
Device peak FLOPS | 2.15 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> 06:23:20 (4252): No heartbeat from core client for 30 sec - exiting 06:23:21 (4252): No heartbeat from core client for 30 sec - exiting 06:23:22 (4252): No heartbeat from core client for 30 sec - exiting 06:23:23 (4252): No heartbeat from core client for 30 sec - exiting 06:23:24 (4252): No heartbeat from core client for 30 sec - exiting 06:23:25 (4252): No heartbeat from core client for 30 sec - exiting 06:23:26 (4252): No heartbeat from core client for 30 sec - exiting 06:23:27 (4252): No heartbeat from core client for 30 sec - exiting 06:23:28 (4252): No heartbeat from core client for 30 sec - exiting 06:23:29 (4252): No heartbeat from core client for 30 sec - exiting 06:23:30 (4252): No heartbeat from core client for 30 sec - exiting 06:23:31 (4252): No heartbeat from core client for 30 sec - exiting 06:23:32 (4252): No heartbeat from core client for 30 sec - exiting 06:23:33 (4252): No heartbeat from core client for 30 sec - exiting 06:23:34 (4252): No heartbeat from core client for 30 sec - exiting 06:23:35 (4252): No heartbeat from core client for 30 sec - exiting 06:23:36 (4252): No heartbeat from core client for 30 sec - exiting 06:23:37 (4252): No heartbeat from core client for 30 sec - exiting 06:23:38 (4252): No heartbeat from core client for 30 sec - exiting 06:23:39 (4252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4804, selfPID=5708, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 04:57:56 (212): No heartbeat from core client for 30 sec - exiting 04:57:57 (212): No heartbeat from core client for 30 sec - exiting 04:57:58 (212): No heartbeat from core client for 30 sec - exiting 04:57:59 (212): No heartbeat from core client for 30 sec - exiting 04:58:00 (212): No heartbeat from core client for 30 sec - exiting 04:58:01 (212): No heartbeat from core client for 30 sec - exiting 04:58:02 (212): No heartbeat from core client for 30 sec - exiting 04:58:03 (212): No heartbeat from core client for 30 sec - exiting 04:58:04 (212): No heartbeat from core client for 30 sec - exiting 04:58:05 (212): No heartbeat from core client for 30 sec - exiting 04:58:06 (212): No heartbeat from core client for 30 sec - exiting 04:58:07 (212): No heartbeat from core client for 30 sec - exiting 04:58:08 (212): No heartbeat from core client for 30 sec - exiting 04:58:09 (212): No heartbeat from core client for 30 sec - exiting 04:58:10 (212): No heartbeat from core client for 30 sec - exiting 04:58:11 (212): No heartbeat from core client for 30 sec - exiting 04:58:12 (212): No heartbeat from core client for 30 sec - exiting 04:58:13 (212): No heartbeat from core client for 30 sec - exiting 04:58:14 (212): No heartbeat from core client for 30 sec - exiting 04:58:15 (212): No heartbeat from core client for 30 sec - exiting 04:58:16 (212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1296, selfPID=1296, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 05:22:45 (1672): No heartbeat from core client for 30 sec - exiting 05:22:46 (1672): No heartbeat from core client for 30 sec - exiting 05:22:47 (1672): No heartbeat from core client for 30 sec - exiting 05:22:48 (1672): No heartbeat from core client for 30 sec - exiting 05:22:49 (1672): No heartbeat from core client for 30 sec - exiting 05:22:50 (1672): No heartbeat from core client for 30 sec - exiting 05:22:51 (1672): No heartbeat from core client for 30 sec - exiting 05:22:52 (1672): No heartbeat from core client for 30 sec - exiting 05:22:53 (1672): No heartbeat from core client for 30 sec - exiting 05:22:54 (1672): No heartbeat from core client for 30 sec - exiting 05:22:55 (1672): No heartbeat from core client for 30 sec - exiting 05:22:56 (1672): No heartbeat from core client for 30 sec - exiting 05:22:57 (1672): No heartbeat from core client for 30 sec - exiting 05:22:58 (1672): No heartbeat from core client for 30 sec - exiting 05:22:59 (1672): No heartbeat from core client for 30 sec - exiting 05:23:00 (1672): No heartbeat from core client for 30 sec - exiting 05:23:01 (1672): No heartbeat from core client for 30 sec - exiting 05:23:02 (1672): No heartbeat from core client for 30 sec - exiting 05:23:03 (1672): No heartbeat from core client for 30 sec - exiting 05:23:04 (1672): No heartbeat from core client for 30 sec - exiting 05:23:05 (1672): No heartbeat from core client for 30 sec - exiting 05:23:06 (1672): No heartbeat from core client for 30 sec - exiting 05:23:07 (1672): No heartbeat from core client for 30 sec - exiting 05:23:08 (1672): No heartbeat from core client for 30 sec - exiting 05:23:09 (1672): No heartbeat from core client for 30 sec - exiting 05:23:10 (1672): No heartbeat from core client for 30 sec - exiting 05:23:11 (1672): No heartbeat from core client for 30 sec - exiting 05:23:12 (1672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1640, selfPID=1640, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 06:39:51 (3480): No heartbeat from core client for 30 sec - exiting 06:39:52 (3480): No heartbeat from core client for 30 sec - exiting 06:39:53 (3480): No heartbeat from core client for 30 sec - exiting 06:39:54 (3480): No heartbeat from core client for 30 sec - exiting 06:39:55 (3480): No heartbeat from core client for 30 sec - exiting 06:39:56 (3480): No heartbeat from core client for 30 sec - exiting 06:39:57 (3480): No heartbeat from core client for 30 sec - exiting 06:39:58 (3480): No heartbeat from core client for 30 sec - exiting 06:39:59 (3480): No heartbeat from core client for 30 sec - exiting 06:40:00 (3480): No heartbeat from core client for 30 sec - exiting 06:40:01 (3480): No heartbeat from core client for 30 sec - exiting 06:40:02 (3480): No heartbeat from core client for 30 sec - exiting 06:40:03 (3480): No heartbeat from core client for 30 sec - exiting 06:40:04 (3480): No heartbeat from core client for 30 sec - exiting 06:40:05 (3480): No heartbeat from core client for 30 sec - exiting 06:40:06 (3480): No heartbeat from core client for 30 sec - exiting 06:40:07 (3480): No heartbeat from core client for 30 sec - exiting 06:40:08 (3480): No heartbeat from core client for 30 sec - exiting 06:40:09 (3480): No heartbeat from core client for 30 sec - exiting 06:40:10 (3480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5740, selfPID=5380, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 06:41:24 (3352): No heartbeat from core client for 30 sec - exiting 06:41:25 (3352): No heartbeat from core client for 30 sec - exiting 06:41:26 (3352): No heartbeat from core client for 30 sec - exiting 06:41:27 (3352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 16:19:58 (5956): No heartbeat from core client for 30 sec - exiting 16:19:59 (5956): No heartbeat from core client for 30 sec - exiting 16:20:00 (5956): No heartbeat from core client for 30 sec - exiting 16:20:01 (5956): No heartbeat from core client for 30 sec - exiting 16:20:02 (5956): No heartbeat from core client for 30 sec - exiting 16:20:03 (5956): No heartbeat from core client for 30 sec - exiting 16:20:04 (5956): No heartbeat from core client for 30 sec - exiting 16:20:05 (5956): No heartbeat from core client for 30 sec - exiting 16:20:06 (5956): No heartbeat from core client for 30 sec - exiting 16:20:07 (5956): No heartbeat from core client for 30 sec - exiting 16:20:08 (5956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7792, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=400, selfPID=4852, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 06:58:05 (4852): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_wy7x_2004_1_006906901_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wy7x_2004_1_006906901_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wy7x_2004_1_006906901_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wy7x_2004_1_006906901_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wy7x_2004_1_006906901_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wy7x_2004_1_006906901_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
22 Feb 2011 22:39:42 | 1118380 | 12182873 | hadam3p_eu_wy7x_2004_1_006906901_0 | 69,216 | 249,529 | 3.6051 |
21 Feb 2011 18:53:00 | 1118380 | 12182873 | hadam3p_eu_wy7x_2004_1_006906901_0 | 57,697 | 208,976 | 3.6220 |
21 Feb 2011 18:50:39 | 1118380 | 12182873 | hadam3p_eu_wy7x_2004_1_006906901_0 | 57,696 | 208,471 | 3.6133 |
20 Feb 2011 16:49:08 | 1118380 | 12182873 | hadam3p_eu_wy7x_2004_1_006906901_0 | 46,176 | 167,186 | 3.6206 |
19 Feb 2011 17:22:10 | 1118380 | 12182873 | hadam3p_eu_wy7x_2004_1_006906901_0 | 34,656 | 124,966 | 3.6059 |
18 Feb 2011 13:48:28 | 1118380 | 12182873 | hadam3p_eu_wy7x_2004_1_006906901_0 | 23,136 | 84,195 | 3.6391 |
16 Feb 2011 02:33:44 | 1118380 | 12182873 | hadam3p_eu_wy7x_2004_1_006906901_0 | 11,616 | 42,815 | 3.6859 |
©2024 cpdn.org