Name | hadam3p_eu_xpjv_2002_1_006995123_1 |
Workunit | 7198439 |
Created | 2 Jul 2012, 10:52:23 UTC |
Sent | 2 Jul 2012, 12:26:47 UTC |
Report deadline | 14 Jun 2013, 17:46:47 UTC |
Received | 4 Jul 2012, 21:41:53 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1122757 |
Run time | 1 days 22 hours 27 min 28 sec |
CPU time | 1 days 19 hours 45 min 2 sec |
Validate state | Invalid |
Credit | 399.11 |
Device peak FLOPS | 1.69 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version> <![CDATA[ <stderr_txt> 18:58:00 (5100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:58:09 (5100): No heartbeat from core client for 30 sec - exiting 18:58:10 (5100): No heartbeat from core client for 30 sec - exiting 18:58:11 (5100): No heartbeat from core client for 30 sec - exiting 18:58:12 (5100): No heartbeat from core client for 30 sec - exiting 18:58:13 (5100): No heartbeat from core client for 30 sec - exiting 18:58:14 (5100): No heartbeat from core client for 30 sec - exiting 18:58:15 (5100): No heartbeat from core client for 30 sec - exiting 18:58:16 (5100): No heartbeat from core client for 30 sec - exiting 18:58:17 (5100): No heartbeat from core client for 30 sec - exiting 18:58:18 (5100): No heartbeat from core client for 30 sec - exiting 18:58:19 (5100): No heartbeat from core client for 30 sec - exiting 18:58:20 (5100): No heartbeat from core client for 30 sec - exiting 18:58:21 (5100): No heartbeat from core client for 30 sec - exiting 02:16:27 (4712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:21:53 (4272): No heartbeat from core client for 30 sec - exiting 02:21:54 (4272): No heartbeat from core client for 30 sec - exiting 02:21:55 (4272): No heartbeat from core client for 30 sec - exiting 02:21:56 (4272): No heartbeat from core client for 30 sec - exiting 02:21:57 (4272): No heartbeat from core client for 30 sec - exiting 02:21:58 (4272): No heartbeat from core client for 30 sec - exiting 02:21:59 (4272): No heartbeat from core client for 30 sec - exiting 02:22:00 (4272): No heartbeat from core client for 30 sec - exiting 02:22:02 (4272): No heartbeat from core client for 30 sec - exiting 02:22:03 (4272): No heartbeat from core client for 30 sec - exiting 02:22:04 (4272): No heartbeat from core client for 30 sec - exiting 02:22:05 (4272): No heartbeat from core client for 30 sec - exiting 02:22:09 (4272): No heartbeat from core client for 30 sec - exiting 02:22:11 (4272): No heartbeat from core client for 30 sec - exiting 02:22:12 (4272): No heartbeat from core client for 30 sec - exiting 02:22:13 (4272): No heartbeat from core client for 30 sec - exiting 02:22:14 (4272): No heartbeat from core client for 30 sec - exiting 02:22:15 (4272): No heartbeat from core client for 30 sec - exiting 02:22:16 (4272): No heartbeat from core client for 30 sec - exiting 02:22:17 (4272): No heartbeat from core client for 30 sec - exiting 02:22:22 (4272): No heartbeat from core client for 30 sec - exiting 02:22:23 (4272): No heartbeat from core client for 30 sec - exiting 02:22:24 (4272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:41:02 (3284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:52:25 (6140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:53:42 (5800): No heartbeat from core client for 30 sec - exiting 06:55:02 (5800): No heartbeat from core client for 30 sec - exiting 06:55:03 (5800): No heartbeat from core client for 30 sec - exiting 06:55:04 (5800): No heartbeat from core client for 30 sec - exiting 06:55:05 (5800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:57:02 (2160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:59:27 (1812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:01:13 (3916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4976, selfPID=4976, iMonCtr=2 07:02:51 (4420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:04:59 (5424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:09:21 (4732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:10:50 (4364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2856, selfPID=2856, iMonCtr=2 07:12:11 (3364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:14:22 (6112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:56:58 (4328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:06:31 (2648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=4492, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5720, selfPID=5720, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5720, selfPID=1132, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_xpjv_2002_1_006995123_1_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xpjv_2002_1_006995123_1_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xpjv_2002_1_006995123_1_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xpjv_2002_1_006995123_1_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xpjv_2002_1_006995123_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xpjv_2002_1_006995123_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xpjv_2002_1_006995123_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xpjv_2002_1_006995123_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xpjv_2002_1_006995123_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_xpjv_2002_1_006995123_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
04 Jul 2012 21:46:20 | 1122757 | 14852981 | hadam3p_eu_xpjv_2002_1_006995123_1 | 23,136 | 154,604 | 6.6824 |
03 Jul 2012 15:29:17 | 1122757 | 14852981 | hadam3p_eu_xpjv_2002_1_006995123_1 | 11,616 | 84,333 | 7.2601 |
©2024 climateprediction.net