Name | hadam3p_eu_qg7t_2011_1_008346806_1 |
Workunit | 8497667 |
Created | 5 Apr 2013, 23:54:22 UTC |
Sent | 5 Apr 2013, 23:55:42 UTC |
Report deadline | 19 Mar 2014, 5:15:42 UTC |
Received | 12 Apr 2013, 23:21:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1227741 |
Run time | 3 days 11 hours 29 min 54 sec |
CPU time | 3 days 3 hours 6 min 52 sec |
Validate state | Invalid |
Credit | 1,392.75 |
Device peak FLOPS | 1.98 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> 07:09:49 (5908): No heartbeat from core client for 30 sec - exiting 07:09:50 (5908): No heartbeat from core client for 30 sec - exiting 07:09:51 (5908): No heartbeat from core client for 30 sec - exiting 07:09:52 (5908): No heartbeat from core client for 30 sec - exiting 07:09:53 (5908): No heartbeat from core client for 30 sec - exiting 07:09:54 (5908): No heartbeat from core client for 30 sec - exiting 07:09:55 (5908): No heartbeat from core client for 30 sec - exiting 07:09:56 (5908): No heartbeat from core client for 30 sec - exiting 07:09:57 (5908): No heartbeat from core client for 30 sec - exiting 07:09:58 (5908): No heartbeat from core client for 30 sec - exiting 07:09:59 (5908): No heartbeat from core client for 30 sec - exiting 07:10:00 (5908): No heartbeat from core client for 30 sec - exiting 07:10:01 (5908): No heartbeat from core client for 30 sec - exiting 07:10:02 (5908): No heartbeat from core client for 30 sec - exiting 07:10:03 (5908): No heartbeat from core client for 30 sec - exiting 07:10:04 (5908): No heartbeat from core client for 30 sec - exiting 07:10:05 (5908): No heartbeat from core client for 30 sec - exiting 07:10:06 (5908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8336, selfPID=4604, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish CPDN Monitor - Quit request from BOINC... 16:34:51 (4608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3384, selfPID=940, iMonCtr=1 Model crash detected, will try to restart... 16:55:22 (5432): No heartbeat from core client for 30 sec - exiting 16:55:23 (5432): No heartbeat from core client for 30 sec - exiting 16:55:24 (5432): No heartbeat from core client for 30 sec - exiting 16:55:25 (5432): No heartbeat from core client for 30 sec - exiting 16:55:26 (5432): No heartbeat from core client for 30 sec - exiting 16:55:27 (5432): No heartbeat from core client for 30 sec - exiting 16:55:28 (5432): No heartbeat from core client for 30 sec - exiting 16:55:29 (5432): No heartbeat from core client for 30 sec - exiting 16:55:30 (5432): No heartbeat from core client for 30 sec - exiting 16:55:31 (5432): No heartbeat from core client for 30 sec - exiting 16:55:32 (5432): No heartbeat from core client for 30 sec - exiting 16:55:33 (5432): No heartbeat from core client for 30 sec - exiting 16:55:34 (5432): No heartbeat from core client for 30 sec - exiting 16:55:35 (5432): No heartbeat from core client for 30 sec - exiting 16:55:36 (5432): No heartbeat from core client for 30 sec - exiting 16:55:37 (5432): No heartbeat from core client for 30 sec - exiting 16:55:38 (5432): No heartbeat from core client for 30 sec - exiting 16:55:39 (5432): No heartbeat from core client for 30 sec - exiting 16:55:40 (5432): No heartbeat from core client for 30 sec - exiting 16:55:41 (5432): No heartbeat from core client for 30 sec - exiting 16:55:42 (5432): No heartbeat from core client for 30 sec - exiting 16:55:43 (5432): No heartbeat from core client for 30 sec - exiting 16:55:44 (5432): No heartbeat from core client for 30 sec - exiting 16:55:45 (5432): No heartbeat from core client for 30 sec - exiting 16:55:46 (5432): No heartbeat from core client for 30 sec - exiting 16:55:47 (5432): No heartbeat from core client for 30 sec - exiting 16:55:48 (5432): No heartbeat from core client for 30 sec - exiting 16:55:49 (5432): No heartbeat from core client for 30 sec - exiting 16:55:50 (5432): No heartbeat from core client for 30 sec - exiting 16:55:51 (5432): No heartbeat from core client for 30 sec - exiting 16:55:52 (5432): No heartbeat from core client for 30 sec - exiting 16:55:53 (5432): No heartbeat from core client for 30 sec - exiting 16:55:54 (5432): No heartbeat from core client for 30 sec - exiting 16:55:55 (5432): No heartbeat from core client for 30 sec - exiting 16:55:56 (5432): No heartbeat from core client for 30 sec - exiting 16:55:57 (5432): No heartbeat from core client for 30 sec - exiting 16:55:58 (5432): No heartbeat from core client for 30 sec - exiting 16:55:59 (5432): No heartbeat from core client for 30 sec - exiting 16:56:00 (5432): No heartbeat from core client for 30 sec - exiting 16:56:01 (5432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_qg7t_2011_1_008346806_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_qg7t_2011_1_008346806_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_qg7t_2011_1_008346806_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_qg7t_2011_1_008346806_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_qg7t_2011_1_008346806_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
12 Apr 2013 11:55:28 | 1227741 | 15710900 | hadam3p_eu_qg7t_2011_1_008346806_1 | 80,736 | 245,447 | 3.0401 |
12 Apr 2013 02:37:12 | 1227741 | 15710900 | hadam3p_eu_qg7t_2011_1_008346806_1 | 69,216 | 211,222 | 3.0516 |
11 Apr 2013 09:14:09 | 1227741 | 15710900 | hadam3p_eu_qg7t_2011_1_008346806_1 | 57,696 | 176,569 | 3.0603 |
10 Apr 2013 22:55:47 | 1227741 | 15710900 | hadam3p_eu_qg7t_2011_1_008346806_1 | 46,176 | 141,918 | 3.0734 |
10 Apr 2013 09:53:37 | 1227741 | 15710900 | hadam3p_eu_qg7t_2011_1_008346806_1 | 34,656 | 106,310 | 3.0676 |
09 Apr 2013 23:34:34 | 1227741 | 15710900 | hadam3p_eu_qg7t_2011_1_008346806_1 | 23,136 | 70,732 | 3.0572 |
09 Apr 2013 09:10:07 | 1227741 | 15710900 | hadam3p_eu_qg7t_2011_1_008346806_1 | 11,616 | 35,624 | 3.0668 |
©2024 cpdn.org