Name | hadam3p_saf_1fax_1968_1_006947665_0 |
Workunit | 7150981 |
Created | 22 Nov 2010, 16:30:32 UTC |
Sent | 9 Mar 2011, 0:30:37 UTC |
Report deadline | 19 Feb 2012, 5:50:37 UTC |
Received | 12 Mar 2011, 14:33:51 UTC |
Server state | Over |
Outcome | No reply |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1191470 |
Run time | 2 days 12 hours 32 min 51 sec |
CPU time | 2 days 8 hours 26 min 54 sec |
Validate state | Invalid |
Credit | 1,683.45 |
Device peak FLOPS | 3.69 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> 12:53:10 (668): No heartbeat from core client for 30 sec - exiting 12:53:11 (668): No heartbeat from core client for 30 sec - exiting 12:53:12 (668): No heartbeat from core client for 30 sec - exiting 12:53:13 (668): No heartbeat from core client for 30 sec - exiting 12:53:14 (668): No heartbeat from core client for 30 sec - exiting 12:53:15 (668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:08:59 (1096): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:09:00 (1096): No heartbeat from core client for 30 sec - exiting 13:09:01 (1096): No heartbeat from core client for 30 sec - exiting 13:09:02 (1096): No heartbeat from core client for 30 sec - exiting 13:09:03 (1096): No heartbeat from core client for 30 sec - exiting 13:09:04 (1096): No heartbeat from core client for 30 sec - exiting 13:09:05 (1096): No heartbeat from core client for 30 sec - exiting 13:09:06 (1096): No heartbeat from core client for 30 sec - exiting 13:09:07 (1096): No heartbeat from core client for 30 sec - exiting 13:09:08 (1096): No heartbeat from core client for 30 sec - exiting 13:09:09 (1096): No heartbeat from core client for 30 sec - exiting 13:16:17 (616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3728, selfPID=3728, iMonCtr=2 13:32:15 (3320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:32:16 (3320): No heartbeat from core client for 30 sec - exiting 13:32:17 (3320): No heartbeat from core client for 30 sec - exiting 13:32:18 (3320): No heartbeat from core client for 30 sec - exiting 13:32:19 (3320): No heartbeat from core client for 30 sec - exiting 13:32:20 (3320): No heartbeat from core client for 30 sec - exiting 13:32:21 (3320): No heartbeat from core client for 30 sec - exiting 13:32:22 (3320): No heartbeat from core client for 30 sec - exiting 13:32:23 (3320): No heartbeat from core client for 30 sec - exiting 13:32:24 (3320): No heartbeat from core client for 30 sec - exiting 13:32:25 (3320): No heartbeat from core client for 30 sec - exiting 13:44:46 (3068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:01:32 (3588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:14:08 (4060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:16:31 (1140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3144, selfPID=3144, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 04:14:01 (3080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:14:02 (3080): No heartbeat from core client for 30 sec - exiting 04:14:03 (3080): No heartbeat from core client for 30 sec - exiting 04:14:04 (3080): No heartbeat from core client for 30 sec - exiting 04:14:06 (3080): No heartbeat from core client for 30 sec - exiting 04:14:07 (3080): No heartbeat from core client for 30 sec - exiting 04:14:08 (3080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:20:21 (3956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker::14:07:12 (2888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:03:41 (3272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:20:51 (2168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:30:31 (3960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:30:32 (3960): No heartbeat from core client for 30 sec - exiting Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048 Leaving CPDN_Main::Monitor... 01:32:13 (3404): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_1fax_1968_1_006947665_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1fax_1968_1_006947665_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1fax_1968_1_006947665_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
12 Mar 2011 13:09:07 | 972612 | 12228218 | hadam3p_saf_1fax_1968_1_006947665_0 | 103,776 | 199,925 | 1.9265 |
12 Mar 2011 04:45:16 | 972612 | 12228218 | hadam3p_saf_1fax_1968_1_006947665_0 | 92,256 | 178,372 | 1.9334 |
11 Mar 2011 20:21:27 | 972612 | 12228218 | hadam3p_saf_1fax_1968_1_006947665_0 | 80,736 | 156,636 | 1.9401 |
11 Mar 2011 12:22:22 | 972612 | 12228218 | hadam3p_saf_1fax_1968_1_006947665_0 | 69,216 | 134,746 | 1.9467 |
10 Mar 2011 22:15:43 | 972612 | 12228218 | hadam3p_saf_1fax_1968_1_006947665_0 | 57,696 | 112,577 | 1.9512 |
10 Mar 2011 14:54:51 | 972612 | 12228218 | hadam3p_saf_1fax_1968_1_006947665_0 | 46,176 | 90,672 | 1.9636 |
10 Mar 2011 05:25:58 | 972612 | 12228218 | hadam3p_saf_1fax_1968_1_006947665_0 | 34,656 | 68,123 | 1.9657 |
10 Mar 2011 00:55:12 | 972612 | 12228218 | hadam3p_saf_1fax_1968_1_006947665_0 | 23,136 | 45,660 | 1.9735 |
10 Mar 2011 00:55:12 | 972612 | 12228218 | hadam3p_saf_1fax_1968_1_006947665_0 | 11,616 | 23,459 | 2.0195 |
©2024 cpdn.org