Name | hadam3p_saf_255y_1981_1_007038782_1 |
Workunit | 7242098 |
Created | 1 Feb 2011, 17:00:53 UTC |
Sent | 16 Feb 2011, 8:43:29 UTC |
Report deadline | 29 Jan 2012, 14:03:29 UTC |
Received | 8 Mar 2011, 9:58:08 UTC |
Server state | Over |
Outcome | No reply |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1119858 |
Run time | 23 hours 42 min 53 sec |
CPU time | 17 hours 19 min 27 sec |
Validate state | Invalid |
Credit | 375.31 |
Device peak FLOPS | 2.92 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4040, selfPID=4416, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1476, selfPID=1476, iMonCtr=2 09:55:08 (3892): No heartbeat from core client for 30 sec - exiting 09:55:09 (3892): No heartbeat from core client for 30 sec - exiting 09:55:10 (3892): No heartbeat from core client for 30 sec - exiting 09:55:11 (3892): No heartbeat from core client for 30 sec - exiting 09:55:12 (3892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3928, selfPID=3928, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 09:35:05 (3992): No heartbeat from core client for 30 sec - exiting 09:35:06 (3992): No heartbeat from core client for 30 sec - exiting 09:35:08 (3992): No heartbeat from core client for 30 sec - exiting 09:35:09 (3992): No heartbeat from core client for 30 sec - exiting 09:35:10 (3992): No heartbeat from core client for 30 sec - exiting 09:35:11 (3992): No heartbeat from core client for 30 sec - exiting 09:35:12 (3992): No heartbeat from core client for 30 sec - exiting 09:35:13 (3992): No heartbeat from core client for 30 sec - exiting 09:35:14 (3992): No heartbeat from core client for 30 sec - exiting 09:35:15 (3992): No heartbeat from core client for 30 sec - exiting 09:35:16 (3992): No heartbeat from core client for 30 sec - exiting 09:35:17 (3992): No heartbeat from core client for 30 sec - exiting 09:35:18 (3992): No heartbeat from core client for 30 sec - exiting 09:35:20 (3992): No heartbeat from core client for 30 sec - exiting 09:35:21 (3992): No heartbeat from core client for 30 sec - exiting 09:35:22 (3992): No heartbeat from core client for 30 sec - exiting 09:35:23 (3992): No heartbeat from core client for 30 sec - exiting 09:35:24 (3992): No heartbeat from core client for 30 sec - exiting 09:35:25 (3992): No heartbeat from core client for 30 sec - exiting 09:35:26 (3992): No heartbeat from core client for 30 sec - exiting 09:35:27 (3992): No heartbeat from core client for 30 sec - exiting 09:35:28 (3992): No heartbeat from core client for 30 sec - exiting 09:35:29 (3992): No heartbeat from core client for 30 sec - exiting 09:35:30 (3992): No heartbeat from core client for 30 sec - exiting 09:35:32 (3992): No heartbeat from core client for 30 sec - exiting 09:35:33 (3992): No heartbeat from core client for 30 sec - exiting 09:35:34 (3992): No heartbeat from core client for 30 sec - exiting 09:35:35 (3992): No heartbeat from core client for 30 sec - exiting 09:35:36 (3992): No heartbeat from core client for 30 sec - exiting 09:35:37 (3992): No heartbeat from core client for 30 sec - exiting 09:35:38 (3992): No heartbeat from core client for 30 sec - exiting 09:35:39 (3992): No heartbeat from core client for 30 sec - exiting 09:35:40 (3992): No heartbeat from core client for 30 sec - exiting 09:35:41 (3992): No heartbeat from core client for 30 sec - exiting 09:35:42 (3992): No heartbeat from core client for 30 sec - exiting 09:35:44 (3992): No heartbeat from core client for 30 sec - exiting 09:35:45 (3992): No heartbeat from core client for 30 sec - exiting 09:35:46 (3992): No heartbeat from core client for 30 sec - exiting 09:35:48 (3992): No heartbeat from core client for 30 sec - exiting 09:35:49 (3992): No heartbeat from core client for 30 sec - exiting 09:35:50 (3992): No heartbeat from core client for 30 sec - exiting 09:35:51 (3992): No heartbeat from core client for 30 sec - exiting 09:35:52 (3992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... 10:11:19 (1344): No heartbeat from core client for 30 sec - exiting 10:11:26 (1344): No heartbeat from core client for 30 sec - exiting 10:11:27 (1344): No heartbeat from core client for 30 sec - exiting 10:11:28 (1344): No heartbeat from core client for 30 sec - exiting 10:11:30 (1344): No heartbeat from core client for 30 sec - exiting 10:11:31 (1344): No heartbeat from core client for 30 sec - exiting 10:11:32 (1344): No heartbeat from core client for 30 sec - exiting 10:11:33 (1344): No heartbeat from core client for 30 sec - exiting 10:11:34 (1344): No heartbeat from core client for 30 sec - exiting 10:11:35 (1344): No heartbeat from core client for 30 sec - exiting 10:11:36 (1344): No heartbeat from core client for 30 sec - exiting 10:11:37 (1344): No heartbeat from core client for 30 sec - exiting 10:11:39 (1344): No heartbeat from core client for 30 sec - exiting 10:11:40 (1344): No heartbeat from core client for 30 sec - exiting 10:11:41 (1344): No heartbeat from core client for 30 sec - exiting 10:11:42 (1344): No heartbeat from core client for 30 sec - exiting 10:11:43 (1344): No heartbeat from core client for 30 sec - exiting 10:11:45 (1344): No heartbeat from core client for 30 sec - exiting 10:11:46 (1344): No heartbeat from core client for 30 sec - exiting 10:11:47 (1344): No heartbeat from core client for 30 sec - exiting 10:11:48 (1344): No heartbeat from core client for 30 sec - exiting 10:11:49 (1344): No heartbeat from core client for 30 sec - exiting 10:11:50 (1344): No heartbeat from core client for 30 sec - exiting 10:11:51 (1344): No heartbeat from core client for 30 sec - exiting 10:11:52 (1344): No heartbeat from core client for 30 sec - exiting 10:11:53 (1344): No heartbeat from core client for 30 sec - exiting 10:11:54 (1344): No heartbeat from core client for 30 sec - exiting 10:11:55 (1344): No heartbeat from core client for 30 sec - exiting 10:11:57 (1344): No heartbeat from core client for 30 sec - exiting 10:11:58 (1344): No heartbeat from core client for 30 sec - exiting 10:11:59 (1344): No heartbeat from core client for 30 sec - exiting 10:12:00 (1344): No heartbeat from core client for 30 sec - exiting 10:12:01 (1344): No heartbeat from core client for 30 sec - exiting 10:12:02 (1344): No heartbeat from core client for 30 sec - exiting 10:12:03 (1344): No heartbeat from core client for 30 sec - exiting 10:12:04 (1344): No heartbeat from core client for 30 sec - exiting 10:12:05 (1344): No heartbeat from core client for 30 sec - exiting 10:12:06 (1344): No heartbeat from core client for 30 sec - exiting 10:12:07 (1344): No heartbeat from core client for 30 sec - exiting 10:12:09 (1344): No heartbeat from core client for 30 sec - exiting 10:12:10 (1344): No heartbeat from core client for 30 sec - exiting 10:12:11 (1344): No heartbeat from core client for 30 sec - exiting 10:12:12 (1344): No heartbeat from core client for 30 sec - exiting 10:12:13 (1344): No heartbeat from core client for 30 sec - exiting 10:12:14 (1344): No heartbeat from core client for 30 sec - exiting 10:12:15 (1344): No heartbeat from core client for 30 sec - exiting 10:12:16 (1344): No heartbeat from core client for 30 sec - exiting 10:12:17 (1344): No heartbeat from core client for 30 sec - exiting 10:12:18 (1344): No heartbeat from core client for 30 sec - exiting 10:12:19 (1344): No heartbeat from core client for 30 sec - exiting 10:12:21 (1344): No heartbeat from core client for 30 sec - exiting 10:12:22 (1344): No heartbeat from core client for 30 sec - exiting 10:12:23 (1344): No heartbeat from core client for 30 sec - exiting 10:12:24 (1344): No heartbeat from core client for 30 sec - exiting 10:12:25 (1344): No heartbeat from core client for 30 sec - exiting 10:12:26 (1344): No heartbeat from core client for 30 sec - exiting 10:12:27 (1344): No heartbeat from core client for 30 sec - exiting 10:12:28 (1344): No heartbeat from core client for 30 sec - exiting Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048 CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048 Leaving CPDN_Main::Monitor... 10:13:25 (2204): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_255y_1981_1_007038782_1_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_255y_1981_1_007038782_1_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_255y_1981_1_007038782_1_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_255y_1981_1_007038782_1_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_255y_1981_1_007038782_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_255y_1981_1_007038782_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_255y_1981_1_007038782_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_255y_1981_1_007038782_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_255y_1981_1_007038782_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_255y_1981_1_007038782_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Mar 2011 10:05:33 | 1119858 | 12551480 | hadam3p_saf_255y_1981_1_007038782_1 | 23,136 | 38,482 | 1.6633 |
25 Feb 2011 14:43:19 | 1119858 | 12551480 | hadam3p_saf_255y_1981_1_007038782_1 | 11,616 | 13,851 | 1.1924 |
©2024 cpdn.org