Name | hadam3p_pnw_azty_1981_1_007885513_1 |
Workunit | 8040625 |
Created | 17 Apr 2012, 10:30:49 UTC |
Sent | 17 Apr 2012, 10:39:17 UTC |
Report deadline | 30 Mar 2013, 15:59:17 UTC |
Received | 22 May 2012, 7:57:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 910262 |
Run time | 2 days 17 hours 50 min 45 sec |
CPU time | 2 days 10 hours 31 min 16 sec |
Validate state | Invalid |
Credit | 1,003.35 |
Device peak FLOPS | 2.59 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... 09:08:12 (4400): No heartbeat from core client for 30 sec - exiting 09:08:13 (4400): No heartbeat from core client for 30 sec - exiting 09:08:14 (4400): No heartbeat from core client for 30 sec - exiting 09:08:15 (4400): No heartbeat from core client for 30 sec - exiting 09:08:16 (4400): No heartbeat from core client for 30 sec - exiting 09:08:17 (4400): No heartbeat from core client for 30 sec - exiting 09:08:18 (4400): No heartbeat from core client for 30 sec - exiting 09:08:19 (4400): No heartbeat from core client for 30 sec - exiting 09:08:20 (4400): No heartbeat from core client for 30 sec - exiting 09:08:21 (4400): No heartbeat from core client for 30 sec - exiting 09:08:22 (4400): No heartbeat from core client for 30 sec - exiting 09:08:23 (4400): No heartbeat from core client for 30 sec - exiting 09:08:24 (4400): No heartbeat from core client for 30 sec - exiting 09:08:25 (4400): No heartbeat from core client for 30 sec - exiting 09:08:26 (4400): No heartbeat from core client for 30 sec - exiting 09:08:27 (4400): No heartbeat from core client for 30 sec - exiting 09:08:28 (4400): No heartbeat from core client for 30 sec - exiting 09:08:29 (4400): No heartbeat from core client for 30 sec - exiting 09:08:30 (4400): No heartbeat from core client for 30 sec - exiting 09:08:31 (4400): No heartbeat from core client for 30 sec - exiting 09:08:32 (4400): No heartbeat from core client for 30 sec - exiting 09:08:33 (4400): No heartbeat from core client for 30 sec - exiting 09:08:34 (4400): No heartbeat from core client for 30 sec - exiting 09:08:35 (4400): No heartbeat from core client for 30 sec - exiting 09:08:36 (4400): No heartbeat from core client for 30 sec - exiting 09:08:37 (4400): No heartbeat from core client for 30 sec - exiting 09:08:38 (4400): No heartbeat from core client for 30 sec - exiting 09:08:39 (4400): No heartbeat from core client for 30 sec - exiting 09:08:40 (4400): No heartbeat from core client for 30 sec - exiting 09:08:41 (4400): No heartbeat from core client for 30 sec - exiting 09:08:42 (4400): No heartbeat from core client for 30 sec - exiting 09:08:43 (4400): No heartbeat from core client for 30 sec - exiting 09:08:44 (4400): No heartbeat from core client for 30 sec - exiting 09:08:45 (4400): No heartbeat from core client for 30 sec - exiting 09:08:46 (4400): No heartbeat from core client for 30 sec - exiting 09:08:47 (4400): No heartbeat from core client for 30 sec - exiting 09:08:49 (4400): No heartbeat from core client for 30 sec - exiting 09:08:50 (4400): No heartbeat from core client for 30 sec - exiting 09:08:51 (4400): No heartbeat from core client for 30 sec - exiting 09:08:52 (4400): No heartbeat from core client for 30 sec - exiting 09:08:53 (4400): No heartbeat from core client for 30 sec - exiting 09:08:54 (4400): No heartbeat from core client for 30 sec - exiting 09:08:55 (4400): No heartbeat from core client for 30 sec - exiting 09:08:56 (4400): No heartbeat from core client for 30 sec - exiting 09:08:57 (4400): No heartbeat from core client for 30 sec - exiting 09:08:58 (4400): No heartbeat from core client for 30 sec - exiting 09:08:59 (4400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6324, selfPID=6324, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1472, selfPID=1472, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5976, selfPID=5976, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:50:19 (5864): No heartbeat from core client for 30 sec - exiting 08:50:20 (5864): No heartbeat from core client for 30 sec - exiting 08:50:21 (5864): No heartbeat from core client for 30 sec - exiting 08:50:22 (5864): No heartbeat from core client for 30 sec - exiting 08:50:23 (5864): No heartbeat from core client for 30 sec - exiting 08:50:24 (5864): No heartbeat from core client for 30 sec - exiting 08:50:25 (5864): No heartbeat from core client for 30 sec - exiting 08:50:26 (5864): No heartbeat from core client for 30 sec - exiting 08:50:27 (5864): No heartbeat from core client for 30 sec - exiting 08:50:28 (5864): No heartbeat from core client for 30 sec - exiting 08:50:29 (5864): No heartbeat from core client for 30 sec - exiting 08:50:30 (5864): No heartbeat from core client for 30 sec - exiting 08:50:31 (5864): No heartbeat from core client for 30 sec - exiting 08:50:32 (5864): No heartbeat from core client for 30 sec - exiting 08:50:34 (5864): No heartbeat from core client for 30 sec - exiting 08:50:35 (5864): No heartbeat from core client for 30 sec - exiting 08:50:36 (5864): No heartbeat from core client for 30 sec - exiting 08:50:37 (5864): No heartbeat from core client for 30 sec - exiting 08:50:38 (5864): No heartbeat from core client for 30 sec - exiting 08:50:39 (5864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6272, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4900, selfPID=4900, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4712, selfPID=4712, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2100, selfPID=2100, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5712, selfPID=6508, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_azty_1981_1_007885513_1_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_azty_1981_1_007885513_1_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_azty_1981_1_007885513_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_azty_1981_1_007885513_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_azty_1981_1_007885513_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_azty_1981_1_007885513_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_azty_1981_1_007885513_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_azty_1981_1_007885513_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
21 May 2012 06:30:19 | 910262 | 14430634 | hadam3p_pnw_azty_1981_1_007885513_1 | 46,176 | 205,974 | 4.4606 |
20 May 2012 19:28:22 | 910262 | 14430634 | hadam3p_pnw_azty_1981_1_007885513_1 | 34,656 | 161,800 | 4.6687 |
06 May 2012 12:48:44 | 910262 | 14430634 | hadam3p_pnw_azty_1981_1_007885513_1 | 23,136 | 107,926 | 4.6649 |
04 May 2012 05:00:52 | 910262 | 14430634 | hadam3p_pnw_azty_1981_1_007885513_1 | 11,616 | 48,459 | 4.1717 |
©2024 climateprediction.net