Name | hadam3p_eu_xdtw_1979_1_006965532_0 |
Workunit | 7168848 |
Created | 23 Nov 2010, 11:24:27 UTC |
Sent | 31 Jan 2011, 0:05:55 UTC |
Report deadline | 13 Jan 2012, 5:25:55 UTC |
Received | 12 Feb 2011, 12:19:15 UTC |
Server state | Over |
Outcome | Didn't need |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1130955 |
Run time | 6 days 4 hours 28 min 26 sec |
CPU time | 5 days 12 hours 59 min 9 sec |
Validate state | Invalid |
Credit | 1,988.94 |
Device peak FLOPS | 1.53 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.08 windows_intelx86 |
Stderr |
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
19:51:52 (4416): No heartbeat from core client for 30 sec - exiting
19:51:53 (4416): No heartbeat from core client for 30 sec - exiting
19:51:54 (4416): No heartbeat from core client for 30 sec - exiting
19:51:55 (4416): No heartbeat from core client for 30 sec - exiting
19:51:56 (4416): No heartbeat from core client for 30 sec - exiting
19:51:58 (4416): No heartbeat from core client for 30 sec - exiting
19:51:59 (4416): No heartbeat from core client for 30 sec - exiting
19:52:00 (4416): No heartbeat from core client for 30 sec - exiting
19:52:01 (4416): No heartbeat from core client for 30 sec - exiting
19:52:02 (4416): No heartbeat from core client for 30 sec - exiting
19:52:03 (4416): No heartbeat from core client for 30 sec - exiting
19:52:04 (4416): No heartbeat from core client for 30 sec - exiting
19:52:05 (4416): No heartbeat from core client for 30 sec - exiting
19:52:06 (4416): No heartbeat from core client for 30 sec - exiting
19:52:07 (4416): No heartbeat from core client for 30 sec - exiting
19:52:08 (4416): No heartbeat from core client for 30 sec - exiting
19:52:10 (4416): No heartbeat from core client for 30 sec - exiting
19:52:11 (4416): No heartbeat from core client for 30 sec - exiting
19:52:12 (4416): No heartbeat from core client for 30 sec - exiting
19:52:13 (4416): No heartbeat from core client for 30 sec - exiting
19:52:14 (4416): No heartbeat from core client for 30 sec - exiting
19:52:15 (4416): No heartbeat from core client for 30 sec - exiting
19:52:16 (4416): No heartbeat from core client for 30 sec - exiting
19:52:17 (4416): No heartbeat from core client for 30 sec - exiting
19:52:18 (4416): No heartbeat from core client for 30 sec - exiting
19:52:19 (4416): No heartbeat from core client for 30 sec - exiting
19:52:20 (4416): No heartbeat from core client for 30 sec - exiting
19:52:22 (4416): No heartbeat from core client for 30 sec - exiting
19:52:23 (4416): No heartbeat from core client for 30 sec - exiting
19:52:24 (4416): No heartbeat from core client for 30 sec - exiting
19:52:25 (4416): No heartbeat from core client for 30 sec - exiting
19:52:26 (4416): No heartbeat from core client for 30 sec - exiting
19:52:27 (4416): No heartbeat from core client for 30 sec - exiting
19:52:28 (4416): No heartbeat from core client for 30 sec - exiting
19:52:29 (4416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:52:30 (4416): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2592, selfPID=3348, iMonCtr=1
forrtl: Access is denied.
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=3100, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4568, selfPID=4824, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4932, selfPID=4948, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4668, selfPID=4668, iMonCtr=1
15:19:08 (1744): Can't acquire lockfile (32) - waiting 35s
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4668, selfPID=5892, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5168, selfPID=5168, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5168, selfPID=3660, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2196, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
03:20:46 (2196): No heartbeat from core client for 30 sec - exiting
02:10:12 (332): No heartbeat from core client for 30 sec - exiting
02:10:13 (332): No heartbeat from core client for 30 sec - exiting
02:10:14 (332): No heartbeat from core client for 30 sec - exiting
02:10:15 (332): No heartbeat from core client for 30 sec - exiting
02:10:16 (332): No heartbeat from core client for 30 sec - exiting
02:10:17 (332): No heartbeat from core client for 30 sec - exiting
02:10:18 (332): No heartbeat from core client for 30 sec - exiting
02:10:19 (332): No heartbeat from core client for 30 sec - exiting
02:10:20 (332): No heartbeat from core client for 30 sec - exiting
02:10:21 (332): No heartbeat from core client for 30 sec - exiting
02:10:23 (332): No heartbeat from core client for 30 sec - exiting
02:10:24 (332): No heartbeat from core client for 30 sec - exiting
02:10:25 (332): No heartbeat from core client for 30 sec - exiting
02:10:26 (332): No heartbeat from core client for 30 sec - exiting
02:10:27 (332): No heartbeat from core client for 30 sec - exiting
02:10:28 (332): No heartbeat from core client for 30 sec - exiting
02:10:29 (332): No heartbeat from core client for 30 sec - exiting
02:10:30 (332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:10:31 (332): No heartbeat from core client for 30 sec - exiting
02:10:32 (332): No heartbeat from core client for 30 sec - exiting
02:10:33 (332): No heartbeat from core client for 30 sec - exiting
02:10:35 (332): No heartbeat from core client for 30 sec - exiting
02:10:36 (332): No heartbeat from core client for 30 sec - exiting
02:10:37 (332): No heartbeat from core client for 30 sec - exiting
02:10:38 (332): No heartbeat from core client for 30 sec - exiting
02:10:39 (332): No heartbeat from core client for 30 sec - exiting
02:10:40 (332): No heartbeat from core client for 30 sec - exiting
02:10:41 (332): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4512, selfPID=5040, iMonCtr=1
forrtl: Access is denied.
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5580, selfPID=5580, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5580, selfPID=5376, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
02:11:26 (5376): called boinc_finish
</stderr_txt>
<message>
<file_xfer_error>
<file_name>hadam3p_eu_xdtw_1979_1_006965532_0_11.zip</file_name>
<error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>hadam3p_eu_xdtw_1979_1_006965532_0_12.zip</file_name>
<error_code>-161</error_code>
</file_xfer_error>
</message>
]]> |
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
10 Feb 2011 09:02:40 | 1130955 | 12247892 | hadam3p_eu_xdtw_1979_1_006965532_0 | 115,296 | 471,720 | 4.0914 |
09 Feb 2011 19:10:42 | 1130955 | 12247892 | hadam3p_eu_xdtw_1979_1_006965532_0 | 103,776 | 425,203 | 4.0973 |
09 Feb 2011 12:30:27 | 1130955 | 12247892 | hadam3p_eu_xdtw_1979_1_006965532_0 | 92,256 | 379,772 | 4.1165 |
08 Feb 2011 13:48:34 | 1130955 | 12247892 | hadam3p_eu_xdtw_1979_1_006965532_0 | 80,736 | 333,116 | 4.1260 |
07 Feb 2011 23:45:17 | 1130955 | 12247892 | hadam3p_eu_xdtw_1979_1_006965532_0 | 69,216 | 287,420 | 4.1525 |
07 Feb 2011 12:18:38 | 1130955 | 12247892 | hadam3p_eu_xdtw_1979_1_006965532_0 | 57,696 | 240,128 | 4.1620 |
06 Feb 2011 11:19:26 | 1130955 | 12247892 | hadam3p_eu_xdtw_1979_1_006965532_0 | 46,176 | 192,673 | 4.1726 |
01 Feb 2011 23:16:19 | 1130955 | 12247892 | hadam3p_eu_xdtw_1979_1_006965532_0 | 34,656 | 144,783 | 4.1777 |
01 Feb 2011 13:02:23 | 1130955 | 12247892 | hadam3p_eu_xdtw_1979_1_006965532_0 | 23,136 | 96,910 | 4.1887 |
31 Jan 2011 18:18:36 | 1130955 | 12247892 | hadam3p_eu_xdtw_1979_1_006965532_0 | 11,616 | 49,506 | 4.2619 |
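
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. That reading is an inference from the figures in the table, not something the page states. A minimal Python sketch that checks it against three of the rows (the function name `seconds_per_timestep` is hypothetical):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (115_296, 471_720, 4.0914),
    (103_776, 425_203, 4.0973),
    (11_616, 49_506, 4.2619),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places.

    Assumption: this is how the page derives "Average (sec/TS)".
    """
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = seconds_per_timestep(cpu_time_sec, timestep)
        print(f"timestep={timestep} computed={computed} reported={reported}")
```

Under this reading, the slow decline in the average from the earliest trickle to the latest would reflect later timesteps costing slightly less CPU time than the first batch.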