Name | hadam3p_pnw_2x0j_1964_1_007176283_0 |
Workunit | 7374565 |
Created | 22 Feb 2011, 11:30:35 UTC |
Sent | 10 Mar 2011, 1:25:45 UTC |
Report deadline | 20 Feb 2012, 6:45:45 UTC |
Received | 17 Mar 2011, 20:47:24 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1191470 |
Run time | 3 days 0 hours 39 min 10 sec |
CPU time | 2 days 17 hours 41 min 47 sec |
Validate state | Invalid |
Credit | 2,004.61 |
Device peak FLOPS | 3.69 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> 03:49:44 (1044): No heartbeat from core client for 30 sec - exiting 03:49:45 (1044): No heartbeat from core client for 30 sec - exiting 03:49:47 (1044): No heartbeat from core client for 30 sec - exiting 03:49:48 (1044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:54:31 (3980): No heartbeat from core client for 30 sec - exiting 03:54:32 (3980): No heartbeat from core client for 30 sec - exiting 03:54:33 (3980): No heartbeat from core client for 30 sec - exiting 03:54:34 (3980): No heartbeat from core client for 30 sec - exiting 03:54:35 (3980): No heartbeat from core client for 30 sec - exiting 03:54:36 (3980): No heartbeat from core client for 30 sec - exiting 03:54:37 (3980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:54:38 (3980): No heartbeat from core client for 30 sec - exiting 03:54:39 (3980): No heartbeat from core client for 30 sec - exiting 03:54:41 (3980): No heartbeat from core client for 30 sec - exiting 03:54:42 (3980): No heartbeat from core client for 30 sec - exiting 03:54:43 (3980): No heartbeat from core client for 30 sec - exiting 03:54:44 (3980): No heartbeat from core client for 30 sec - exiting 03:54:45 (3980): No heartbeat from core client for 30 sec - exiting 03:54:46 (3980): No heartbeat from core client for 30 sec - exiting 03:54:48 (3980): No heartbeat from core client for 30 sec - exiting 03:54:49 (3980): No heartbeat from core client for 30 sec - exiting 04:11:17 (3624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:24:19 (3368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:49:59 (3112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:49:24 (3128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:01:14 (3928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:02:37 (3244): No heartbeat from core client for 30 sec - exiting 08:02:38 (3244): No heartbeat from core client for 30 sec - exiting 08:02:39 (3244): No heartbeat from core client for 30 sec - exiting 08:02:40 (3244): No heartbeat from core client for 30 sec - exiting 08:02:42 (3244): No heartbeat from core client for 30 sec - exiting 08:02:43 (3244): No heartbeat from core client for 30 sec - exiting 08:02:44 (3244): No heartbeat from core client for 30 sec - exiting 08:02:45 (3244): No heartbeat from core client for 30 sec - exiting 08:02:46 (3244): No heartbeat from core client for 30 sec - exiting 08:02:47 (3244): No heartbeat from core client for 30 sec - exiting 08:02:48 (3244): No heartbeat from core client for 30 sec - exiting 08:02:49 (3244): No heartbeat from core client for 30 sec - exiting 08:02:50 (3244): No heartbeat from core client for 30 sec - exiting 08:02:51 (3244): No heartbeat from core client for 30 sec - exiting 08:02:52 (3244): No heartbeat from core client for 30 sec - exiting 08:02:54 (3244): No heartbeat from core client for 30 sec - exiting 08:02:55 (3244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:18:22 (784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:47:22 (2168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:03:51 (3920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:20:10 (3244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional WorkCPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3572, selfPID=3572, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CRegional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3156, selfPID=3828, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3156, selfPID=3156, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3588, selfPID=3588, iMonCtr=2 17:57:05 (4092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:57:06 (4092): No heartbeat from core client for 30 sec - exiting 17:57:07 (4092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:24:24 (984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:25:53 (3020): No heartbeat from core client for 30 sec - exiting 00:25:54 (3020): No heartbeat from core client for 30 sec - exiting 00:25:55 (3020): No heartbeat from core client for 30 sec - exiting 00:25:56 (3020): No heartbeat from core client for 30 sec - exiting 00:25:57 (3020): No heartbeat from core client for 30 sec - exiting 00:25:58 (3020): No heartbeat from core client for 30 sec - exiting 00:25:59 (3020): No heartbeat from core client for 30 sec - exiting 00:26:00 (3020): No heartbeat from core client for 30 sec - exiting 00:26:01 (3020): No heartbeat from core client for 30 sec - exiting 00:26:03 (3020): No heartbeat from core client for 30 sec - exiting 00:26:04 (3020): No heartbeat from core client for 30 sec - exiting 00:26:05 (3020): No heartbeat from core client for 30 sec - exiting 00:26:06 (3020): No heartbeat from core client for 30 sec - exiting 00:26:07 (3020): No heartbeat from core client for 30 sec - exiting 00:26:08 (3020): No heartbeat from core client for 30 sec - exiting 00:26:09 (3020): No heartbeat from core client for 30 sec - exiting 00:26:10 (3020): No heartbeat from core client for 30 sec - exiting 00:26:11 (3020): No heartbeat from core client for 30 sec - exiting 00:26:12 (3020): No heartbeat from core client for 30 sec - exiting 00:26:14 (3020): No heartbeat from core client for 30 sec - exiting 00:26:15 (3020): No heartbeat from core client for 30 sec - exiting 00:26:16 (3020): No heartbeat from core client for 30 sec - exiting 00:26:17 (3020): No heartbeat from core client for 30 sec - exiting 00:26:18 (3020): No heartbeat from core client for 30 sec - exiting 00:26:19 (3020): No heartbeat from core client for 30 sec - exiting 00:26:20 (3020): No heartbeat from core client for 30 sec - exiting 00:26:21 (3020): No heartbeat from core client for 30 sec - exiting 00:26:22 (3020): No heartbeat from core client for 30 sec - exiting 00:26:23 (3020): No heartbeat from core client for 30 sec - exiting 00:26:24 (3020): No heartbeat from core client for 30 sec - exiting 00:26:26 (3020): No heartbeat from core client for 30 sec - exiting 00:26:27 (3020): No heartbeat from core client for 30 sec - exiting 00:26:28 (3020): No heartbeat from core client for 30 sec - exiting 00:26:29 (3020): No heartbeat from core client for 30 sec - exiting 00:26:30 (3020): No heartbeat from core client for 30 sec - exiting 00:26:31 (3020): No heartbeat from core client for 30 sec - exiting 00:26:32 (3020): No heartbeat from core client for 30 sec - exiting 00:26:33 (3020): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=2356, iMonCtr=1 CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:49:42 (888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:49:43 (888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2476, selfPID=2476, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=96, selfPID=96, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:27:25 (3700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:27:26 (3700): No heartbeat from core client for 30 sec - exiting 07:27:27 (3700): No heartbeat from core client for 30 sec - exiting 07:27:28 (3700): No heartbeat from core client for 30 sec - exiting 07:27:29 (3700): No heartbeat from core client for 30 sec - exiting 07:27:30 (3700): No heartbeat from core client for 30 sec - exiting 07:27:31 (3700): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=660, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1520, selfPID=3948, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 8 07:28:51 (3948): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_pnw_2x0j_1964_1_007176283_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_2x0j_1964_1_007176283_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_2x0j_1964_1_007176283_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_2x0j_1964_1_007176283_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
17 Mar 2011 12:57:43 | 972612 | 12615569 | hadam3p_pnw_2x0j_1964_1_007176283_0 | 92,256 | 218,089 | 2.3640 |
17 Mar 2011 02:21:25 | 972612 | 12615569 | hadam3p_pnw_2x0j_1964_1_007176283_0 | 80,736 | 191,039 | 2.3662 |
16 Mar 2011 18:37:14 | 972612 | 12615569 | hadam3p_pnw_2x0j_1964_1_007176283_0 | 69,216 | 164,403 | 2.3752 |
16 Mar 2011 09:48:36 | 972612 | 12615569 | hadam3p_pnw_2x0j_1964_1_007176283_0 | 57,696 | 137,273 | 2.3792 |
16 Mar 2011 00:25:29 | 972612 | 12615569 | hadam3p_pnw_2x0j_1964_1_007176283_0 | 46,176 | 109,913 | 2.3803 |
15 Mar 2011 14:47:52 | 972612 | 12615569 | hadam3p_pnw_2x0j_1964_1_007176283_0 | 34,656 | 82,446 | 2.3790 |
15 Mar 2011 04:16:43 | 972612 | 12615569 | hadam3p_pnw_2x0j_1964_1_007176283_0 | 23,136 | 55,179 | 2.3850 |
13 Mar 2011 04:52:00 | 972612 | 12615569 | hadam3p_pnw_2x0j_1964_1_007176283_0 | 11,616 | 27,750 | 2.3889 |
©2024 cpdn.org