Name | hadam3p_saf_10uh_2000_1_006890529_1 |
Workunit | 7093845 |
Created | 12 Apr 2011, 8:45:07 UTC |
Sent | 12 Apr 2011, 10:28:58 UTC |
Report deadline | 24 Mar 2012, 15:48:58 UTC |
Received | 30 Jul 2011, 10:04:10 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1069880 |
Run time | 2 days 1 hours 26 min 5 sec |
CPU time | 8 hours 41 min 3 sec |
Validate state | Invalid |
Credit | 935.95 |
Device peak FLOPS | 2.84 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:10:46 (5008): No heartbeat from core client for 30 sec - exiting 13:10:47 (5008): No heartbeat from core client for 30 sec - exiting 13:10:48 (5008): No heartbeat from core client for 30 sec - exiting 13:10:49 (5008): No heartbeat from core client for 30 sec - exiting 13:10:50 (5008): No heartbeat from core client for 30 sec - exiting 13:10:52 (5008): No heartbeat from core client for 30 sec - exiting 13:10:53 (5008): No heartbeat from core client for 30 sec - exiting 13:10:54 (5008): No heartbeat from core client for 30 sec - exiting 13:10:55 (5008): No heartbeat from core client for 30 sec - exiting 13:10:56 (5008): No heartbeat from core client for 30 sec - exiting 13:10:57 (5008): No heartbeat from core client for 30 sec - exiting 13:10:58 (5008): No heartbeat from core client for 30 sec - exiting 13:10:59 (5008): No heartbeat from core client for 30 sec - exiting 13:11:00 (5008): No heartbeat from core client for 30 sec - exiting 13:11:01 (5008): No heartbeat from core client for 30 sec - exiting 13:11:02 (5008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:36:59 (2296): No heartbeat from core client for 30 sec - exiting 21:37:00 (2296): No heartbeat from core client for 30 sec - exiting 21:37:01 (2296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
17:10:42 (2284): No heartbeat from core client for 30 sec - exiting 17:10:43 (2284): No heartbeat from core client for 30 sec - exiting 17:10:44 (2284): No heartbeat from core client for 30 sec - exiting 17:10:45 (2284): No heartbeat from core client for 30 sec - exiting 17:10:46 (2284): No heartbeat from core client for 30 sec - exiting 17:10:47 (2284): No heartbeat from core client for 30 sec - exiting 17:10:48 (2284): No heartbeat from core client for 30 sec - exiting 17:10:49 (2284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:59:58 (5104): No heartbeat from core client for 30 sec - exiting 13:59:59 (5104): No heartbeat from core client for 30 sec - exiting 14:00:00 (5104): No heartbeat from core client for 30 sec - exiting 14:00:01 (5104): No heartbeat from core client for 30 sec - exiting 14:00:03 (5104): No heartbeat from core client for 30 sec - exiting 14:00:04 (5104): No heartbeat from core client for 30 sec - exiting 14:00:05 (5104): No heartbeat from core client for 30 sec - exiting 14:00:06 (5104): No heartbeat from core client for 30 sec - exiting 14:00:07 (5104): No heartbeat from core client for 30 sec - exiting 14:00:08 (5104): No heartbeat from core client for 30 sec - exiting 14:00:09 (5104): No heartbeat from core client for 30 sec - exiting 14:00:10 (5104): No heartbeat from core client for 30 sec - exiting 14:00:11 (5104): No heartbeat from core client for 30 sec - exiting 14:00:12 (5104): No heartbeat from core client for 30 sec - exiting 14:00:13 (5104): No heartbeat from core client for 30 sec - exiting 14:00:15 (5104): No heartbeat from core client for 30 sec - exiting 14:00:16 (5104): No heartbeat from core client for 30 sec - exiting 14:00:17 (5104): No heartbeat from core client for 30 sec - exiting 14:00:18 (5104): No heartbeat from core client for 30 sec - exiting 14:00:19 (5104): No heartbeat from 
core client for 30 sec - exiting 14:00:20 (5104): No heartbeat from core client for 30 sec - exiting 14:00:21 (5104): No heartbeat from core client for 30 sec - exiting 14:00:22 (5104): No heartbeat from core client for 30 sec - exiting 14:00:23 (5104): No heartbeat from core client for 30 sec - exiting 14:00:24 (5104): No heartbeat from core client for 30 sec - exiting 14:00:25 (5104): No heartbeat from core client for 30 sec - exiting 14:00:27 (5104): No heartbeat from core client for 30 sec - exiting 14:00:28 (5104): No heartbeat from core client for 30 sec - exiting 14:00:29 (5104): No heartbeat from core client for 30 sec - exiting 14:00:30 (5104): No heartbeat from core client for 30 sec - exiting 14:00:31 (5104): No heartbeat from core client for 30 sec - exiting 14:00:32 (5104): No heartbeat from core client for 30 sec - exiting 14:00:33 (5104): No heartbeat from core client for 30 sec - exiting 14:00:34 (5104): No heartbeat from core client for 30 sec - exiting 14:00:35 (5104): No heartbeat from core client for 30 sec - exiting 14:00:36 (5104): No heartbeat from core client for 30 sec - exiting 14:00:38 (5104): No heartbeat from core client for 30 sec - exiting 14:00:39 (5104): No heartbeat from core client for 30 sec - exiting 14:00:40 (5104): No heartbeat from core client for 30 sec - exiting 14:00:41 (5104): No heartbeat from core client for 30 sec - exiting 14:00:42 (5104): No heartbeat from core client for 30 sec - exiting 14:00:43 (5104): No heartbeat from core client for 30 sec - exiting 14:00:44 (5104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 15:53:43 (6024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4516, selfPID=4516, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:00:33 (4380): No heartbeat from core client for 30 sec - exiting 10:00:35 (4380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... RegioCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=836, selfPID=836, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:50:30 (3348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2664, selfPID=2664, iMonCtr=2 14:04:25 (2792): No heartbeat from core client for 30 sec - exiting 14:04:26 (2792): No heartbeat from core client for 30 sec - exiting 14:04:27 (2792): No heartbeat from core client for 30 sec - exiting 14:04:28 (2792): No heartbeat from core client for 30 sec - exiting 14:04:29 (2792): No heartbeat from core client for 30 sec - exiting 14:04:30 (2792): No heartbeat from core client for 30 sec - exiting 14:04:31 (2792): No heartbeat from core client for 30 sec - exiting 14:04:32 (2792): No heartbeat from core client for 30 sec - exiting 14:04:33 (2792): No heartbeat from core client for 30 sec - exiting 14:04:34 (2792): No heartbeat from core client for 30 sec - exiting 14:04:36 (2792): No heartbeat from core client for 30 sec - exiting 14:04:37 (2792): No heartbeat from core client for 30 sec - exiting 14:04:38 (2792): No heartbeat from core client for 30 sec - exiting 14:04:39 (2792): No heartbeat from 
core client for 30 sec - exiting 14:04:40 (2792): No heartbeat from core client for 30 sec - exiting 14:04:41 (2792): No heartbeat from core client for 30 sec - exiting 14:04:42 (2792): No heartbeat from core client for 30 sec - exiting 14:04:43 (2792): No heartbeat from core client for 30 sec - exiting 14:04:44 (2792): No heartbeat from core client for 30 sec - exiting 14:04:45 (2792): No heartbeat from core client for 30 sec - exiting 14:04:46 (2792): No heartbeat from core client for 30 sec - exiting 14:04:48 (2792): No heartbeat from core client for 30 sec - exiting 14:04:49 (2792): No heartbeat from core client for 30 sec - exiting 14:04:50 (2792): No heartbeat from core client for 30 sec - exiting 14:04:51 (2792): No heartbeat from core client for 30 sec - exiting 14:04:52 (2792): No heartbeat from core client for 30 sec - exiting 14:04:53 (2792): No heartbeat from core client for 30 sec - exiting 14:04:54 (2792): No heartbeat from core client for 30 sec - exiting 14:04:55 (2792): No heartbeat from core client for 30 sec - exiting 14:04:56 (2792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6672, selfPID=6672, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3608, selfPID=3608, iMonCtr=2 21:27:47 (5224): No heartbeat from core client for 30 sec - exiting 21:27:48 (5224): No heartbeat from core client for 30 sec - exiting 13:36:41 (4412): No heartbeat from core client for 30 sec - exiting 13:36:42 (4412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=692, selfPID=3804, iMonCtr=1 Model crash detected, will try to restart... 
11:39:35 (1164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:31:06 (4880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4844, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4432, selfPID=2816, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish CCPDN Monitor - Quit request from BOINC... </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_10uh_2000_1_006890529_1_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_10uh_2000_1_006890529_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_10uh_2000_1_006890529_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_10uh_2000_1_006890529_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_10uh_2000_1_006890529_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_10uh_2000_1_006890529_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_10uh_2000_1_006890529_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
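The Stderr field above ends with a `<message>` block listing one `<file_xfer_error>` entry per result zip that failed to transfer. As a minimal sketch, assuming the field is available as a plain string, the file names and error codes could be pulled out with a regular expression. The names `XFER_ERROR`, `extract_xfer_errors`, and `sample` are hypothetical, and `sample` reproduces only the first two entries from the log.

```python
import re

# Matches one <file_xfer_error> entry in the form shown in the Stderr field.
# Whitespace between the tags is optional, as in the scraped log.
XFER_ERROR = re.compile(
    r"<file_xfer_error>\s*"
    r"<file_name>(?P<name>[^<]+)</file_name>\s*"
    r"<error_code>(?P<code>-?\d+)</error_code>\s*"
    r"</file_xfer_error>"
)


def extract_xfer_errors(stderr_text):
    """Return (file_name, error_code) tuples in the order they appear."""
    return [
        (m.group("name"), int(m.group("code")))
        for m in XFER_ERROR.finditer(stderr_text)
    ]


# First two entries copied from the log above (illustrative subset only).
sample = (
    "<message> <file_xfer_error> "
    "<file_name>hadam3p_saf_10uh_2000_1_006890529_1_6.zip</file_name> "
    "<error_code>-161</error_code> </file_xfer_error> "
    "<file_xfer_error> "
    "<file_name>hadam3p_saf_10uh_2000_1_006890529_1_7.zip</file_name> "
    "<error_code>-161</error_code> </file_xfer_error> </message>"
)

for name, code in extract_xfer_errors(sample):
    print(name, code)
```

Run against the full Stderr field, the same function would be expected to return the seven entries listed there (zips 6 through 12).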
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
28 Jul 2011 21:36:09 | 1069880 | 12794841 | hadam3p_saf_10uh_2000_1_006890529_1 | 57,696 | 145,471 | 2.5213 |
28 Jul 2011 13:36:56 | 1069880 | 12794841 | hadam3p_saf_10uh_2000_1_006890529_1 | 46,176 | 117,056 | 2.5350 |
27 Jul 2011 16:04:32 | 1069880 | 12794841 | hadam3p_saf_10uh_2000_1_006890529_1 | 34,656 | 88,467 | 2.5527 |
07 Jul 2011 15:42:38 | 1069880 | 12794841 | hadam3p_saf_10uh_2000_1_006890529_1 | 23,136 | 61,149 | 2.6430 |
15 May 2011 15:16:08 | 1069880 | 12794841 | hadam3p_saf_10uh_2000_1_006890529_1 | 11,616 | 31,380 | 2.7014 |
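The Average (sec/TS) column in the trickle table is consistent with dividing CPU Time (sec) by Timestep and rounding to four decimal places. The sketch below checks that relationship against the five rows shown; the names `trickles` and `seconds_per_timestep` are hypothetical, and the rounding rule is inferred from the table rather than taken from the project's code.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
trickles = [
    (57696, 145471, 2.5213),
    (46176, 117056, 2.5350),
    (34656, 88467, 2.5527),
    (23136, 61149, 2.6430),
    (11616, 31380, 2.7014),
]


def seconds_per_timestep(timestep, cpu_seconds):
    """Average CPU seconds per model timestep, rounded to four decimals."""
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in trickles:
    computed = seconds_per_timestep(timestep, cpu_seconds)
    # Compare with a small tolerance to avoid floating-point surprises.
    status = "match" if abs(computed - reported) < 1e-9 else "differs"
    print(f"{timestep:>6} TS  {cpu_seconds:>7} s  {computed:.4f} sec/TS  {status}")
```

Each trickle is 11,520 timesteps after the previous one, so the falling average indicates that later timesteps ran slightly faster per step than the earlier ones.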