Name | hadam3p_saf_0ogn_1982_1_006845679_0 |
Workunit | 7048995 |
Created | 18 Nov 2010, 17:28:49 UTC |
Sent | 26 Apr 2011, 9:41:22 UTC |
Report deadline | 7 Apr 2012, 15:01:22 UTC |
Received | 23 Jun 2011, 12:31:35 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 884485 |
Run time | 17 hours 36 min 16 sec |
CPU time | 11 hours 45 min 41 sec |
Validate state | Invalid |
Credit | 188.53 |
Device peak FLOPS | 2.57 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.26</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:25:34 (6692): No heartbeat from core client for 30 sec - exiting 12:25:35 (6692): No heartbeat from core client for 30 sec - exiting 12:25:36 (6692): No heartbeat from core client for 30 sec - exiting 12:25:37 (6692): No heartbeat from core client for 30 sec - exiting 12:25:38 (6692): No heartbeat from core client for 30 sec - exiting 12:25:39 (6692): No heartbeat from core client for 30 sec - exiting 12:25:40 (6692): No heartbeat from core client for 30 sec - exiting 12:25:41 (6692): No heartbeat from core client for 30 sec - exiting 12:25:42 (6692): No heartbeat from core client for 30 sec - exiting 12:25:43 (6692): No heartbeat from core client for 30 sec - exiting 12:25:44 (6692): No heartbeat from core client for 30 sec - exiting 12:25:45 (6692): No heartbeat from core client for 30 sec - exiting 12:25:46 (6692): No heartbeat from core client for 30 sec - exiting 12:25:47 (6692): No heartbeat from core client for 30 sec - exiting 12:25:48 (6692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7284, selfPID=7284, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7060, selfPID=7060, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9076, selfPID=9076, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7888, selfPID=7888, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:32:09 (4928): No heartbeat from core client for 30 sec - exiting 12:32:10 (4928): No heartbeat from core client for 30 sec - exiting 12:32:11 (4928): No heartbeat from core client for 30 sec - exiting 12:32:12 (4928): No heartbeat from core client for 30 sec - exiting 12:32:13 (4928): No heartbeat from core client for 30 sec - exiting 12:32:14 (4928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:00:57 (9112): No heartbeat from core client for 30 sec - exiting 14:00:59 (9112): No heartbeat from core client for 30 sec - exiting 14:01:00 (9112): No heartbeat from core client for 30 sec - exiting 14:01:01 (9112): No heartbeat from core client for 30 sec - exiting 14:01:02 (9112): No heartbeat from core client for 30 sec - exiting 14:01:03 (9112): No heartbeat from core client for 30 sec - exiting 14:01:04 (9112): No heartbeat from core client for 30 sec - exiting 14:01:05 (9112): No heartbeat from core client for 30 sec - exiting 14:01:06 (9112): No heartbeat from core client for 30 sec - exiting 14:01:07 (9112): No heartbeat from core client for 30 sec - exiting 14:01:08 (9112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:10:22 (232): No heartbeat from core client for 30 sec - exiting 20:10:23 (232): No heartbeat from core client for 30 sec - exiting 20:10:24 (232): No heartbeat from core client for 30 sec - exiting 20:10:25 (232): No heartbeat from core client for 30 sec - exiting 20:10:26 (232): No heartbeat from core client for 30 sec - exiting 20:10:27 (232): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9060, selfPID=9060, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:38:41 (5392): No heartbeat from core client for 30 sec - exiting 19:38:42 (5392): No heartbeat from core client for 30 sec - exiting 19:38:43 (5392): No heartbeat from core client for 30 sec - exiting 19:38:44 (5392): No heartbeat from core client for 30 sec - exiting 19:38:45 (5392): No heartbeat from core client for 30 sec - exiting 19:38:46 (5392): No heartbeat from core client for 30 sec - exiting 19:38:47 (5392): No heartbeat from core client for 30 sec - exiting 19:38:48 (5392): No heartbeat from core client for 30 sec - exiting 19:38:49 (5392): No heartbeat from core client for 30 sec - exiting 19:38:50 (5392): No heartbeat from core client for 30 sec - exiting 19:38:51 (5392): No heartbeat from core client for 30 sec - exiting 19:38:52 (5392): No heartbeat from core client for 30 sec - exiting 19:38:53 (5392): No heartbeat from core client for 30 sec - exiting 19:38:54 (5392): No heartbeat from core client for 30 sec - exiting 19:38:55 (5392): No heartbeat from core client for 30 sec - exiting 19:38:56 (5392): No heartbeat from core client for 30 sec - exiting 19:38:57 (5392): No heartbeat from core client for 30 sec - exiting 19:38:58 (5392): No heartbeat from core client for 30 sec - exiting 19:38:59 (5392): No heartbeat from core client for 30 sec - exiting 19:39:00 (5392): No heartbeat from core client for 30 sec - exiting 19:39:01 (5392): No heartbeat from core client for 30 sec - exiting 19:39:02 (5392): No heartbeat from core client for 30 sec - exiting 19:39:03 (5392): No heartbeat from core client for 30 sec - exiting 19:39:04 (5392): No heartbeat from core client for 30 sec - exiting 19:39:05 (5392): No heartbeat from core client for 30 sec - exiting 19:39:06 (5392): No heartbeat from core client for 30 sec - exiting 19:39:07 (5392): No heartbeat from core client for 30 sec - exiting 19:39:08 (5392): No heartbeat from core client for 30 sec - exiting 19:39:09 (5392): No heartbeat from core client for 30 sec - exiting 19:39:10 (5392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:39:11 (5392): No heartbeat from core client for 30 sec - exiting 19:39:12 (5392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:33:20 (2724): No heartbeat from core client for 30 sec - exiting 17:33:21 (2724): No heartbeat from core client for 30 sec - exiting 17:33:22 (2724): No heartbeat from core client for 30 sec - exiting 17:33:23 (2724): No heartbeat from core client for 30 sec - exiting 17:33:24 (2724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:33:25 (2724): No heartbeat from core client for 30 sec - exiting 19:52:42 (8920): No heartbeat from core client for 30 sec - exiting 19:52:43 (8920): No heartbeat from core client for 30 sec - exiting 19:52:44 (8920): No heartbeat from core client for 30 sec - exiting 19:52:45 (8920): No heartbeat from core client for 30 sec - exiting 19:52:46 (8920): No heartbeat from core client for 30 sec - exiting 19:52:47 (8920): No heartbeat from core client for 30 sec - exiting 19:52:48 (8920): No heartbeat from core client for 30 sec - exiting 19:52:49 (8920): No heartbeat from core client for 30 sec - exiting 19:52:50 (8920): No heartbeat from core client for 30 sec - exiting 19:52:51 (8920): No heartbeat from core client for 30 sec - exiting 19:52:52 (8920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9344, selfPID=9344, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9708, selfPID=9708, iMonCtr=2 20:07:49 (11320): No heartbeat from core client for 30 sec - exiting 20:07:50 (11320): No heartbeat from core client for 30 sec - exiting 20:07:51 (11320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5508, selfPID=5508, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:09:03 (8652): No heartbeat from core client for 30 sec - exiting 20:09:04 (8652): No heartbeat from core client for 30 sec - exiting 20:09:05 (8652): No heartbeat from core client for 30 sec - exiting 20:09:06 (8652): No heartbeat from core client for 30 sec - exiting 20:09:07 (8652): No heartbeat from core client for 30 sec - exiting 20:09:08 (8652): No heartbeat from core client for 30 sec - exiting 20:09:09 (8652): No heartbeat from core client for 30 sec - exiting 20:09:10 (8652): No heartbeat from core client for 30 sec - exiting 20:09:11 (8652): No heartbeat from core client for 30 sec - exiting 20:09:12 (8652): No heartbeat from core client for 30 sec - exiting 20:09:13 (8652): No heartbeat from core client for 30 sec - exiting 20:09:14 (8652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:44:49 (6420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:44:51 (6420): No heartbeat from core client for 30 sec - exiting 22:44:52 (6420): No heartbeat from core client for 30 sec - exiting 22:44:53 (6420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7080, selfPID=7080, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4416, selfPID=5560, iMonCtr=1 Model crash detected, will try to restart... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4416, selfPID=4416, iMonCtr=2 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_2.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_0ogn_1982_1_006845679_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
27 May 2011 20:30:16 | 884485 | 12115295 | hadam3p_saf_0ogn_1982_1_006845679_0 | 11,622 | 34,256 | 2.9475 |
27 May 2011 19:44:49 | 884485 | 12115295 | hadam3p_saf_0ogn_1982_1_006845679_0 | 11,616 | 33,782 | 2.9082 |
©2024 cpdn.org