Name | hadam3p_saf_7gcf_2008_1_007624858_2 |
Workunit | 7803177 |
Created | 31 Dec 2011, 2:28:20 UTC |
Sent | 31 Dec 2011, 2:36:45 UTC |
Report deadline | 12 Dec 2012, 7:56:45 UTC |
Received | 2 Jan 2012, 15:11:34 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 945156 |
Run time | 14 hours 7 min 48 sec |
CPU time | 14 hours 7 min 48 sec |
Validate state | Invalid |
Credit | 375.31 |
Device peak FLOPS | 2.87 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6792, iMonCtr=2 Model crash detected, will try to restart... 15:56:53 (4912): No heartbeat from core client for 30 sec - exiting 15:56:54 (4912): No heartbeat from core client for 30 sec - exiting 15:56:55 (4912): No heartbeat from core client for 30 sec - exiting 15:56:56 (4912): No heartbeat from core client for 30 sec - exiting 15:56:57 (4912): No heartbeat from core client for 30 sec - exiting 15:56:58 (4912): No heartbeat from core client for 30 sec - exiting 15:56:59 (4912): No heartbeat from core client for 30 sec - exiting 15:57:00 (4912): No heartbeat from core client for 30 sec - exiting 15:57:01 (4912): No heartbeat from core client for 30 sec - exiting 15:57:02 (4912): No heartbeat from core client for 30 sec - exiting 15:57:03 (4912): No heartbeat from core client for 30 sec - exiting 15:57:04 (4912): No heartbeat from core client for 30 sec - exiting 15:57:05 (4912): No heartbeat from core client for 30 sec - exiting 15:57:06 (4912): No heartbeat from core client for 30 sec - exiting 15:57:07 (4912): No heartbeat from core client for 30 sec - exiting 15:57:08 (4912): No heartbeat from core client for 30 sec - exiting 15:57:09 (4912): No heartbeat from core client for 30 sec - exiting 15:57:10 (4912): No heartbeat from core client for 30 sec - exiting 15:57:11 (4912): No heartbeat from core client for 30 sec - exiting 15:57:12 (4912): No heartbeat from core client for 30 sec - exiting 15:57:13 (4912): No heartbeat from core client for 30 sec - exiting 15:57:14 (4912): No heartbeat from core client for 30 sec - exiting 15:57:15 (4912): No heartbeat from core client for 30 sec - exiting 15:57:16 (4912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:58 (4912): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 23:06:34 (4580): No heartbeat from core client for 30 sec - exiting 23:06:35 (4580): No heartbeat from core client for 30 sec - exiting 23:06:36 (4580): No heartbeat from core client for 30 sec - exiting 23:06:37 (4580): No heartbeat from core client for 30 sec - exiting 23:06:38 (4580): No heartbeat from core client for 30 sec - exiting 23:06:39 (4580): No heartbeat from core client for 30 sec - exiting 23:06:40 (4580): No heartbeat from core client for 30 sec - exiting 23:06:41 (4580): No heartbeat from core client for 30 sec - exiting 23:06:42 (4580): No heartbeat from core client for 30 sec - exiting 23:06:43 (4580): No heartbeat from core client for 30 sec - exiting 23:06:44 (4580): No heartbeat from core client for 30 sec - exiting 23:06:45 (4580): No heartbeat from core client for 30 sec - exiting 23:06:46 (4580): No heartbeat from core client for 30 sec - exiting 23:06:47 (4580): No heartbeat from core client for 30 sec - exiting 23:06:48 (4580): No heartbeat from core client for 30 sec - exiting 23:06:49 (4580): No heartbeat from core client for 30 sec - exiting 23:06:50 (4580): No heartbeat from core client for 30 sec - exiting 23:06:51 (4580): No heartbeat from core client for 30 sec - exiting 23:06:52 (4580): No heartbeat from core client for 30 sec - exiting 23:06:53 (4580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:07:56 (4580): No heartbeat from core client for 30 sec - exiting 23:07:57 (4580): No heartbeat from core client for 30 sec - exiting 23:07:58 (4580): No heartbeat from core client for 30 sec - exiting Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 forrtl: Access is denied. forrtl: severe (38): error during write, unit 8, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_saf_7gcf_2008_1_007624858\tmp\xaakm.pipe_dummy Image PC Routine Line Source hadam3p_saf_um_6. 00C6A39A Unknown Unknown Unknown hadam3p_saf_um_6. 00C12CD0 Unknown Unknown Unknown hadam3p_saf_um_6. 00C11E9A Unknown Unknown Unknown hadam3p_saf_um_6. 00BEAA9D Unknown Unknown Unknown hadam3p_saf_um_6. 00B8F27C Unknown Unknown Unknown hadam3p_saf_um_6. 00909BD2 Unknown Unknown Unknown hadam3p_saf_um_6. 00C4E638 Unknown Unknown Unknown kernel32.dll 76A4E8DE Unknown Unknown Unknown ntdll.dll 7728D4BD Unknown Unknown Unknown ntdll.dll 7728D6CF Unknown Unknown Unknown Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3856, selfPID=3856, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3856, selfPID=4960, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4228, iMonCtr=2 Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_saf_7gcf_2008_1_007624858_2_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_7gcf_2008_1_007624858_2_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_7gcf_2008_1_007624858_2_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_7gcf_2008_1_007624858_2_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_7gcf_2008_1_007624858_2_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_7gcf_2008_1_007624858_2_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_7gcf_2008_1_007624858_2_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_7gcf_2008_1_007624858_2_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_7gcf_2008_1_007624858_2_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_7gcf_2008_1_007624858_2_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
01 Jan 2012 21:21:57 | 945156 | 13843048 | hadam3p_saf_7gcf_2008_1_007624858_2 | 23,136 | 43,949 | 1.8996 |
01 Jan 2012 15:23:51 | 945156 | 13843048 | hadam3p_saf_7gcf_2008_1_007624858_2 | 11,616 | 22,016 | 1.8953 |
©2024 cpdn.org