Name | hadam3p_eu_2j18_2000_1_007291444_0 |
Workunit | 7488648 |
Created | 14 Jun 2011, 15:21:21 UTC |
Sent | 14 Jun 2011, 15:21:26 UTC |
Report deadline | 26 May 2012, 20:41:26 UTC |
Received | 21 Jun 2011, 22:00:38 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 957844 |
Run time | |
CPU time | 23 hours 18 min 43 sec |
Validate state | Invalid |
Credit | 597.84 |
Device peak FLOPS | 2.95 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.4.7</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7772, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7644, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3388, selfPID=2588, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 18:14:28 (1516): No heartbeat from core client for 30 sec - exiting 18:14:29 (1516): No heartbeat from core client for 30 sec - exiting 18:14:30 (1516): No heartbeat from core client for 30 sec - exiting 18:14:31 (1516): No heartbeat from core client for 30 sec - exiting 18:14:32 (1516): No heartbeat from core client for 30 sec - exiting 18:14:33 (1516): No heartbeat from core client for 30 sec - exiting 18:14:34 (1516): No heartbeat from core client for 30 sec - exiting 18:14:35 (1516): No heartbeat from core client for 30 sec - exiting 18:14:36 (1516): No heartbeat from core client for 30 sec - exiting 18:14:37 (1516): No heartbeat from core client for 30 sec - exiting 18:14:38 (1516): No heartbeat from core client for 30 sec - exiting 18:14:39 (1516): No heartbeat from core client for 30 sec - exiting 18:14:40 (1516): No heartbeat from core client for 30 sec - exiting 18:14:41 (1516): No heartbeat from core client for 30 sec - exiting 18:14:42 (1516): No heartbeat from core client for 30 sec - exiting 18:14:43 (1516): No heartbeat from core client for 30 sec - exiting 18:14:44 (1516): No heartbeat from core client for 30 sec - exiting 18:14:45 (1516): No heartbeat from core client for 30 sec - exiting 18:14:46 (1516): No heartbeat from core client for 30 sec - exiting 18:14:47 (1516): No heartbeat from core client for 30 sec - exiting 18:14:48 (1516): No heartbeat from core client for 30 sec - exiting 18:14:49 (1516): No heartbeat from core client for 30 sec - exiting 18:14:50 (1516): No heartbeat from core client for 30 sec - exiting 18:14:51 (1516): No heartbeat from core client for 30 sec - exiting 18:14:52 (1516): No heartbeat from core client for 30 sec - exiting 18:14:53 (1516): No heartbeat from core client for 30 sec - exiting 18:14:54 (1516): No heartbeat from core client for 30 sec - exiting 18:14:55 (1516): No heartbeat from core client for 30 sec - exiting 18:14:56 (1516): No heartbeat from core client for 30 sec - exiting 18:14:57 (1516): No heartbeat from core client for 30 sec - exiting 18:14:58 (1516): No heartbeat from core client for 30 sec - exiting 18:14:59 (1516): No heartbeat from core client for 30 sec - exiting 18:15:00 (1516): No heartbeat from core client for 30 sec - exiting 18:15:01 (1516): No heartbeat from core client for 30 sec - exiting 18:15:02 (1516): No heartbeat from core client for 30 sec - exiting 18:15:03 (1516): No heartbeat from core client for 30 sec - exiting 18:15:04 (1516): No heartbeat from core client for 30 sec - exiting 18:15:05 (1516): No heartbeat from core client for 30 sec - exiting 18:15:06 (1516): No heartbeat from core client for 30 sec - exiting 18:15:07 (1516): No heartbeat from core client for 30 sec - exiting 18:15:08 (1516): No heartbeat from core client for 30 sec - exiting 18:15:09 (1516): No heartbeat from core client for 30 sec - exiting 18:15:10 (1516): No heartbeat from core client for 30 sec - exiting 18:15:11 (1516): No heartbeat from core client for 30 sec - exiting 18:15:12 (1516): No heartbeat from core client for 30 sec - exiting 18:15:13 (1516): No heartbeat from core client for 30 sec - exiting 18:15:14 (1516): No heartbeat from core client for 30 sec - exiting 18:15:15 (1516): No heartbeat from core client for 30 sec - exiting 18:15:16 (1516): No heartbeat from core client for 30 sec - exiting 18:15:17 (1516): No heartbeat from core client for 30 sec - exiting 18:15:18 (1516): No heartbeat from core client for 30 sec - exiting 18:15:19 (1516): No heartbeat from core client for 30 sec - exiting 18:15:20 (1516): No heartbeat from core client for 30 sec - exiting 18:15:21 (1516): No heartbeat from core client for 30 sec - exiting 18:15:22 (1516): No heartbeat from core client for 30 sec - exiting 18:15:23 (1516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:15:25 (1516): No heartbeat from core client for 30 sec - exiting 23:53:48 (3400): No heartbeat from core client for 30 sec - exiting 23:53:49 (3400): No heartbeat from core client for 30 sec - exiting 23:53:50 (3400): No heartbeat from core client for 30 sec - exiting 23:53:51 (3400): No heartbeat from core client for 30 sec - exiting 23:53:52 (3400): No heartbeat from core client for 30 sec - exiting 23:53:53 (3400): No heartbeat from core client for 30 sec - exiting 23:53:54 (3400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:53:55 (3400): No heartbeat from core client for 30 sec - exiting Model crashed: Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_2j18_2000_1_007291444_0_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2j18_2000_1_007291444_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2j18_2000_1_007291444_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2j18_2000_1_007291444_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2j18_2000_1_007291444_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2j18_2000_1_007291444_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2j18_2000_1_007291444_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2j18_2000_1_007291444_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2j18_2000_1_007291444_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
21 Jun 2011 09:19:36 | 957844 | 12975247 | hadam3p_eu_2j18_2000_1_007291444_0 | 34,656 | 65,847 | 1.9000 |
21 Jun 2011 03:01:54 | 957844 | 12975247 | hadam3p_eu_2j18_2000_1_007291444_0 | 23,136 | 44,615 | 1.9284 |
20 Jun 2011 20:37:11 | 957844 | 12975247 | hadam3p_eu_2j18_2000_1_007291444_0 | 11,616 | 23,006 | 1.9805 |
©2024 cpdn.org