Task 12213090

Name hadam3p_saf_1bci_1977_1_006932938_0
Workunit 7136254
Created 22 Nov 2010, 12:48:37 UTC
Sent 13 Mar 2011, 0:34:36 UTC
Report deadline 23 Feb 2012, 5:54:36 UTC
Received 14 Mar 2011, 10:50:44 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1026415
Run time 20 hours 19 min 21 sec
CPU time 19 hours 2 min 11 sec
Validate state Invalid
Credit 562.19
Device peak FLOPS 3.29 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
20:13:52 (7988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5020, selfPID=5020, iMonCtr=2
20:23:19 (4000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6588, selfPID=6588, iMonCtr=2
20:24:45 (7148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:30:19 (5768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:30:19 (4020): Can't acquire lockfile (32) - waiting 35s
20:35:28 (4020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:35:29 (4020): No heartbeat from core client for 30 sec - exiting
20:35:30 (4020): No heartbeat from core client for 30 sec - exiting
20:35:31 (4020): No heartbeat from core client for 30 sec - exiting
20:35:32 (4020): No heartbeat from core client for 30 sec - exiting
20:35:33 (4020): No heartbeat from core client for 30 sec - exiting
20:35:34 (4020): No heartbeat from core client for 30 sec - exiting
20:35:35 (4020): No heartbeat from core client for 30 sec - exiting
20:35:36 (4020): No heartbeat from core client for 30 sec - exiting
20:35:37 (4020): No heartbeat from core client for 30 sec - exiting
20:35:38 (4020): No heartbeat from core client for 30 sec - exiting
20:36:24 (4680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:37:34 (4444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:42:42 (3696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:44:36 (3768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:44:37 (7904): Can't acquire lockfile (32) - waiting 35s
20:46:17 (7904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7656, selfPID=7656, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8612, selfPID=8612, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3272, selfPID=7740, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
06:46:29 (7740): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_1bci_1977_1_006932938_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1bci_1977_1_006932938_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1bci_1977_1_006932938_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1bci_1977_1_006932938_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1bci_1977_1_006932938_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1bci_1977_1_006932938_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1bci_1977_1_006932938_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1bci_1977_1_006932938_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1bci_1977_1_006932938_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
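The stderr output above repeats a small number of message types, so a short script can tally them. The following is a minimal sketch, not part of the original report: it assumes timestamped client lines have the form `HH:MM:SS (PID): message`, as they do above, and the names `SAMPLE`, `LINE_RE`, `XFER_RE`, and `summarize` are hypothetical. The sample text is a handful of lines copied from the log, not the full output.

```python
import re
from collections import Counter

# A few lines copied from the stderr output above (not the full log).
SAMPLE = """\
20:13:52 (7988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:30:19 (4020): Can't acquire lockfile (32) - waiting 35s
20:35:28 (4020): No heartbeat from core client for 30 sec - exiting
<file_xfer_error>
  <file_name>hadam3p_saf_1bci_1977_1_006932938_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
"""

# Assumed line shape: "HH:MM:SS (PID): message".
LINE_RE = re.compile(r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): (.*)$")
# A file name followed by its error code inside a <file_xfer_error> element.
XFER_RE = re.compile(
    r"<file_name>(.*?)</file_name>\s*<error_code>(-?\d+)</error_code>"
)


def summarize(text):
    """Return (heartbeat-loss count per PID, list of (file name, error code))."""
    per_pid = Counter()
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match and match.group(3).startswith("No heartbeat"):
            per_pid[int(match.group(2))] += 1
    xfer = [(name, int(code)) for name, code in XFER_RE.findall(text)]
    return per_pid, xfer


heartbeats, xfer_errors = summarize(SAMPLE)
print(dict(heartbeats))
print(xfer_errors)
```

Run against the full log, the same function would show which process IDs logged the heartbeat message most often and list every upload file that failed with its error code.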
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
14 Mar 2011 06:10:15 | 1026415 | 12213090 | hadam3p_saf_1bci_1977_1_006932938_0 | 34,656 | 55,097 | 1.5898
14 Mar 2011 00:48:01 | 1026415 | 12213090 | hadam3p_saf_1bci_1977_1_006932938_0 | 23,136 | 37,290 | 1.6118
13 Mar 2011 23:12:32 | 1026415 | 12213090 | hadam3p_saf_1bci_1977_1_006932938_0 | 11,616 | 18,698 | 1.6097
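
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. That is an inference from the three rows shown, not something the page states. A minimal sketch checking it, with the hypothetical names `TRICKLES` and `sec_per_timestep`:

```python
# (timestep, CPU time in seconds, reported average) copied from the table above.
TRICKLES = [
    (34656, 55097, 1.5898),
    (23136, 37290, 1.6118),
    (11616, 18698, 1.6097),
]


def sec_per_timestep(timestep, cpu_seconds):
    """CPU seconds per model timestep, rounded to 4 places like the table."""
    return round(cpu_seconds / timestep, 4)


# Print the recomputed value next to the reported one for each trickle.
for timestep, cpu_seconds, reported in TRICKLES:
    print(timestep, sec_per_timestep(timestep, cpu_seconds), reported)
```

For all three trickles the recomputed value agrees with the reported average to four decimal places.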

