climateprediction.net home page
Task 17544395

Task 17544395

Name hadam3p_pnw_ule7_1990_1_009273444_0
Workunit 9366360
Created 4 Dec 2014, 10:42:06 UTC
Sent 9 Dec 2014, 17:23:29 UTC
Report deadline 21 Nov 2015, 22:43:29 UTC
Received 3 Feb 2015, 22:53:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 952218
Run time 3 days 8 hours 50 min 6 sec
CPU time 3 days 8 hours 50 min 6 sec
Validate state Invalid
Credit 1,007.76
Device peak FLOPS 1.37 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v7.22
windows_intelx86
Stderr
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
14:43:16 (3312): No heartbeat from client for 30 sec - exiting
14:43:27 (3312): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:44:18 (3312): No heartbeat from client for 30 sec - exiting
14:44:52 (3312): timer handler: client dead, exiting
14:45:48 (3312): No heartbeat from client for 30 sec - exiting
14:45:53 (3312): timer handler: client dead, exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
RegCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
RegionaCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
RegCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:35:14 (2036): No heartbeat from client for 30 sec - exiting


Unhandled Exception Detected...

Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=744, selfPID=2112, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=744, selfPID=744, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3184, selfPID=3184, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3184, selfPID=0, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3460, selfPID=3460, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3156, selfPID=3156, iMonCtr=2
Signal 11 received, exiting...
02:09:30 (1932): called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2872, selfPID=2872, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2872, selfPID=3896, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
02:10:27 (3896): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_ule7_1990_1_009273444_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ule7_1990_1_009273444_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ule7_1990_1_009273444_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ule7_1990_1_009273444_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ule7_1990_1_009273444_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ule7_1990_1_009273444_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ule7_1990_1_009273444_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ule7_1990_1_009273444_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Jan 2015 10:01:38 952218 17544395 hadam3p_pnw_ule7_1990_1_009273444_0 46,379 281,201 6.0631
23 Jan 2015 08:11:16 952218 17544395 hadam3p_pnw_ule7_1990_1_009273444_0 34,859 211,415 6.0649
21 Dec 2014 23:18:49 952218 17544395 hadam3p_pnw_ule7_1990_1_009273444_0 23,339 141,989 6.0838
20 Dec 2014 06:38:11 952218 17544395 hadam3p_pnw_ule7_1990_1_009273444_0 11,819 72,654 6.1472


©2024 cpdn.org