climateprediction.net home page
Task 14906196

Task 14906196

Name hadam3p_pnw_znr6_1977_1_006999706_1
Workunit 7203022
Created 12 Jul 2012, 10:13:47 UTC
Sent 14 Jul 2012, 23:35:01 UTC
Report deadline 27 Jun 2013, 4:55:01 UTC
Received 15 Aug 2012, 19:41:43 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1186987
Run time 3 days 6 hours 24 min 35 sec
CPU time 2 days 19 hours 9 min 38 sec
Validate state Invalid
Credit 2,254.93
Device peak FLOPS 3.15 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6112, selfPID=6112, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=84540, selfPID=84540, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is nCPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5368, selfPID=5368, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7092, selfPID=8188, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4652, selfPID=3300, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7432, selfPID=6592, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7112, selfPID=6456, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7552, selfPID=7100, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=2
05:21:25 (4896): No heartbeat from core client for 30 sec - exiting
05:21:26 (4896): No heartbeat from core client for 30 sec - exiting
05:21:28 (4896): No heartbeat from core client for 30 sec - exiting
05:21:29 (4896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6516, iMonCtr=2
Model crash detected, will try to restart...
18:36:33 (2692): No heartbeat from core client for 30 sec - exiting
18:36:34 (2692): No heartbeat from core client for 30 sec - exiting
18:36:36 (2692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3672, selfPID=3264, iMonCtr=1
Model crash detected, will try to restart...
CSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3212, selfPID=3764, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 9
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_znr6_1977_1_006999706_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_znr6_1977_1_006999706_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_znr6_1977_1_006999706_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Aug 2012 20:42:15 1186987 14906196 hadam3p_pnw_znr6_1977_1_006999706_1 103,776 239,744 2.3102
11 Aug 2012 00:45:23 1186987 14906196 hadam3p_pnw_znr6_1977_1_006999706_1 92,256 213,160 2.3105
05 Aug 2012 11:16:56 1186987 14906196 hadam3p_pnw_znr6_1977_1_006999706_1 80,736 187,501 2.3224
05 Aug 2012 00:40:18 1186987 14906196 hadam3p_pnw_znr6_1977_1_006999706_1 69,216 159,873 2.3098
31 Jul 2012 21:02:22 1186987 14906196 hadam3p_pnw_znr6_1977_1_006999706_1 57,696 133,253 2.3096
29 Jul 2012 07:59:13 1186987 14906196 hadam3p_pnw_znr6_1977_1_006999706_1 46,176 106,347 2.3031
22 Jul 2012 18:36:24 1186987 14906196 hadam3p_pnw_znr6_1977_1_006999706_1 34,656 79,892 2.3053
22 Jul 2012 05:31:16 1186987 14906196 hadam3p_pnw_znr6_1977_1_006999706_1 23,136 53,475 2.3113
16 Jul 2012 07:21:12 1186987 14906196 hadam3p_pnw_znr6_1977_1_006999706_1 11,616 27,000 2.3244


©2024 cpdn.org