climateprediction.net home page
Task 19584032

Task 19584032

Name wah2_sas50_fq2k_201412_13_367_010380566_3
Workunit 10380566
Created 5 May 2016, 9:33:42 UTC
Sent 18 Jul 2016, 14:37:50 UTC
Report deadline 30 Jun 2017, 19:57:50 UTC
Received 4 Aug 2016, 20:33:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1392614
Run time 4 days 6 hours 6 min 34 sec
CPU time
Validate state Invalid
Credit 3,059.47
Device peak FLOPS 4.95 GFLOPS
Application version Weather At Home 2 (wah2) (region independent) v8.12
windows_intelx86
Peak working set size 235.42 MB
Peak swap size 199.55 MB
Peak disk usage 73.51 MB
Stderr
<core_client_version>7.6.22</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1460, selfPID=3400, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5808, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4956, selfPID=4432, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
19:53:57 (4432): called boinc_finish(0)
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6428, selfPID=5576, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
20:25:19 (5576): called boinc_finish(0)
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6956, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
22:46:09 (3616): called boinc_finish(0)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6868, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7068, selfPID=5612, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6220, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
19:24:59 (3792): called boinc_finish(0)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6264, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6568, selfPID=1288, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
10:16:38 (1288): called boinc_finish(0)
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6844, iMonCtr=2
Leaving CPDN_Main::Monitor...
20:11:26 (5044): called boinc_finish(0)
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5628, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6620, selfPID=5488, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2836, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5908, selfPID=3532, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
18:42:07 (3532): called boinc_finish(0)
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6044, selfPID=5428, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/wah2_sas50_fq2k_201412_13_367_010380566/dataout/atmos_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
07:36:38 (2952): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_sas50_fq2k_201412_13_367_010380566_3_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_fq2k_201412_13_367_010380566_3_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_fq2k_201412_13_367_010380566_3_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_fq2k_201412_13_367_010380566_3_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_fq2k_201412_13_367_010380566_3_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_fq2k_201412_13_367_010380566_3_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_fq2k_201412_13_367_010380566_3_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_fq2k_201412_13_367_010380566_3_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_fq2k_201412_13_367_010380566_3_13.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_sas50_fq2k_201412_13_367_010380566_3_14.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Jul 2016 12:50:46 1392614 19584032 wah2_sas50_fq2k_201412_13_367_010380566_3 46,379 119,786 2.5828
23 Jul 2016 17:22:17 1392614 19584032 wah2_sas50_fq2k_201412_13_367_010380566_3 34,859 86,404 2.4787
20 Jul 2016 09:22:15 1392614 19584032 wah2_sas50_fq2k_201412_13_367_010380566_3 23,339 57,062 2.4449
19 Jul 2016 09:42:55 1392614 19584032 wah2_sas50_fq2k_201412_13_367_010380566_3 11,819 28,831 2.4394


©2024 cpdn.org