climateprediction.net home page
Task 21563288

Task 21563288

Name wah2_eu25_n3y5_201712_13_801_011787043_0
Workunit 11787043
Created 14 Mar 2019, 10:18:16 UTC
Sent 17 Mar 2019, 8:31:33 UTC
Report deadline 27 Feb 2020, 13:51:33 UTC
Received 22 Mar 2019, 9:53:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1273646
Run time 2 days 14 hours 57 min 32 sec
CPU time 2 days 12 hours 46 min 25 sec
Validate state Invalid
Credit 3,059.47
Device peak FLOPS 4.09 GFLOPS
Application version Weather At Home 2 (wah2) v8.24
windows_intelx86
Peak working set size 351.80 MB
Peak swap size 279.38 MB
Peak disk usage 0.02 MB
Stderr
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6420, selfPID=6420, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8208, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
21:40:27 (816): BOINC client no longer exists - exiting
21:40:27 (816): timer handler: client dead, exiting
21:40:37 (816): BOINC client no longer exists - exiting
21:40:37 (816): timer handler: client dead, exiting
21:40:47 (816): BOINC client no longer exists - exiting
21:40:47 (816): timer handler: client dead, exiting
21:40:57 (816): BOINC client no longer exists - exiting
21:40:57 (816): timer handler: client dead, exiting
21:41:07 (816): BOINC client no longer exists - exiting
21:41:07 (816): timer handler: client dead, exiting
21:41:17 (816): BOINC client no longer exists - exiting
21:41:17 (816): timer handler: client dead, exiting
21:41:27 (816): BOINC client no longer exists - exiting
21:41:27 (816): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:41:37 (816)CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7692, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7636, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8744, selfPID=7532, iMonCtr=1
Model crash detected, will try to restart...
17:10:52 (7188): BOINC client no longer exists - exiting
17:10:52 (7188): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:10:52 (11088): Can't acquire lockfile (32) - waiting 35s
17:11:02 (7188): BOINC client no longer exists - exiting
17:11:17 (7188): timer handler: client dead, exiting
17:11:27 (7188)::11:27 (11088): Can't acquire lockfile (32) - exiting
17:11:28 (11088): Error: The process cannot access the file because it is being used by another process.

 (0x20)
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7952, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5936, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7604, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2252, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9492, iMonCtr=2
Model crash detected, will try to restart...
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9868, selfPID=4636, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8516, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9068, iMonCtr=2
Leaving CPDN_ain::Monitor...
cpdnmonitor: cannot open input file D:\User\BOINC/projects/climateprediction.net/wah2_eu25_n3y5_201712_13_801_011787043/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file D:\User\BOINC/projects/climateprediction.net/wah2_eu25_n3y5_201712_13_801_011787043/dataout/region_restart.day after 11 attempts

Model crash
: Model crashed: f file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xadae.pipe_dummy                                                             
              2048    
Leaving CPDN_ain::Monitor...
08:52:45 (10220): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_5.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_6.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_7.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_eu25_n3y5_201712_13_801_011787043_0_r1876079580_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Mar 2019 20:14:33 1273646 21563288 wah2_eu25_n3y5_201712_13_801_011787043_0 46,379 206,067 4.4431
20 Mar 2019 23:00:19 1273646 21563288 wah2_eu25_n3y5_201712_13_801_011787043_0 34,859 157,180 4.5090
20 Mar 2019 08:54:32 1273646 21563288 wah2_eu25_n3y5_201712_13_801_011787043_0 23,339 108,882 4.6652
19 Mar 2019 09:09:18 1273646 21563288 wah2_eu25_n3y5_201712_13_801_011787043_0 11,819 57,901 4.8990


©2024 cpdn.org